How are extension methods compiled? - c#

How the C# compiler implement extension methods?

The process is exactly the same as overload resolution:
Func(myObject);
The compiler checks all functions named "Func" and tries to match the static type of myObject to the parametrs (possibly using conversions, upcasting to base class). If it succeeds, then calls the appropriate function.
If you realize that you can call extensions methods "in a normal way", then it clears up:
static class MyExtensions
{
public static void MyFunc(this string arg)
{
// ...
}
}
string a = "aa";
MyExtensions.MyFunc(a); // OK
a.MyFunc(); // same as above, but nicer
For the given type (here string), the compiler just looks for all static functions with "this" modifier on the first argument and tries to match the static type on the left of the . (in this example "a") with the parameter type in the function.

Instance methods of a class have a hidden argument. An example:
class Example {
public void Foo(int arg) {}
}
actually looks like this when the JIT compiler is done with it, converted back to C# syntax:
static void Foo(Example this, int arg) {}
That hidden argument is the reason that you can use this in an instance method. The JIT compiler figures out the argument to pass from the object reference you provide to call the Foo method.
As you can tell, it is now a very short hop to an extension method.

The compiler first looks in the base class for a function matching the signature of the function. If it can't find it than it looks for an extension. If an extension has the same signature as a base class method than the base class method is called instead.
This might help:
Extension Methods

Related

How does extension method works [duplicate]

This question already has answers here:
"this" in function parameter
(5 answers)
Closed 9 years ago.
I want to understand how extension method works?Can we define extension methods in non static classes?
*
Why do we put extension methods inside static class?
*
According to MSDN,
**Their first parameter specifies which type the method operates on, and the parameter is preceded by the this modifier. Extension methods
are only in scope when you explicitly import the namespace into your
source code with a using directive.
**
What is the role of this operator here and how does it associates that extension method to that argument?
No, you can't define an extension method on a class that is not static.
The this is syntactic sugar that allows to call your static extension method on an instance. But at the end of the day, an extension method is nothing more than a static method in a static class.
So basically:
var test = myInstance.MyExtensionMethod();
is the same as
var test = MyExtensionClass.MyExtensionMethod(myInstance);
They are 4 requirements for method to be an extension method:
It has to be declared in static class
It has to be static (which actually is always true if the first one is met)
It has to be public
It has to have first parameter marked with this keyword
So you can't define extension method in non-static class.
Whole Extension Method functionality is some kind of syntax sugar. Following extension method declared on MyClass:
// The following extension methods can be accessed by instances of any
// class that is or inherits MyClass.
public static class Extension
{
public static void MethodA(this MyClass myInterface, int i)
{
Console.WriteLine
("Extension.MethodA(this IMyInterface myInterface, int i)");
}
}
can be called in two ways:
var myClassObject = new MyClass();
Extension.MethodA(myClassObject);
Or
myClassObject.MethodA();
However, the second one will be transformed into the first one by compiler anyway.
What is the role of this operator here and how does it associates that
extension method to that argument?
In this context this is not an operator, it is a modifier. It could have been called something else, it has no relation to this object which refers to the current object within a normal method call.
The role of this modifier is to tell the compiler that this is actually an extension method and not a standard static method, so that it will not complain when you call it in a way which looks like an instance method call, although it is not.
No, extension methods have to be in a static class, that's just the rule. It could have been possible to allow extension methods to be defined anywhere, but to make it easier to find them they are not allowed to be buried inside classes with a lot of other code.
The this keyword is used on the first parameter of an extension method to specify that it is an extension method.
(The internal implementation of a regular method also has a reference to the object as a first parameter, so what the compiler does with extension methods is just to add them to the other methods in the class.)

How do extension methods work with inheritance?

I knew the first half part of the magic. Assume I have:
public class Foo {}
public class static FooExt
{
public static void M(this Foo f) {}
}
When I invoke foo.M() the compiler changes it to FooExt.M(foo).
But how about the inheritance? For example:
public class Bar : Foo {}
public class static BarExt
{
public static void M(this Bar b) {}
}
When I invoke bar.M() will it call FooExt.M() or BarExt.M()? In fact I tested it and the answer is BarExt, but why? What happens when I call wow.M() if I have another Wow : Foo but no WowExt.M()?
The compiler will look for the extension method whose arguments most closely match those at the invocation site. If you have a variable of type Bar then the BarExt extension would be used, since Bar is more specific a type than Foo, and therefore matches more closely than the alternative method that takes a Foo instance. This really is not very different from how ambiguous method overloads are resolved.
It is worth noting that this code will call FooExt.M():
Foo bar = new Bar();
bar.M();
This is because extension methods are not virtual. Since the variable you are invoking the method on is type Foo, the Bar version of the extension method will not even be considered. Binding of extension methods happens entirely at compile time.
In the second case you indicated (invoking the extension method on a variable of type Wow), the FooExt.M() method would be used, since there is nothing that matches better.
I tested it and the answer is BarExt, but why?
Because the compiler will choose the extension method that "best fits" the call. Since there is a method that takes a Bar directly, that one is chosen.
What happens when I call wow.M() if I have another Wow : Foo but no WowExt.M()?
Again, it will choose the extension that "best fits" the usage. Since there is no method that takes a Wow, but one that does take it's parent class Foo it will choose FooExt.M()
Remember that extension methods are really just syntactic sugar for static method calls, so the same rules apply to them as "normal" method resolution.
When resolving extension methods, the most specific type match will be selected. In your first case, BarExt.M() is the most specific (targets Bar rather than Foo).
In the second case, there is no Wow extension so the most specific match would be FooExt.M().

Is there a reason why extension methods can't be invoked directly on "this"? [duplicate]

Can someone explain to me why in the following the 3rd invocation of DoSomething is invalid?
( Error message is "The name 'DoSomething' does not exist in the current context" )
public class A { }
public class B : A
{
public void WhyNotDirect()
{
var a = new A();
a.DoSomething(); // OK
this.DoSomething(); // OK
DoSomething(); // ?? Why Not
}
}
public static class A_Ext
{
public static void DoSomething(this A a)
{
Console.WriteLine("OK");
}
}
Extension methods can be invoked like other static methods.
Change it to A_Ext.DoSomething(this).
If you're asking why it isn't implicitly invoked on this, the answer is that that's the way the spec was written. I would assume that the reason is that calling it without a qualifier would be too misleading.
Because DoSomething takes a parameter.
DoSomething(a) would be legal.
Edit
I read the question a bit wrong here.
Since your calling it a a normal static method, and not a extension method, you need to prefic with the class name.
So A_Ext.DoSomething(a); will work.
If you call it like a normal static method, all the same rules apply.
Your second variant works because B inhetits A, and therefore you still end up calling it as an extension method, but the third does not.
sorry about the first version above that does not work. I'll leave it to keep the comment relevant.
Extension methods are still static methods, not true instance calls. In order for this to work you would need specific context using instance method syntax (from Extension Methods (C# Programming Guide))
In your code you invoke the extension
method with instance method syntax.
However, the intermediate language
(IL) generated by the compiler
translates your code into a call on
the static method. Therefore, the
principle of encapsulation is not
really being violated. In fact,
extension methods cannot access
private variables in the type they are
extending.
So while normally, both syntaxes would work, the second is without explicit context, and it would seem that the IL generated can't obtain the context implicitly.
DoSomething requires an instance of A to do anything, and without a qualifier, the compiler can't see which DoSomething you need to invoke. It doesn't know to check in A_Ext for your method unless you qualify it with this.

Why do I get this compile error trying to call a base constructor/method that takes a dynamic argument?

While refactoring some code, I came across this strange compile error:
The constructor call needs to be dynamically dispatched, but cannot be because it is part of a constructor initializer. Consider casting the dynamic arguments.
It seems to occur when trying to call base methods/constructors that take dynamic arguments. For example:
class ClassA
{
public ClassA(dynamic test)
{
Console.WriteLine("ClassA");
}
}
class ClassB : ClassA
{
public ClassB(dynamic test)
: base(test)
{
Console.WriteLine("ClassB");
}
}
It works if I cast the argument to object, like this:
public ClassB(dynamic test)
: base((object)test)
So, I'm a little confused. Why do I have to put this nasty cast in - why can't the compiler figure out what I mean?
The constructor chain has to be determined for certain at compile-time - the compiler has to pick an overload so that it can create valid IL. Whereas normally overload resolution (e.g. for method calls) can be deferred until execution time, that doesn't work for chained constructor calls.
EDIT: In "normal" C# code (before C# 4, basically), all overload resolution is performed at compile-time. However, when a member invocation involves a dynamic value, that is resolved at execution time. For example consider this:
using System;
class Program
{
static void Foo(int x)
{
Console.WriteLine("int!");
}
static void Foo(string x)
{
Console.WriteLine("string!");
}
static void Main(string[] args)
{
dynamic d = 10;
Foo(d);
}
}
The compiler doesn't emit a direct call to Foo here - it can't, because in the call Foo(d) it doesn't know which overload it would resolve to. Instead it emits code which does a sort of "just in time" mini-compilation to resolve the overload with the actual type of the value of d at execution time.
Now that doesn't work for constructor chaining, as valid IL has to contain a call to a specific base class constructor. (I don't know whether the dynamic version can't even be expressed in IL, or whether it can, but the result would be unverifiable.)
You could argue that the C# compiler should be able to tell that there's only actually one visible constructor which can be called, and that constructor will always be available... but once you start down that road, you end up with a language which is very complicated to specify. The C# designers usually take the position of having simpler rules which occasionally aren't as powerful as you'd like them to be.

Why does C# compiler overload resolution algorithm treat static and instance members with equal signature as equal?

Let we have two members equal by signature, but one is static and another - is not:
class Foo
{
public void Test() { Console.WriteLine("instance"); }
public static void Test() { Console.WriteLine("static"); }
}
but such code generate brings a compiler error:
Type 'Foo' already defines a member called 'Test' with the same parameter types
But why?
Let we compiled that successfully, then:
Foo.Test() should output "static"
new Foo().Test();should output "instance"
Can't call the static member instead of instance one because in this case another, more reasonable compiler error will occur:
Member 'Foo.Test()' cannot be accessed with an instance reference; qualify it with a type name instead
What about, from an instance method:
Test();
What would that call? You'd probably want to give the instance method "priority" over the static method, but both would be applicable.
I would say that even if it were allowed, it would be a fundamentally bad idea to do this from a readability point of view... for example, if you changed a method which called Test from being static to instance, it would change the meaning in a subtle way.
In other words, I have no problem with this being prohibited :)

Categories