Gene Msil_2234 details

Gene Information

Locus tag: Msil_2234
Symbol:
ID: 7091356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2418723
End bp: 2420051
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 62%
IMG OID: 643465555
Product: sodium:dicarboxylate symporter
Protein accession: YP_002362530
Protein GI: 217978383
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
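
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither convention is stated in the record). The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2418723
END_BP = 2420051


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1329 442
```

Under these assumptions the computed values agree with the listed Gene Length (1329 bp) and Protein Length (442 aa).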


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.709095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGCGG TATCGGCAGG CGCGGAGCCG AGCGCTCCGG CGAAACCGTT CTACAAGGTC 
CTCTATGTGC AGGTCCTGTT CGGCATCCTG GTCGGCGCCC TGTTTGGTTG GCTTTGGCCG
GAATATGCGA CCGCGCCCTG GGTGAAGGCG CTCGGCGACG GCTTCATCAA GCTGATCAAG
ATGCTGATCG CGCCGATCAT TTTCTGCACC GTCGTCGCCG GCATCGCCCA TGTCTCGGAC
GCCAAGAAGG TGGGCCGCGT CGCCGTCAAG GCGTTGATCT ATTTCGAGAT CGTCTCGACC
TTCGCGCTCG GCTTCGGACT GCTCATGGGC AATGTCGTGC GGCCCGGGGC GGGATTTTCG
GGCAGCCATG GCGACGCGGC CGCGGCGATC GCCTTCGAGA AGCAGGGGGA GGGACATTCG
ACGGTCGACT TCCTGCTCGG GATCATTCCC GACAGCGTCG TCGGCGCCTT CGCCAAGGGC
GACGTGCTGC AGGTGCTGCT GTTCGCCATT CTGTTCGGCT TCGCGCTGAT GGCCCTCGGC
GACCGCGGCA AGGTCGTGCT GCATGTGATT GACGAGGCGG GGCACGCCAT CTTCGGCGTC
ATCAATATTG TGATGAAGCT CGCGCCGCTC GGCGCCTTTG GCGCGATGGC CTTTACCGTC
GGCAAATATG GGCCGCAATC GCTTGGAAAC CTCGCCGGCC TGATCGCCAC CTTCTACGCG
ACGTCAGCGC TGTTCATTTT CCTCATTCTT GGGACAATCG CCCGCATCGC CGGCTTCAAC
ATCTTCAAAT TCCTCAATTA CATCAAATCC GAACTCCTCA TCGTGCTCGG CACCAGCTCC
TCGGAGAGCG CCTTGCCGGC CCTGATGGAA AAGCTCGAAC GGCTCGGCTG CTCGCGGCCG
GTCGTCGGCC TCGTCGTGCC GACCGGCTAC TCCTTCAATC TCGACGGCAC CAATATTTAC
ATGACGCTGG CGACGCTGTT CATCGCCCAG GCGCTCAACG TCGATCTGAC CTTCGGGCAG
CAGATGACCA TTCTCATCGT CGCCATGCTG ACCTCGAAAG GGGCGAGCGG CGTCACCGGC
GCGGGCTTCG TCACGCTGGC GGCGACCCTC GCCGTGGTCA ATCCGGCGCT CGTGCCGGGC
ATGGCGATCG TGCTTGGAAT CGACAAATTC ATGAGCGAAT GCCGCGCGCT GACCAATATC
ATCGGCAATG GCGTCGCGAC CGTTGTGATC TCCTGGTCGG AAGGCGAGCT CGATCGCGAA
AAACTCAACT TGGCGCTCGG CAAGAATATC GATGTGAGCG ACATCAAGAC AGGCGTCGCC
ACCCCTTGA
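
The GC content field of this record can in principle be recomputed from the gene sequence. A minimal sketch follows, assuming GC content means the fraction of G and C bases over the whole sequence (the record does not define it). The helper name `gc_fraction` is hypothetical, and the example call uses only the first 20 bases, so it is not expected to match the whole-gene figure.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
print(round(gc_fraction("ATGGTCGCGG TATCGGCAGG") * 100))  # 65
```

Pasting the full gene sequence into the call would give the whole-gene percentage to compare against the listed value.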
 
Protein sequence
MVAVSAGAEP SAPAKPFYKV LYVQVLFGIL VGALFGWLWP EYATAPWVKA LGDGFIKLIK 
MLIAPIIFCT VVAGIAHVSD AKKVGRVAVK ALIYFEIVST FALGFGLLMG NVVRPGAGFS
GSHGDAAAAI AFEKQGEGHS TVDFLLGIIP DSVVGAFAKG DVLQVLLFAI LFGFALMALG
DRGKVVLHVI DEAGHAIFGV INIVMKLAPL GAFGAMAFTV GKYGPQSLGN LAGLIATFYA
TSALFIFLIL GTIARIAGFN IFKFLNYIKS ELLIVLGTSS SESALPALME KLERLGCSRP
VVGLVVPTGY SFNLDGTNIY MTLATLFIAQ ALNVDLTFGQ QMTILIVAML TSKGASGVTG
AGFVTLAATL AVVNPALVPG MAIVLGIDKF MSECRALTNI IGNGVATVVI SWSEGELDRE
KLNLALGKNI DVSDIKTGVA TP
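
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates that step under some assumptions: it uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG), and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence in this record.
print(translate("ATGGTCGCGG TATCGGCAGG C"))  # MVAVSAG
```

The output matches the first seven residues of the protein sequence, and the final three codons of the gene (ACC CCT TGA) translate to the closing residues TP followed by a stop.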