Gene Msil_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1050 
Symbol 
ID7091878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1137354 
End bp1138550 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content64% 
IMG OID643464389 
Productsodium/glutamate symporter 
Protein accessionYP_002361381 
Protein GI217977234 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0786] Na+/glutamate symporter 
TIGRFAM ID[TIGR00210] sodium--glutamate symport carrier (gltS) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0733546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTCA TTGAAGCCTC GGGGCTCCTC ACCTTCACCC TTGCGATCGT CGTGTTCTTT 
ATCGGCGCCG GCCTCAATCA TTTGATCGCG CCGCTGCGCC GCTGGAACAT TCCCGAAGCG
GTGACGGGCG GACTGACGGC CGCGCTCGCG ACGCTCGTCG CCTATCGGGT GTTCGGCGTC
GAAATTCATT TCAGTCTCGA CGCGCGGGAC ATGCTGCTGT TGTATTTCTT CACCGGCATC
GGCCTCAACG CCAAGCTTGG CGATCTCGTC GCCGGAGGGC GGCCGCTTCT CGTGCTGCTG
GCGGTGACGC TTGTCTATCT GGTGATCCAG AATTTGATCG CGGCGGGCGC GGCATCTGCC
CTGCGTCTCC CCGAGGGAAT CACTGTGCTG CTCGGCTCGG CTTCGCTGAT CGGCGGCCAT
GGCACGACCA TCGCCTGGGC GCCTCTGATC ACGGAGCGCT TCGGGCTCGC CAACGCGATG
GAGATTGGCG TCGCCTCCGC GACGCTCGGC CTCGTCATCG CCAGCCTGAT CGGCGGCCCC
GTCGCCGGCG TCCTGATATC GCGCTACAAG CTCGTCGGGC CGATGAATGA GGCGCCCTCC
GTCGGCCTGC CCGACGATAC GAAACTCGAC GACCTCAACC ACGTCAATCT GCTGCGGACG
ATTCTCGTTC TCAACATCGT GATATTGATC GGCTTTCTGG CGCATGAAGC GCTGGTCGAA
GCGGGCGTGC GCATGCCGCG CTTCGTCGTC TGCCTGCTGG TCGCAATCTT GTTCACAAAC
ACCATCCCGC GCCTGCTGCC GCGCCTCGAT TGGCCCTCGC GCAGCCGCTC GCTCGCGCTG
ATTTCCGATC TGTCGCTGAA CGTCTTTCTC GTGATGTCCT TGATGAGCAT GCAGCTCTGG
ACGCTCGGCG GCCTCGGCCC GGCGCTGGTC GCCGTGCTCG CCGCGCAGAC GATCGTCGCG
GTGATCTATA TGTTGTTCGT CGTCTTCCCG GCCATGGGCC GCAACTATGA GGCGGCGGTT
ATCGCGGCGG GATTTGGCGG CATCAGCCTC GGCGCCACGC CGACCGCCAT CGCCAATATG
ACGGCGATCA CCAAGGTCCA TGGCGCGGCG CCGACCGCAT TCATTATCTT GCCGCTCGTG
TCGGCGTTTT TCATCGACAT CGCCAATGCG GGCGCCATCG GCTTTCTGGT GCGCTAG
 
Protein sequence
MRVIEASGLL TFTLAIVVFF IGAGLNHLIA PLRRWNIPEA VTGGLTAALA TLVAYRVFGV 
EIHFSLDARD MLLLYFFTGI GLNAKLGDLV AGGRPLLVLL AVTLVYLVIQ NLIAAGAASA
LRLPEGITVL LGSASLIGGH GTTIAWAPLI TERFGLANAM EIGVASATLG LVIASLIGGP
VAGVLISRYK LVGPMNEAPS VGLPDDTKLD DLNHVNLLRT ILVLNIVILI GFLAHEALVE
AGVRMPRFVV CLLVAILFTN TIPRLLPRLD WPSRSRSLAL ISDLSLNVFL VMSLMSMQLW
TLGGLGPALV AVLAAQTIVA VIYMLFVVFP AMGRNYEAAV IAAGFGGISL GATPTAIANM
TAITKVHGAA PTAFIILPLV SAFFIDIANA GAIGFLVR