Gene EcSMS35_4195 details

Gene Information

Locus tag: EcSMS35_4195
Symbol: metE
ID: 6146586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4294245
End bp: 4296506
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 55%
IMG OID: 641619018
Product: 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase
Protein accession: YP_001746146
Protein GI: 170683914
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0620] Methionine synthase II (cobalamin-independent)
TIGRFAM ID: [TIGR01371] 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase
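
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated in the record, although both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4294245
END_BP = 4296506
GENE_LENGTH_BP = 2262
PROTEIN_LENGTH_AA = 753


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for this record under the assumptions above.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the span of 2262 bp corresponds to 754 codons, one of which is the stop codon, leaving 753 residues.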


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.00351399
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAATTC TTAATCACAC CCTCGGTTTC CCTCGCGTTG GCCTGCGTCG CGAGCTGAAA 
AAAGCGCAAG AGAGTTATTG GGCGGGGAAC TCCACGCGTG AAGAACTGCT GGCGGTAGGG
CGTGAATTAC GTGCTCGTCA CTGGGAGCAA CAAAAGCAAG CGGGTATCGA CCTGCTGCCG
GTGGGCGATT TTGCCTGGTA CGATCATGTA CTGACCACCA GTCTGCTGCT GGGTAATGTT
CCGGCGCGTC ACCAGAACAA CGATGGTTCG GTAGATATCG ACACCCTGTT CCGTATTGGT
CGTGGACGTG CACCGACTGG CGAACCTGCG GCGGCAGCGG AAATGACCAA ATGGTTTAAC
ACCAACTATC ACTACATGGT GCCGGAGTTC GTTAAAGGCC AACAGTTCAA ACTGACCTGG
ACGCAGCTGC TGGAGGAAGT GGACGAGGCG CTGGCGCTGG GCCACAAGGT GAAACCTGTG
CTGCTGGGGC CAGTTACCTA CCTGTGGCTG GGTAAAGTGA AAGGTGAACA GTTTGATCGC
CTGAGCCTGC TGAACGACAT TCTGCCGGTT TATCAGCAAG TGCTGGCAGA ACTGGCGAAA
CGCGGCATCG AGTGGGTACA GATTGATGAA CCCGCGCTGG TACTGGAACT GCCGCAGGCG
TGGCTGGACG CATACAAACC CGCTTACGAC GCGCTCCAGG GACAGGTGAA ACTGCTGCTG
ACCACCTATT TTGAAGGCGT AACACCAAAT CTCGACACGA TTACTGCGCT GCCTGTTCAG
GGTCTGCATG TCGATCTTGT ACATGGTAAA GATGACGTTG CTGAACTGCA CAAGCGTCTG
CCTTCTGACT GGCTGCTGTC TGCGGGTCTT ATCAATGGTC GTAACGTCTG GCGCGCCGAT
CTTACCGAGA AATATGCGCA AATTAAGGAC ATTGTCGGCA AACGTGATTT GTGGGTGGCA
TCTTCCTGCT CACTGCTGCA CAGTCCCATT GACCTGAGTG TGGAAACACG TCTTGATGCA
GAAGTGAAAA GCTGGTTTGC CTTCGCCCTG CAAAAATGTC ATGAACTGGC ATTGCTGCGC
GATGCGTTGA ACAGTGGTGA TACGGCAGCT CTGGCAGCGT GGAGCGCTCC GATTCAGGCG
CGTCGTCACT CTACTCGTGT ACATAATCCG GCAGTAGAAA AGCGTCTGGC GGCGATCACC
GCTCAGGACA GTCAGCGTGC GAATGTCTAT GAAGTGCGTG CTGAAGCCCA GCGTGCGCGT
TTTAAACTCC CAGCATGGCC GACCACCACG ATTGGTTCTT TCCCGCAAAC CACGGAGATT
CGTACCCTGC GTCTGGATTT CAAAAAGGGT AATCTCGACG CCAACAACTA CCGCACAGGC
ATTGCGGAAC ATATCAAGCA GGCCATTGTT GAGCAGGAAC GTTTGGGACT GGATGTGCTG
GTACATGGCG AGGCCGAGCG TAATGACATG GTGGAATACT TTGGCGAGCA TCTGGATGGC
TTTGTCTTTA CGCAAAACGG TTGGGTACAG AGCTACGGTT CCCGCTGCGT GAAGCCACCG
ATTGTTATTG GTGACGTTAG CCGCCCGGCA CCGATTACCG TGGAGTGGGC AAAATATGCG
CAATCCCTGA CTGATAAACC GGTGAAAGGG ATGTTGACCG GCCCGGTGAC TATTCTCTGC
TGGTCGTTCC CGCGTGAAGA TGTCAGCCGT GAAACCATCG CCAAACAAAT TGCGCTGGCG
CTGCGTGATG AAGTCGCGGA CCTGGAAGCC GCTGGAATTG GCATCATTCA GATTGACGAA
CCGGCATTGC GCGAAGGTTT ACCACTGCGT CGCAGCGACT GGGATGCCTA TCTCCAGTGG
GGCGTGGAGG CTTTCCGTAT CAACGCCGCT GTGGCGAAAG ATGACACACA AATCCACACT
CACATGTGTT ACTGCGAGTT CAACGACATC ATGGATTCGA TTGCGGCGCT GGACGCAGAC
GTCATCACCA TCGAAACCTC GCGTTCCGAC ATGGAGTTGC TGGAGTCGTT CGAAGAGTTT
GATTATCCAA ATGAAATCGG TCCTGGCGTC TATGACATTC ACTCGCCAAA CGTACCGAGC
GTGGAATGGA TTGAAGCCTT GCTGAAGAAA GCGGCAAAAC GCATTCCGGC AGAGCGTCTG
TGGGTCAACC CGGACTGTGG CCTGAAAACG CGCGGCTGGC CAGAAACCCG CGCGGCACTG
GCGAACATGG TGCAGGCGGC GCAGAATTTG CGTCGGGGAT GA
 
Protein sequence
MTILNHTLGF PRVGLRRELK KAQESYWAGN STREELLAVG RELRARHWEQ QKQAGIDLLP 
VGDFAWYDHV LTTSLLLGNV PARHQNNDGS VDIDTLFRIG RGRAPTGEPA AAAEMTKWFN
TNYHYMVPEF VKGQQFKLTW TQLLEEVDEA LALGHKVKPV LLGPVTYLWL GKVKGEQFDR
LSLLNDILPV YQQVLAELAK RGIEWVQIDE PALVLELPQA WLDAYKPAYD ALQGQVKLLL
TTYFEGVTPN LDTITALPVQ GLHVDLVHGK DDVAELHKRL PSDWLLSAGL INGRNVWRAD
LTEKYAQIKD IVGKRDLWVA SSCSLLHSPI DLSVETRLDA EVKSWFAFAL QKCHELALLR
DALNSGDTAA LAAWSAPIQA RRHSTRVHNP AVEKRLAAIT AQDSQRANVY EVRAEAQRAR
FKLPAWPTTT IGSFPQTTEI RTLRLDFKKG NLDANNYRTG IAEHIKQAIV EQERLGLDVL
VHGEAERNDM VEYFGEHLDG FVFTQNGWVQ SYGSRCVKPP IVIGDVSRPA PITVEWAKYA
QSLTDKPVKG MLTGPVTILC WSFPREDVSR ETIAKQIALA LRDEVADLEA AGIGIIQIDE
PALREGLPLR RSDWDAYLQW GVEAFRINAA VAKDDTQIHT HMCYCEFNDI MDSIAALDAD
VITIETSRSD MELLESFEEF DYPNEIGPGV YDIHSPNVPS VEWIEALLKK AAKRIPAERL
WVNPDCGLKT RGWPETRAAL ANMVQAAQNL RRG
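
The relationship between the two sequence blocks can be illustrated by translating part of the gene sequence. The sketch below is a minimal example, not the tool used to produce this record: it builds a codon table from the standard amino-acid assignments, on the assumption that translation table 11 uses the same codon-to-residue mapping as the standard code for internal codons. Only the first and last rows of the gene sequence are translated; `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    codons = (seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3))
    return "".join(CODON_TABLE[c] for c in codons)


# First row of the gene sequence, copied from the record.
first_row = "ATGACAATTC TTAATCACAC CCTCGGTTTC CCTCGCGTTG GCCTGCGTCG CGAGCTGAAA"
# Last row of the gene sequence; '*' in the output marks the stop codon.
last_row = "GCGAACATGG TGCAGGCGGC GCAGAATTTG CGTCGGGGAT GA"

print(translate(first_row))
print(translate(last_row))
```

The first row translates to the opening 20 residues of the protein sequence, and the last row ends in a stop codon immediately after the final glycine.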