Gene EcSMS35_4472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4472 
SymbolmetH 
ID6142641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4566731 
End bp4570414 
Gene Length3684 bp 
Protein Length1227 aa 
Translation table11 
GC content56% 
IMG OID641619288 
ProductB12-dependent methionine synthase 
Protein accessionYP_001746400 
Protein GI170681236 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0646] Methionine synthase I (cobalamin-dependent), methyltransferase domain
[COG1410] Methionine synthase I, cobalamin-binding domain 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR02082] 5-methyltetrahydrofolate--homocysteine methyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.346478 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGCA AAGTGGAACA ACTGCGTGCG CAGTTAAATG AACGTATTCT GGTGCTGGAC 
GGCGGTATGG GCACCATGAT CCAGAGCTAT CGACTGAACG AAGCCGATTT TCGTGGTGAA
CGCTTTGCCG ACTGGCCATG CGACCTCAAA GGCAACAACG ACCTGCTGGT ACTCAGTAAA
CCGGAAGTGA TCGCCGCTAT CCACAACGCC TACTTTGAAG CGGGCGCGGA TATCATCGAA
ACCAACACCT TCAACTCCAC GACCATTGCG ATGGCGGATT ACCAGATGGA ATCCCTGTCG
GCGGAAATCA ACTTTGCGGC GGCGAAACTG GCGCGAGCTT GTGCTGACGA GTGGACCGCG
CGCACGCCAG AGAAACCGCG CTACGTTGCC GGTGTTCTCG GCCCGACCAA CCGCACGGCG
TCTATTTCTC CGGACGTCAA CGATCCGGCA TTTCGTAATA TCACTTTTGA CGGGCTGGTG
GCGGCTTATC GAGAGTCCAC CAAAGCGCTG GTGGAAGGTG GCGCGGATCT GATCCTGATT
GAAACCGTTT TCGACACCCT TAACGCCAAA GCGGCGGTCT TTGCGGTGAA AACGGAGTTT
GAAGCGCTGG GCGTTGAGCT GCCGATTATG ATCTCCGGCA CCATCACCGA CGCCTCCGGG
CGCACGCTCT CCGGGCAGAC CACCGAAGCG TTTTATAACT CTCTTCGCCA CGCCGAAGCG
CTGACTTTCG GCCTTAACTG TGCGCTGGGG CCAGACGAAC TGCGCCAGTA CGTGCAGGAG
CTGTCACGGA TTGCGGAATG CTACGTCACC GCGCACCCAA ACGCCGGGCT ACCCAACGCC
TTTGGTGAGT ACGATCTCGA CGCCGACACG ATGGCAAAAC AGATACGTGA ATGGGCGCAA
GCGGGCTTTC TCAATATCGT CGGCGGCTGC TGTGGCACCA CGCCACAACA TATTGCAGCG
ATGAGTCGTG CAGTAGAAGG ATTAGCGCCG CGCAAACTGC CGGAAATCCC CGTAGCCTGC
CGTTTGTCCG GCCTGGAGCC GCTGAACATT GGCGAAGATA GCCTGTTTGT GAACGTGGGG
GAACGCACCA ACGTCACCGG TTCCGCTAAG TTCAAGCGCC TGATCAAAGA AGAGAAATAC
AGCGAGGCGC TGGATGTGGC GCGTCAGCAG GTGGAAAACG GTGCGCAGAT TATCGATATC
AACATGGATG AAGGGATGCT CGACGCCGAA GCGGCGATGG TGCGTTTTCT CAATCTGATT
GCCGGTGAAC CGGATATCGC TCGCGTGCCG ATTATGATCG ACTCCTCAAA ATGGGATGTC
ATTGAAAAAG GTCTGAAGTG TATCCAGGGC AAAGGCATTG TTAACTCTAT TTCGATGAAA
GAAGGCGTCG ACGCTTTTAT CCATCACGCG AAATTGTTGC GTCGTTACGG TGCGGCAGTC
GTGGTAATGG CTTTTGACGA ACAGGGCCAG GCCGATACCC GCGCACGGAA AATCGAGATT
TGCCGTCGGG CGTACAAAAT CCTCACCGAA GAGGTTGGCT TCCCGGCTGA AGATATCATC
TTCGACCCGA ACATCTTCGC GGTCGCAACT GGCATTGAAG AGCACAACAA TTACGCGCAG
GACTTTATCG GCGCGTGTGA AGACATCAAA CGTGAACTGC CGCACGCACT GATTTCCGGC
GGCGTTTCTA ACGTTTCTTT CTCATTCCGT GGCAACGATC CGGTGCGCGA AGCCATTCAC
GCGGTGTTCC TCTACTACGC TATTCGCAAT GGCATGGATA TGGGGATCGT CAACGCCGGG
CAGCTGGCGA TTTACGACGA CCTACCCGCT GAACTGCGCG ACGCGGTGGA GGATGTGATC
CTCAACCGTC GCGACGATGG CACCGAGCGT TTACTCGAGC TTGCCGAGAA ATATCGCGGC
AGCAAAACCG ACGACACCGC TAACACCCAG CAGGCAGAGT GGCGTTCGTG GGAAGTGAAT
AAACGTCTGG AATACTCGCT GGTCAAAGGC ATTACCGAGT TTATCGAGCA GGATACCGAA
GAAGCCCGCC AGCAGGCTAC GCGTCCGATT GAAGTGATTG AAGGCCCGTT GATGGACGGC
ATGAACGTGG TCGGCGACCT GTTTGGCGAG GGCAAAATGT TCCTGCCGCA GGTGGTGAAA
TCGGCGCGCG TCATGAAACA GGCGGTGGCG TATCTCGAGC CGTTCATTGA AGCCTGCAAA
GAGCAGGGTA AAACCAACGG CAAGATGGTG ATCGCCACAG TGAAGGGCGA TGTCCACGAC
ATCGGTAAAA ATATCGTTGG CGTGGTGCTG CAATGTAACA ACTACGAAAT TGTCGATCTC
GGCGTGATGG TGCCTGCGGA AAAAATTCTC CGTACCGCTA AAGAAGTGAA TGCTGATTTG
ATTGGCCTTT CGGGGCTGAT CACGCCGTCG CTGGACGAGA TGGTTAACGT GGCGAAAGAG
ATGGAGCGTC AGGGCTTCAC CATTCCGCTA TTGATTGGCG GCGCGACCAC CTCAAAAGCG
CACACGGCGG TGAAAATTGA GCAGAACTAC AGCGGCCCGA CGGTGTATGT GCAGAACGCT
TCACGCACCG TTGGTGTGGT GGCGGCGTTG CTTTCCGACA CTCAGCGCGA TGATTTCGTT
GCTCGTACCC GCAAGGAGTA CGAAACCGTG CGTATTCAGC ACGGACGAAA GAAACCACGC
ACACCACCGG TCACGCTGGA AGCGGCGCGT GATAACGACT TCGCTTTTGA CTGGCAGGCT
TACACGCCGC CGGTGGCGCA CCGTCTCGGT GTGCAGGAAG TCGAAGCCAG CATCGAAACG
CTGCGTAATT ACATCGACTG GACGCCGTTC TTTATGACCT GGTCGCTGGC CGGGAAGTAT
CCGCGCATTC TGGAAGATGA AGTGGTAGGT GTTGAGGCGC AGCGGCTGTT TAAAGACGCC
AACGACATGC TGGATAAATT AAGCGCCGAA AAAATGCTGA ACCCGCGTGG CGTGGTGGGC
CTGTTCCCGG CAAACCGTGT GGGCGATGAC ATTGAAATCT ACCGTGACGA AACGCGTACC
CATGTGATCA ACGTCAGCCA CCATCTGCGC CAACAGACCG AAAAAACCGG CTTCGCTAAC
TACTGTCTCG CTGACTTCGT TGCGCCGAAG CTTTCTGGTA AAGCAGATTA CATCGGCGCA
TTTGCCGTGA CTGGCGGGCT GGAAGAGGAC GCACTGGCTG ATGCCTTTGA AGCGCAGCAC
GATGATTACA ACAAAATCAT GGTGAAAGCC CTTGCAGACC GTCTGGCGGA AGCCTTTGCG
GAGTATCTCC ATGAGCGTGT GCGTAAAGTC TACTGGGGCT ATGCGCCGAA CGAGAACCTC
AGCAACGAAG AACTGATCCG CGAAAACTAC CAGGGCATCC GTCCGGCACC GGGCTATCCG
GCCTGCCCGG AACATACGGA AAAAGCCACC ATCTGGGAGC TGCTGGAAGT GGAAAAACAC
ACTGGCATGA AACTCACAGA ATCTTTCGCC ATGTGGCCCG GTGCATCGGT TTCGGGTTGG
TATTTCAGCC ACCCGGACAG CAAGTACTAC GCGGTGGCGC AAATTCAGCG CGATCAGGTT
GAAGACTATG CCCGCCGTAA AGGTATGAGC GTCTCAGATG TTGAGCGCTG GCTGGCACCG
AATCTGGGGT ATGACGCGGA CTGA
 
Protein sequence
MSSKVEQLRA QLNERILVLD GGMGTMIQSY RLNEADFRGE RFADWPCDLK GNNDLLVLSK 
PEVIAAIHNA YFEAGADIIE TNTFNSTTIA MADYQMESLS AEINFAAAKL ARACADEWTA
RTPEKPRYVA GVLGPTNRTA SISPDVNDPA FRNITFDGLV AAYRESTKAL VEGGADLILI
ETVFDTLNAK AAVFAVKTEF EALGVELPIM ISGTITDASG RTLSGQTTEA FYNSLRHAEA
LTFGLNCALG PDELRQYVQE LSRIAECYVT AHPNAGLPNA FGEYDLDADT MAKQIREWAQ
AGFLNIVGGC CGTTPQHIAA MSRAVEGLAP RKLPEIPVAC RLSGLEPLNI GEDSLFVNVG
ERTNVTGSAK FKRLIKEEKY SEALDVARQQ VENGAQIIDI NMDEGMLDAE AAMVRFLNLI
AGEPDIARVP IMIDSSKWDV IEKGLKCIQG KGIVNSISMK EGVDAFIHHA KLLRRYGAAV
VVMAFDEQGQ ADTRARKIEI CRRAYKILTE EVGFPAEDII FDPNIFAVAT GIEEHNNYAQ
DFIGACEDIK RELPHALISG GVSNVSFSFR GNDPVREAIH AVFLYYAIRN GMDMGIVNAG
QLAIYDDLPA ELRDAVEDVI LNRRDDGTER LLELAEKYRG SKTDDTANTQ QAEWRSWEVN
KRLEYSLVKG ITEFIEQDTE EARQQATRPI EVIEGPLMDG MNVVGDLFGE GKMFLPQVVK
SARVMKQAVA YLEPFIEACK EQGKTNGKMV IATVKGDVHD IGKNIVGVVL QCNNYEIVDL
GVMVPAEKIL RTAKEVNADL IGLSGLITPS LDEMVNVAKE MERQGFTIPL LIGGATTSKA
HTAVKIEQNY SGPTVYVQNA SRTVGVVAAL LSDTQRDDFV ARTRKEYETV RIQHGRKKPR
TPPVTLEAAR DNDFAFDWQA YTPPVAHRLG VQEVEASIET LRNYIDWTPF FMTWSLAGKY
PRILEDEVVG VEAQRLFKDA NDMLDKLSAE KMLNPRGVVG LFPANRVGDD IEIYRDETRT
HVINVSHHLR QQTEKTGFAN YCLADFVAPK LSGKADYIGA FAVTGGLEED ALADAFEAQH
DDYNKIMVKA LADRLAEAFA EYLHERVRKV YWGYAPNENL SNEELIRENY QGIRPAPGYP
ACPEHTEKAT IWELLEVEKH TGMKLTESFA MWPGASVSGW YFSHPDSKYY AVAQIQRDQV
EDYARRKGMS VSDVERWLAP NLGYDAD