Gene EcHS_A4053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4053 
SymbolmetE 
ID5591107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4043411 
End bp4045672 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content56% 
IMG OID640923157 
Product5-methyltetrahydropteroyltriglutamate-- homocysteine S-methyltransferase 
Protein accessionYP_001460623 
Protein GI157163305 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0620] Methionine synthase II (cobalamin-independent) 
TIGRFAM ID[TIGR01371] 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATTC TTAATCACAC CCTCGGTTTC CCTCGCGTTG GCCTGCGTCG CGAGCTGAAA 
AAAGCGCAAG AAAGTTATTG GGCGGGGAAC TCCACGCGTG AAGAACTGCT GACAGTAGGG
CGTGAACTGC GTGCTCGTCA CTGGGATCAA CAAAAGCAAG CGGGTATCGA CCTGCTGCCG
GTGGGCGATT TTGCCTGGTA CGATCATGTA CTGACCACCA GTTTGCTGCT GGGTAATGTT
CCGCCACGTC ATCAGAACAA AGACGGTTCG GTAGATATCG ACACCCTGTT CCGTATTGGT
CGTGGACGTG CGCCGACTGG CGAACCTGCG GCGGCAGCGG AAATGACCAA ATGGTTTAAC
ACCAACTATC ACTACATGGT GCCGGAGTTC GTTAAAGGCC AACAGTTCAA ACTGACCTGG
ACGCAGCTGC TGGAGGAAGT GGACGAGGCG CTGGCGCTGG GCCACAACGT TAAGCCTGTG
CTGCTGGGGC CGGTTACTTA CCTGTGGCTG GGGAAAGTGA AAGGTGAACA GTTTGACCGC
CTTAGCCTGC TGAACGACAT TCTGCCGGTT TATCAGCAAG TGCTGGCAGA ACTGGCGAAA
CGCGGCATCG AGTGGGTACA GATTGATGAA CCCGCGCTGG TACTGGAACT ACCACAGGCG
TGGCTGGACG CATACAAACC CGCTTACGAC GCGCTCCAGG GACAGGTGAA ACTGCTGCTG
ACCACCTATT TTGAAGGCGT AACGCCAAAT CTCGACACGA TTACTGCGCT GCCTGTTCAG
GGTCTGCATG TTGACCTCGT ACATGGTAAA GATGACGTTG CTGAACTGCA CAAGCGCCTG
CCTTCTGACT GGTTGCTGTC TGCGGGTCTG ATCAATGGTC GTAACGTCTG GCGCGCCGAT
CTTACCGAGA AATATGCGCA AATTAAGGAC ATTGTCGGCA AACGTGATTT GTGGGTGGCA
TCTTCCTGCT CGTTGCTGCA CAGCCCCATC GACCTGAGCG TGGAAACGCG TCTTGATGCA
GAAGTGAAAA GCTGGTTTGC CTTCGCCCTA CAAAAATGCC ATGAACTGGC ACTGCTGCGC
GATGCGCTGA ACAGTGGTGA CACGGCAGCT CTGGCAGAGT GGAGCGCCCC GATTCAGGCA
CGTCGTCACT CTACCCGCGT ACATAATCCG GCGGTAGAAA AGCGTCTGGC GGCGATCACC
GCCCAGGACA GCCAGCGTGC GAATGTCTAT GAAGTGCGTG CTGAAGCCCA GCGTGCGCGT
TTTAAACTGC CAGCGTGGCC GACCACCACG ATTGGTTCCT TCCCGCAAAC CACGGAAATT
CGTACCCTGC GTCTGGATTT CAAAAAGGGC AATCTCGACG CCAACAACTA CCGCACGGGC
ATTGCGGAAC ATATCAAGCA GGCCATTGTT GAGCAGGAAC GTTTGGGACT GGATGTGCTG
GTACATGGCG AGGCCGAGCG TAATGACATG GTGGAATACT TTGGCGAGCA CCTCGACGGA
TTTGTCTTTA CGCAAAACGG TTGGGTACAG AGCTACGGTT CCCGCTGCGT GAAGCCACCG
ATTGTCATTG GTGACATTAG CCGCCCGGCA CCGATTACCG TGGAGTGGGC GAAGTATGCG
CAATCGCTGA CCGACAAACC GGTGAAAGGG ATGCTGACGG GGCCGGTGAC CATACTCTGC
TGGTCGTTCC CGCGTGAAGA TGTCAGCCGT GAAACCATCG CCAAACAGAT TGCGCTGGCG
CTGCGTGATG AAGTGGCCGA TCTGGAAGCC GCTGGAATTG GCATCATCCA GATTGACGAA
CCGGCGCTGC GCGAAGGTTT ACCGCTGCGT CGTAGCGACT GGGATGCGTA TCTCCAGTGG
GGCGTAGAGG CCTTCCGTAT CAACGCCGCC GTGGCGAAAG ATGACACACA AATCCACACT
CACATGTGTT ATTGCGAGTT CAACGACATC ATGGATTCGA TTGCGGCGCT GGACGCAGAC
GTCATCACCA TCGAAACCTC GCGTTCCGAC ATGGAGTTGC TGGAGTCGTT TGAAGAGTTT
GATTATCCAA ATGAAATCGG TCCTGGCGTC TATGACATTC ACTCGCCAAA CGTACCGAGC
GTGGAATGGA TTGAAGCCTT GCTGAAGAAA GCGGCAAAAC GCATTCCGGC AGAGCGCCTG
TGGGTCAACC CGGACTGTGG CCTGAAAACG CGCGGCTGGC CAGAAACCCG CGCGGCACTG
GCGAACATGG TGCAGGCGGC GCAGAACTTG CGTCGGGGGT AA
 
Protein sequence
MTILNHTLGF PRVGLRRELK KAQESYWAGN STREELLTVG RELRARHWDQ QKQAGIDLLP 
VGDFAWYDHV LTTSLLLGNV PPRHQNKDGS VDIDTLFRIG RGRAPTGEPA AAAEMTKWFN
TNYHYMVPEF VKGQQFKLTW TQLLEEVDEA LALGHNVKPV LLGPVTYLWL GKVKGEQFDR
LSLLNDILPV YQQVLAELAK RGIEWVQIDE PALVLELPQA WLDAYKPAYD ALQGQVKLLL
TTYFEGVTPN LDTITALPVQ GLHVDLVHGK DDVAELHKRL PSDWLLSAGL INGRNVWRAD
LTEKYAQIKD IVGKRDLWVA SSCSLLHSPI DLSVETRLDA EVKSWFAFAL QKCHELALLR
DALNSGDTAA LAEWSAPIQA RRHSTRVHNP AVEKRLAAIT AQDSQRANVY EVRAEAQRAR
FKLPAWPTTT IGSFPQTTEI RTLRLDFKKG NLDANNYRTG IAEHIKQAIV EQERLGLDVL
VHGEAERNDM VEYFGEHLDG FVFTQNGWVQ SYGSRCVKPP IVIGDISRPA PITVEWAKYA
QSLTDKPVKG MLTGPVTILC WSFPREDVSR ETIAKQIALA LRDEVADLEA AGIGIIQIDE
PALREGLPLR RSDWDAYLQW GVEAFRINAA VAKDDTQIHT HMCYCEFNDI MDSIAALDAD
VITIETSRSD MELLESFEEF DYPNEIGPGV YDIHSPNVPS VEWIEALLKK AAKRIPAERL
WVNPDCGLKT RGWPETRAAL ANMVQAAQNL RRG