Gene EcolC_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_4179 
Symbol 
ID6067326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4616346 
End bp4618607 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content56% 
IMG OID641603607 
Product5-methyltetrahydropteroyltriglutamate-- homocysteine S-methyltransferase 
Protein accessionYP_001727103 
Protein GI170022149 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0620] Methionine synthase II (cobalamin-independent) 
TIGRFAM ID[TIGR01371] 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0336494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATTC TTAATCACAC CCTCGGTTTC CCTCGCGTTG GCCTGCGTCG CGAGCTGAAA 
AAAGCGCAAG AAAGTTATTG GGCGGGGAAC TCCACGCGTG AAGAACTGCT GACAGTAGGG
CGTGAACTGC GTGCTCGTCA CTGGGATCAA CAAAAGCAAG CGGGTATCGA CCTGCTGCCG
GTGGGCGATT TTGCCTGGTA CGATCATGTA CTGACCACCA GTTTGCTGCT GGGTAATGTT
CCGCCACGTC ATCAGAACAA AGACGGTTCG GTAGATATCG ACACCCTGTT CCGTATTGGT
CGTGGACGTG CGCCGACTGG CGAACCTGCG GCGGCAGCGG AAATGACCAA ATGGTTTAAC
ACCAACTATC ACTACATGGT GCCGGAGTTC GTTAAAGGCC AACAGTTCAA ACTGACCTGG
ACGCAGCTGC TGGAGGAAGT GGACGAGGCG CTGGCGCTGG GCCACAACGT TAAGCCTGTG
CTGCTGGGGC CGGTTACTTA CCTGTGGCTG GGGAAAGTGA AAGGTGAACA GTTTGACCGC
CTTAGCCTGC TGAACGACAT TCTGCCGGTT TATCAGCAAG TGCTGGCAGA ACTGGCGAAA
CGCGGCATCG AGTGGGTACA GATTGATGAA CCCGCGCTGG TACTGGAACT ACCACAGGCG
TGGCTGGACG CATACAAACC CGCTTACGAC GCGCTCCAGG GACAGGTGAA ACTGCTGCTG
ACCACCTATT TTGAAGGCGT AACGCCAAAT CTCGACACGA TTACTGCGCT GCCTGTTCAG
GGGCTGCATG TGGACCTTGT ACATGGTAAA GATGACGTTG TTGAACTGCA CAAGCGCCTG
CCTTCTGACT GGCTGCTGTC TGCGGGTCTG ATCAATGGTC GTAACGTCTG GCGCGCCGAT
CTTACTGAGA AATATGCGCA AATTAAGGAC ATTGTCGGCA AACGTGATTT GTGGGTGGCA
TCTTCCTGCT CGTTGCTGCA CAGCCCCATC GACCTGAGCG TGGAAACGCG TCTTGATGCA
GAAGTGAAAA GCTGGTTTGC CTTCGCCCTA CAAAAATGCC ATGAACTGGC ACTGCTGCGC
GATGCGCTGA ACAGTGGTGA CACGGCAGCT CTGGCAGAGT GGAGCGCCCC AATTCAGGCG
CGTCGTAACT CTACTCGTGT ACATAATCCG GCGGTAGAAA AGCGTCTGGC GGCGATCACC
GCCCAGGACA GCCAGCGTGC GAATGTCTAT GAAGTGCGTG CCGAAGCCCA GCGTGCGCGT
TTTAAACTGC CAGCGTGGCC GACCACCACG ATTGGTTCTT TCCCGCAAAC CACGGAAATT
CGTACCCTGC GTCTGGATTT CAAAAAGGGT AATCTCGACG CCAATAACTA CCGCACGGGC
ATTGCGGAAC ATATCAGGCA GGCCATTGTT GAGCAGGAAC GTTTGGGACT GGATGTGCTG
GTACATGGCG AGGCTGAGCG TAATGACATG GTGGAATACT TTGGCGAGCA CCTCGACGGG
TTTGTCTTTA CGCAAAACGG TTGGGTACAG AGCTACGGTT CACGCTGCGT GAAGCCACCG
ATTGTTATTG GTGACGTTAG CCGCCCGGCA CCGATCACCG TGGAGTGGGC GAAGTATGCG
CAATCGCTGA CTGATAAACC GGTAAAAGGG ATGCTGACTG GCCCGGTGAC CATTCTCTGC
TGGTCGTTCC CGCGTGAAGA TGTCAGCCGT GAAACCATCG CCAAACAGAT TGCGCTGGCG
CTGCGTGATG AAGTGGCCGA TCTGGAAGCC GCTGGAATTG GCATCATCCA GATTGACGAA
CCGGCGCTGC GCGAAGGTTT ACCGCTGCGT CGTAGCGACT GGGATGCATA TCTCCAGTGG
GGCGTGGAGG CCTTCCGTAT CAACGCCGCC GTGGCGAAAG ATGACACACA AATCCACACT
CACATGTGTT ATTGCGAGTT CAACGACATC ATGGATTCGA TTGCGGCACT GGACGCAGAC
GTCATCACCA TCGAAACCTC GCGTTCTGAC ATGGAGTTGC TGGAGTCGTT CGAAGAGTTT
GATTATCCGA ATGAAATCGG TCCAGGCGTC TATGACATTC ACTCGCCAAA CGTACCGAGC
GTGGAATGGA TTGAAGCCTT GCTGAAGAAA GCGGCAAAAC GCATTCCGGC AGAGCGCCTG
TGGGTTAACC CGGACTGTGG CCTGAAAACG CGCGGCTGGC CAGAAACCCG CGCGGCACTG
GCGAACATGG TGCAGGCGGC GCAGAATTTG CGTCGGGGGT AA
 
Protein sequence
MTILNHTLGF PRVGLRRELK KAQESYWAGN STREELLTVG RELRARHWDQ QKQAGIDLLP 
VGDFAWYDHV LTTSLLLGNV PPRHQNKDGS VDIDTLFRIG RGRAPTGEPA AAAEMTKWFN
TNYHYMVPEF VKGQQFKLTW TQLLEEVDEA LALGHNVKPV LLGPVTYLWL GKVKGEQFDR
LSLLNDILPV YQQVLAELAK RGIEWVQIDE PALVLELPQA WLDAYKPAYD ALQGQVKLLL
TTYFEGVTPN LDTITALPVQ GLHVDLVHGK DDVVELHKRL PSDWLLSAGL INGRNVWRAD
LTEKYAQIKD IVGKRDLWVA SSCSLLHSPI DLSVETRLDA EVKSWFAFAL QKCHELALLR
DALNSGDTAA LAEWSAPIQA RRNSTRVHNP AVEKRLAAIT AQDSQRANVY EVRAEAQRAR
FKLPAWPTTT IGSFPQTTEI RTLRLDFKKG NLDANNYRTG IAEHIRQAIV EQERLGLDVL
VHGEAERNDM VEYFGEHLDG FVFTQNGWVQ SYGSRCVKPP IVIGDVSRPA PITVEWAKYA
QSLTDKPVKG MLTGPVTILC WSFPREDVSR ETIAKQIALA LRDEVADLEA AGIGIIQIDE
PALREGLPLR RSDWDAYLQW GVEAFRINAA VAKDDTQIHT HMCYCEFNDI MDSIAALDAD
VITIETSRSD MELLESFEEF DYPNEIGPGV YDIHSPNVPS VEWIEALLKK AAKRIPAERL
WVNPDCGLKT RGWPETRAAL ANMVQAAQNL RRG