Gene EcDH1_4038 details


Gene Information

Locus tag: EcDH1_4038
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4372249
End bp: 4374750
Gene Length: 2502 bp
Protein Length: 833 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: ACX41638
Protein GI: 260451216
COG category:
COG ID:
TIGRFAM ID:
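
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in this record, and the function name is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and a gene length that includes
    the stop codon (assumptions, not stated in the record).
    """
    span = end_bp - start_bp + 1                    # inclusive span in bp
    codons, remainder = divmod(gene_length_bp, 3)   # whole codons, leftover bases
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa         # drop the stop codon
    )


# Values copied from the Gene Information fields of this record.
print(check_cds_lengths(4372249, 4374750, 2502, 833))  # prints True
```

Under those assumptions the fields agree: 4374750 - 4372249 + 1 = 2502 bp, and 2502 / 3 = 834 codons, one of which is the stop codon, leaving 833 aa.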


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCTGA TTGTGGAATT TATTTGTGAG CTACCTAACG GCGTACATGC GCGTCCGGCA 
AGCCACGTTG AAACGCTGTG TAATACTTTT TCATCACAAA TTGAGTGGCA TAACCTGCGC
ACTGACCGCA AGGGCAACGC CAAAAGCGCC CTTGCGCTGA TTGGCACCGA TACGCTGGCG
GGCGATAACT GCCAGTTACT GATTTCCGGG GCCGACGAAC AGGAAGCGCA CCAGCGTTTA
AGCCAATGGC TGCGCGATGA ATTCCCCCAC TGCGATGCGC CGCTGGCGGA AGTTAAATCT
GACGAACTGG AACCACTGCC GGTTTCACTG ACCAATCTGA ATCCGCAAAT TATCCGCGCC
CGCACCGTGT GCAGCGGTAG TGCAGGCGGC ATTCTGACGC CGATCTCTTC TTTAGATCTC
AATGCGCTGG GTAATCTTCC CGCAGCCAAA GGCGTTGACG CCGAGCAATC CGCACTGGAA
AACGGCCTGA CGCTGGTACT GAAAAACATT GAGTTTCGTC TGCTGGATAG CGACGGTGCT
ACCAGCGCGA TTCTGGAAGC TCACCGATCC CTGGCTGGCG ATACTTCCCT GCGCGAACAT
TTACTGGCAG GCGTCAGCGC CGGATTAAGC TGCGCCGAAG CAATTGTTGC CAGCGCGAAT
CACTTTTGCG AAGAGTTTTC CCGTTCCAGC AGCAGCTACC TGCAAGAACG TGCCCTGGAC
GTACGCGACG TCTGCTTCCA GTTACTCCAG CAAATCTACG GTGAGCAACG CTTCCCGGCA
CCGGGCAAAC TGACGCAGCC CGCCATTTGT ATGGCTGATG AACTGACCCC CAGCCAGTTC
CTCGAACTGG ATAAAAATCA CCTCAAAGGA TTGTTGCTCA AAAGCGGCGG CACCACCTCA
CATACGGTGA TCCTTGCCCG TTCGTTCAAC ATTCCAACGC TGGTTGGTGT GGATATTGAT
GCCCTTACTC CGTGGCAGCA ACAAACGATT TATATCGACG GCAACGCCGG GGCGATTGTG
GTTGAGCCAG GGGAAGCCGT AGCTCGTTAT TATCAGCAAG AAGCCCGCGT ACAGGACGCC
CTGCGTGAGC AACAGCGTGT CTGGCTGACC CAACAAGCCC GTACCGCTGA CGGTATCCGC
ATTGAAATTG CCGCTAACAT CGCTCACTCC GTGGAAGCGC AGGCCGCATT CGGCAATGGT
GCGGAAGGCG TTGGTTTGTT CCGCACTGAA ATGCTCTATA TGGATCGCAC CAGCGCACCG
GGCGAAAGCG AGTTGTACAA CATTTTTTGT CAGGCGCTGG AATCCGCCAA CGGACGCAGC
ATTATTGTGC GCACTATGGA CATTGGCGGC GACAAACCCG TTGATTATCT GAACATTCCC
GCAGAGGCAA ACCCGTTCCT CGGTTATCGC GCCGTGCGTA TTTATGAAGA GTACGCGTCG
TTGTTTACCA CGCAGCTACG GTCGATCCTC CGCGCCTCCG CTCACGGCAG CCTGAAAATC
ATGATCCCGA TGATCTCCTC AATGGAAGAG ATCTTATGGG TGAAAGAAAA ACTGGCGGAA
GCCAAACAGC AACTACGTAA CGAACACATT CCGTTTGATG AGAAAATCCA GCTCGGCATC
ATGCTGGAAG TGCCGTCGGT GATGTTCATC ATCGATCAAT GCTGCGAAGA GATTGATTTC
TTTAGTATTG GTAGTAATGA CCTGACGCAG TATCTGCTGG CGGTGGATCG CGATAACGCT
AAGGTTACTC GTCACTACAA CAGCCTGAAT CCGGCATTCT TGCGGGCGCT CGATTACGCC
GTGCAAGCGG TGCATCGCCA GGGCAAATGG ATTGGTCTGT GCGGTGAGCT GGGAGCGAAA
GGTTCCGTGC TGCCGTTGCT GGTCGGCTTA GGGCTGGATG AACTCAGCAT GAGCGCACCA
TCAATTCCGG CGGCGAAAGC TCGGATGGCG CAACTTGATA GCCGTGAGTG CCGCAAGTTG
CTCAACCAGG CAATGGCCTG CCGTACTTCG CTGGAAGTAG AACACCTGCT GGCGCAATTC
CGCATGACCC AACAAGACGC ACCGCTGGTC ACCGCCGAGT GCATCACACT GGAAAGCGAC
TGGCGCAGCA AAGAAGAAGT GCTCAAAGGC ATGACCGATA ACCTGCTGCT GGCGGGCCGC
TGCCGCTATC CGCGTAAACT GGAAGCCGAC TTGTGGGCGC GCGAGGCCGT TTTCTCTACC
GGTCTGGGCT TTAGTTTTGC CATTCCACAC AGCAAATCAG AACACATTGA GCAATCCACC
ATCAGCGTGG CGCGTCTGCA AGCGCCGGTG CGCTGGGGCG ATGATGAAGC GCAATTCATC
ATTATGTTAA CCCTGAACAA ACACGCTGCG GGCGATCAGC ATATGCGCAT TTTCTCGCGC
CTCGCTCGCC GCATCATGCA CGAAGAATTC CGTAACGCGC TGGTTAACGC CGCCTCTGCC
GACGCTATCG CCAGCCTGCT GCAACATGAA CTGGAACTGT AA
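
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. As an illustration, here is a minimal sketch of how a GC percentage like the one in the Gene Information fields could be computed from such text. The function name is hypothetical, and how this record's own GC value was computed or rounded is not stated.

```python
def gc_percent(sequence_text):
    """Return the percentage of G and C bases in a block-formatted sequence.

    Whitespace (spaces and newlines between blocks) is ignored.
    """
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage on a small made-up fragment (not taken from this record):
print(gc_percent("GGCC AATT"))  # prints 50.0
```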
 
Protein sequence
MALIVEFICE LPNGVHARPA SHVETLCNTF SSQIEWHNLR TDRKGNAKSA LALIGTDTLA 
GDNCQLLISG ADEQEAHQRL SQWLRDEFPH CDAPLAEVKS DELEPLPVSL TNLNPQIIRA
RTVCSGSAGG ILTPISSLDL NALGNLPAAK GVDAEQSALE NGLTLVLKNI EFRLLDSDGA
TSAILEAHRS LAGDTSLREH LLAGVSAGLS CAEAIVASAN HFCEEFSRSS SSYLQERALD
VRDVCFQLLQ QIYGEQRFPA PGKLTQPAIC MADELTPSQF LELDKNHLKG LLLKSGGTTS
HTVILARSFN IPTLVGVDID ALTPWQQQTI YIDGNAGAIV VEPGEAVARY YQQEARVQDA
LREQQRVWLT QQARTADGIR IEIAANIAHS VEAQAAFGNG AEGVGLFRTE MLYMDRTSAP
GESELYNIFC QALESANGRS IIVRTMDIGG DKPVDYLNIP AEANPFLGYR AVRIYEEYAS
LFTTQLRSIL RASAHGSLKI MIPMISSMEE ILWVKEKLAE AKQQLRNEHI PFDEKIQLGI
MLEVPSVMFI IDQCCEEIDF FSIGSNDLTQ YLLAVDRDNA KVTRHYNSLN PAFLRALDYA
VQAVHRQGKW IGLCGELGAK GSVLPLLVGL GLDELSMSAP SIPAAKARMA QLDSRECRKL
LNQAMACRTS LEVEHLLAQF RMTQQDAPLV TAECITLESD WRSKEEVLKG MTDNLLLAGR
CRYPRKLEAD LWAREAVFST GLGFSFAIPH SKSEHIEQST ISVARLQAPV RWGDDEAQFI
IMLTLNKHAA GDQHMRIFSR LARRIMHEEF RNALVNAASA DAIASLLQHE LEL
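
The protein sequence is related to the gene sequence by codon translation. The sketch below is a minimal illustration that uses the standard codon assignments, which translation table 11 shares for internal codons; table 11 differs in which codons may act as start codons, and that is not modeled here. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate_cds(sequence_text):
    """Translate a block-formatted coding sequence into one-letter amino acids.

    Whitespace is ignored, a single trailing stop codon is dropped, and an
    internal stop codon raises ValueError.
    """
    bases = "".join(sequence_text.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)
    )
    if protein.endswith("*"):
        protein = protein[:-1]          # drop the terminal stop codon
    if "*" in protein:
        raise ValueError("internal stop codon found")
    return protein


# First three blocks of the gene sequence in this record:
print(translate_cds("ATGGCCCTGA TTGTGGAATT TATTTGTGAG"))  # prints MALIVEFICE
```

The output matches the first block of the protein sequence above.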