Gene EcDH1_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4035 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4368178 
End bp4370475 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content53% 
IMG OID 
Productpyruvate formate-lyase 
Protein accessionACX41635 
Protein GI260451213 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG 
CTGGAGCGGG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA
TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACATG TTGAAATTTC GATTCGTGAT
GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCGCGCG CCGGGATTAT GTCGCCGGAA
ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GATCAATTCC CGACGCGTCC GCAGGACCGC
TTTGCTATCA GCGAAGAAGA TAAACGTATC TACCGTGAAG AGTTGTTCCC GTACTGGGAA
AAACGTTCGA TGAAAGATTT CATCAACGGG CAGATGACAG ATGAAGTAAA AGCCGCGACC
AACACGCAGA TTTTCAGCAT CAACCAGACG GATAAAGGCC AGGGGCACAT TATTATTGAT
TACCCACGCC TGTTGAATCA CGGGCTGGGT GAGCTGGTGG CACAGATGCA GCAACATTGT
CAGCAACAGC CGGAGAATCA CTTTTATCAG GCCGCGTTGT TACTGCTGGA AGCCTCGCAG
AAACACATTT TGCGTTACGC CGAACTGGCG GAAACGATGG CGGCAAACTG CACAGATGCC
CAGCGTCGCG AAGAGCTGCT GACTATTGCA GAGATCTCCC GCCATAACGC GCAACATAAG
CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC
GAATCCAACG CCAGTTCGCT ATCGTTGGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT
CAGACATCAT TAACCCAGGG CGAAGATGCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA
TGGGTGAAAT GCAACGACAT CGTGCTGTTG CGCTCCACCA GTAGCGCGCG TTATTTCGCC
GGTTTCCCGA CCGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG
GTGAACGTGC TTTCGTTCCT TTGCCTTGAC GCCTATCAAA GCGTGCAATT ACCGCAACCG
AACCTCGGCG TGCGCACTAA CGCCTTGATC GACACGCCGT TCCTGATGAA AACCGCCGAA
ACCATTCGCT TCGGTACCGG TATTCCGCAA ATCTTTAACG ATGAAGTGGT GGTGCCAGCG
TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCGCGCGACT ATTCCGTAGT GGGCTGTGTG
GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAATCTGCTG
AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CTGCGCTGAC TTATGAAGGT
TTACTGGAGC AGATCCGCGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAGGGCAGC
AATATTTGTG ATATCGGCCA TCGCGACTGG GCACCTGTAC CGCTGCTCTC ATCGTTTATC
AGCGATTGTC TGGAAAAAGG CCGCGATATT ACCGATGGCG GCGCGCGTTA TAACTTCTCC
GGCGTACAGG GGATCGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGGATG
GTTTTTGAGC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTC
GCAACGCCAG AAGGCGAAAA AGTCCGCGCT CGCTTAATTA ACCGCTTCGA GAAATACGGT
AACGATATCG ACGAGGTGGA TAACATCAGC GCCGAACTGT TGCGCCACTA CTGCAAAGAA
GTGGAAAAAT ACCAGAACCC GCGCGGCGGC TACTTCACGC CGGGATCGTA TACCGTTTCT
GCTCACGTCC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA
CAGCTGGCAG ACGGCGGCTT GTCACCCATG CTGGGCCAGG ACGCACAAGG GCCAACAGCG
GTACTGAAGT CAGTCAGTAA GCTCGATAAC ACGCTGCTGT CTAACGGTAC GTTGCTGAAC
GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCAGGAT TACGCAAACT GGCCGACTTC
TTACGGGCGT TTACCCAGCT TAAGTTACAA CATATTCAGT TTAACGTGGT GAACGCCGAC
ACGTTGCGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC
GGATACAGCG CCTTCTTTGT CGAACTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG
ACAGCGCATC AGCTGTAA
 
Protein sequence
MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD 
EELIAGNRTV KPRAGIMSPE MDPYWLLKEL DQFPTRPQDR FAISEEDKRI YREELFPYWE
KRSMKDFING QMTDEVKAAT NTQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC
QQQPENHFYQ AALLLLEASQ KHILRYAELA ETMAANCTDA QRREELLTIA EISRHNAQHK
PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QTSLTQGEDA AFLKELLESL
WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP
NLGVRTNALI DTPFLMKTAE TIRFGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV
ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS
NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM
VFEQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVDNIS AELLRHYCKE
VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDAQGPTA
VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD
TLREAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL