Gene Information |
Locus tag | EcDH1_4035 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4368178 |
End bp | 4370475 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | pyruvate formate-lyase |
Protein accession | ACX41635 |
Protein GI | 260451213 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
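The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for locus tag EcDH1_4035.
length = gene_length_bp(4368178, 4370475)
print(length, protein_length_aa(length))  # prints: 2298 765
```

Both results match the Gene Length (2298 bp) and Protein Length (765 aa) fields listed in the table.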
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATC GTATCTCTCG CCTCAAAACT GCACTGTTTG CCAATACCCG TGAAATCTCG CTGGAGCGGG CGCTGCTTTA TACCGCCAGC CATCGGCAAA CCGAAGGCGA ACCGGTGATA TTGCGCCGGG CGAAAGCAAC AGCGTATATC CTTGAACATG TTGAAATTTC GATTCGTGAT GAAGAACTGA TTGCCGGTAA CCGCACCGTA AAACCGCGCG CCGGGATTAT GTCGCCGGAA ATGGACCCTT ACTGGCTGCT GAAAGAGCTG GATCAATTCC CGACGCGTCC GCAGGACCGC TTTGCTATCA GCGAAGAAGA TAAACGTATC TACCGTGAAG AGTTGTTCCC GTACTGGGAA AAACGTTCGA TGAAAGATTT CATCAACGGG CAGATGACAG ATGAAGTAAA AGCCGCGACC AACACGCAGA TTTTCAGCAT CAACCAGACG GATAAAGGCC AGGGGCACAT TATTATTGAT TACCCACGCC TGTTGAATCA CGGGCTGGGT GAGCTGGTGG CACAGATGCA GCAACATTGT CAGCAACAGC CGGAGAATCA CTTTTATCAG GCCGCGTTGT TACTGCTGGA AGCCTCGCAG AAACACATTT TGCGTTACGC CGAACTGGCG GAAACGATGG CGGCAAACTG CACAGATGCC CAGCGTCGCG AAGAGCTGCT GACTATTGCA GAGATCTCCC GCCATAACGC GCAACATAAG CCGCAGACGT TCTGGCAGGC GTGCCAGTTA TTCTGGTACA TGAACATCAT TCTGCAATAC GAATCCAACG CCAGTTCGCT ATCGTTGGGG CGCTTCGACC AGTATATGTT GCCGTTCTAT CAGACATCAT TAACCCAGGG CGAAGATGCG GCGTTCCTGA AAGAACTGCT CGAATCTTTA TGGGTGAAAT GCAACGACAT CGTGCTGTTG CGCTCCACCA GTAGCGCGCG TTATTTCGCC GGTTTCCCGA CCGGCTATAC CGCACTGCTC GGCGGGTTAA CCGAGAACGG ACGTAGCGCG GTGAACGTGC TTTCGTTCCT TTGCCTTGAC GCCTATCAAA GCGTGCAATT ACCGCAACCG AACCTCGGCG TGCGCACTAA CGCCTTGATC GACACGCCGT TCCTGATGAA AACCGCCGAA ACCATTCGCT TCGGTACCGG TATTCCGCAA ATCTTTAACG ATGAAGTGGT GGTGCCAGCG TTCCTCAACC GTGGCGTTTC GCTGGAAGAT GCGCGCGACT ATTCCGTAGT GGGCTGTGTG GAATTATCTA TTCCCGGCAG AACCTACGGC TTGCATGACA TCGCGATGTT TAATCTGCTG AAAGTGATGG AAATCTGCCT GCATGAAAAT GAAGGCAATG CTGCGCTGAC TTATGAAGGT TTACTGGAGC AGATCCGCGC CAAGATCAGC CACTACATCA CCCTGATGGT TGAGGGCAGC AATATTTGTG ATATCGGCCA TCGCGACTGG GCACCTGTAC CGCTGCTCTC ATCGTTTATC AGCGATTGTC TGGAAAAAGG CCGCGATATT ACCGATGGCG GCGCGCGTTA TAACTTCTCC GGCGTACAGG GGATCGGTAT CGCCAACCTG AGCGATTCTC TCCATGCGTT GAAAGGGATG GTTTTTGAGC AACAGCGTTT AAGTTTTGAC GAATTGCTGT CGGTATTAAA AGCCAACTTC GCAACGCCAG AAGGCGAAAA AGTCCGCGCT CGCTTAATTA ACCGCTTCGA GAAATACGGT AACGATATCG ACGAGGTGGA TAACATCAGC GCCGAACTGT TGCGCCACTA CTGCAAAGAA 
GTGGAAAAAT ACCAGAACCC GCGCGGCGGC TACTTCACGC CGGGATCGTA TACCGTTTCT GCTCACGTCC CGTTGGGATC GGTGGTTGGC GCGACGCCAG ACGGTCGTTT TGCCGGAGAA CAGCTGGCAG ACGGCGGCTT GTCACCCATG CTGGGCCAGG ACGCACAAGG GCCAACAGCG GTACTGAAGT CAGTCAGTAA GCTCGATAAC ACGCTGCTGT CTAACGGTAC GTTGCTGAAC GTGAAATTCA CTCCGGCGAC CCTGGAAGGT GAAGCAGGAT TACGCAAACT GGCCGACTTC TTACGGGCGT TTACCCAGCT TAAGTTACAA CATATTCAGT TTAACGTGGT GAACGCCGAC ACGTTGCGGG AAGCGCAACA GCGCCCACAA GATTATGCCG GGCTGGTGGT GCGCGTTGCC GGATACAGCG CCTTCTTTGT CGAACTGTCG AAGGAGATCC AGGATGACAT CATCCGCCGG ACAGCGCATC AGCTGTAA
|
Protein sequence | MTNRISRLKT ALFANTREIS LERALLYTAS HRQTEGEPVI LRRAKATAYI LEHVEISIRD EELIAGNRTV KPRAGIMSPE MDPYWLLKEL DQFPTRPQDR FAISEEDKRI YREELFPYWE KRSMKDFING QMTDEVKAAT NTQIFSINQT DKGQGHIIID YPRLLNHGLG ELVAQMQQHC QQQPENHFYQ AALLLLEASQ KHILRYAELA ETMAANCTDA QRREELLTIA EISRHNAQHK PQTFWQACQL FWYMNIILQY ESNASSLSLG RFDQYMLPFY QTSLTQGEDA AFLKELLESL WVKCNDIVLL RSTSSARYFA GFPTGYTALL GGLTENGRSA VNVLSFLCLD AYQSVQLPQP NLGVRTNALI DTPFLMKTAE TIRFGTGIPQ IFNDEVVVPA FLNRGVSLED ARDYSVVGCV ELSIPGRTYG LHDIAMFNLL KVMEICLHEN EGNAALTYEG LLEQIRAKIS HYITLMVEGS NICDIGHRDW APVPLLSSFI SDCLEKGRDI TDGGARYNFS GVQGIGIANL SDSLHALKGM VFEQQRLSFD ELLSVLKANF ATPEGEKVRA RLINRFEKYG NDIDEVDNIS AELLRHYCKE VEKYQNPRGG YFTPGSYTVS AHVPLGSVVG ATPDGRFAGE QLADGGLSPM LGQDAQGPTA VLKSVSKLDN TLLSNGTLLN VKFTPATLEG EAGLRKLADF LRAFTQLKLQ HIQFNVVNAD TLREAQQRPQ DYAGLVVRVA GYSAFFVELS KEIQDDIIRR TAHQL
|
| |
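The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The sketch below, with hypothetical helper names, strips the whitespace, computes a GC percentage for comparison with the GC content field, and builds a reverse complement. Because the gene lies on the minus strand and the listed sequence starts with ATG, the listing presumably shows the coding strand, so the reverse complement would correspond to the forward strand of the replicon; that reading is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Translation table for complementing unambiguous DNA bases only.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a cleaned DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two blocks of the gene sequence, used here only as a short demo.
fragment = clean_sequence("ATGACGAATC GTATCTCTCG")
print(fragment, len(fragment))
print(reverse_complement(fragment))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_percent` gives a value that can be compared with the GC content listed in the record.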