Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0728 |
Symbol | |
ID | 8415018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 915671 |
End bp | 917245 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645023699 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_003181096 |
Protein GI | 257790490 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC CTAAGGTCAA GCGCGTGCTC GTGTCCGTTA CGGACAAGAG CGGCGTTGCG GACTTCGCCC GCGCGCTCGT CGACGAGTTC GGGGCGGAAA TCATCTCGAC GGGCGGCACC GCCCGCGCGC TCAAGGACGC CGGCGTGCCG GTTACGCCCA TCGACGACGT GACGCAGTTC CCCGAGATGA TGGACGGTCG CGTGAAGACG CTGCACCCGC GCGTGCACGG CGGTCTGCTG GCCAAGCGCG ACAACGAGGC CCACATGGCC CAAGCGGCCG AGCACGGCAT CGAGATGATC GACATGGTGG TGGTGAACCT CTACGCCTTC GAGAAGACGG TGGAAAGCGG CGCCGATTTC GGCACCTGCA TCGAGAACAT CGACATCGGC GGCCCGTCCA TGCTGCGCTC CGCGGCCAAG AACTTCGAGA GCGTGGCCGT GGTCACGCGC CCGGCGAGCT ACGACGCTAT CCTGGCCGAG ATGCGCGCCA ACGACGGCGC CACCCTGCGC GACACGCGCG CCAAGCTGGC GCTCGACGTG TTCGAGACCA CGGCGGCTTA CGACGGCGCC ATCGCCGCGT GGATGGGCGC CCAGCTCAAG GACGAGGGCG ACGTGAAGTT CCCCGCCGAC CGCACGCTGC ATCTCTCGAA GGTGCAAGAC CTGCGCTACG GCGAGAACCC GCACCAGTCC GCCGCGTTCT ACCGTCGCGA CGACTACGCC GACGCCCCGC ACAGCCTGGC CCATGCCAAG CAGCATCAGG GCAAGGAGCT GTCGTACAAC AACTACCTCG ACCTCGACGC GGCTTGGACG GCCGTTCGCG AGTTCGACGA GCCGGCCTGC GTCATCGTCA AGCACCTCAC GCCCTGCGGC GTGTGCCAGA ACGACGACCT CGTCGAGGCC TACCAGCGCG CGCACGCGTG CGACCCGGTG AGCGCCTACG GCGGCGTCAT GGCGTTCAAC CGCCCCGTCA CCTCCGACGT GGTGGTGGCC ATCTTCGACA ACAAGCAGTT CGTCGAGGCC ATCATCGCTC CCGAGTTCGC GGGCGACGCG CTTGACATGT ACAGCGCGAA GAAGAACGCG CGCCTGCTGT CCACGGGCGG CGTGAACCCG GCCGGCGGGG AAGTGGAGTA CCGCTCGGTC GAGGGCGGCC TGCTGGCCCA GGATTCCGAC GCCGTGGCCG AGGATCCCGC GACGTTCACG GTTCCCACGA AGCGCCAGCC CAGCGAGGAA GAGCTCGCCG AGCTGCTGTT CGCGTGGAAG GTGTGCAAGT CCATCAAGTC CAACGCCATC GCCATCACGA AGGGCCACGC GACCATCGGC GTGGGCGGCG GCCAGCCGAA CCGCGTGAAC TCCGCGCGCA TTGCCGTGGA GCAGGCGGGC GAGGAGGCCA AGGGCGCCGT GGCCGCCTCC GACGCGTTCT TCCCGTTCCG CGACGGCCTC GACGCGCTGG CCGAGGCCGG CGTGACGGCC ATCATCGAGC CGGGCGGCTC CATCCGTGAC GAAGAGGTGA TCGCCGCCGC CGACGAGCAC GGCATCGCGC TCGTCTTCAC CGGCCACCGC CACTTCAGGC ACTAG
|
Protein sequence | MSNPKVKRVL VSVTDKSGVA DFARALVDEF GAEIISTGGT ARALKDAGVP VTPIDDVTQF PEMMDGRVKT LHPRVHGGLL AKRDNEAHMA QAAEHGIEMI DMVVVNLYAF EKTVESGADF GTCIENIDIG GPSMLRSAAK NFESVAVVTR PASYDAILAE MRANDGATLR DTRAKLALDV FETTAAYDGA IAAWMGAQLK DEGDVKFPAD RTLHLSKVQD LRYGENPHQS AAFYRRDDYA DAPHSLAHAK QHQGKELSYN NYLDLDAAWT AVREFDEPAC VIVKHLTPCG VCQNDDLVEA YQRAHACDPV SAYGGVMAFN RPVTSDVVVA IFDNKQFVEA IIAPEFAGDA LDMYSAKKNA RLLSTGGVNP AGGEVEYRSV EGGLLAQDSD AVAEDPATFT VPTKRQPSEE ELAELLFAWK VCKSIKSNAI AITKGHATIG VGGGQPNRVN SARIAVEQAG EEAKGAVAAS DAFFPFRDGL DALAEAGVTA IIEPGGSIRD EEVIAAADEH GIALVFTGHR HFRH
|
| |