Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0443 |
Symbol | |
ID | 7407520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 503893 |
End bp | 505059 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643714830 |
Product | amidohydrolase |
Protein accession | YP_002572348 |
Protein GI | 222528466 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACATCA TAATAAAAAA TGCAAAAATT TACACAATGG ATGAAAAAGG AATTATTGAA AAAGGCGATA TACTTATCAA GGATGGCAAG ATAGCCAAGA TAGACCAAAA CATAAATGAA GATAGTAGCA TGGTAATAGA TGCAACAGGC AGGCTTGTCT TTCCAGGCTT TATAGATGCA CACTCACACA TAGGGATGTG GGAAGACTCT GTCGGGTTTG AAGGTGCCGA TGGGAACGAA GACTCAGACC CTGTCACGCC ACACTTAAGA GCAATTGATG CTATAAATCC ATTTGACAGA AGTTTTGAAG AGGCAATTGA AGGTGGCGTT ACATGTGTTG CAACAGGACC GGGAAGCGCT AACGTGATAG GCGGGCAGTT TTGTGTCATC AAGACGTTTG GCAAGAGAGT TGACAAGATG GTTGTGAAAG AACCTGCTGC AATGAAGGTT GCTTTCGGTG AAAATCCCAA AAGCGTTTAT CACGAAAAAC ACCAGATGCC TCAAACACGC ATGGCAACTG CTGCAATCTT GAGAGAGGCA CTTTTTAAGG CAAGAGAGTA CTTAAATAAA AAGCTTGAGG CTCAGCAGGA TGAGGAAAAA GATATGCCAG AGTTTGATAT GAAAAGTGAA AGCCTTATAA AGGTTTTAAC AAAAGAGATT CCGCTGAAAG CACATGCTCA CAGAGCAGAT GACATATTCA CGGCAATAAG GATTGCCAAA GAATTTGATG TAAATATTAC TCTTGATCAT GTGACAGACG GATATTTGAT TGTGGATGAG CTAAAACAAG AAAATATCCC ATGCATTGTT GGACCAAACC TTACTGATAG ATCAAAGGTT GAGCTTAAAA ACCTTGATTT TAAAAATCCA GGTATACTTT CTAAAGAAGG CATTCTTGTT GCTATTATGA CTGACCATCC TGTCATTCCG CAAAAATATC TTGTGCTATG TAGCGCGCTT GCGTGCAAGA GCGGAATGGA TGAGATAGAG GCTCTAAAAG CAATTACTAT AAACCCTGCA AAGATTTTAG GAATTGATAA CAGAGTTGGG AGTATAAAGG AAGGTAAAGA TGCTGATATT GTAATATACA AGGGTCATCC TTTTGACATA TTTTCTGAGG TTGAATATGT CTTGATTGAT GGTAGAGTTG TATATCATCG CAAATAA
|
Protein sequence | MDIIIKNAKI YTMDEKGIIE KGDILIKDGK IAKIDQNINE DSSMVIDATG RLVFPGFIDA HSHIGMWEDS VGFEGADGNE DSDPVTPHLR AIDAINPFDR SFEEAIEGGV TCVATGPGSA NVIGGQFCVI KTFGKRVDKM VVKEPAAMKV AFGENPKSVY HEKHQMPQTR MATAAILREA LFKAREYLNK KLEAQQDEEK DMPEFDMKSE SLIKVLTKEI PLKAHAHRAD DIFTAIRIAK EFDVNITLDH VTDGYLIVDE LKQENIPCIV GPNLTDRSKV ELKNLDFKNP GILSKEGILV AIMTDHPVIP QKYLVLCSAL ACKSGMDEIE ALKAITINPA KILGIDNRVG SIKEGKDADI VIYKGHPFDI FSEVEYVLID GRVVYHRK
|
| |