Gene Information |
Locus tag | Athe_1833 |
Symbol | |
ID | 7408947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1908595 |
End bp | 1909605 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716210 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_002573699 |
Protein GI | 222529817 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
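
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def inclusive_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Expected protein length, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = inclusive_span(1908595, 1909605)
protein_aa = protein_length_from_cds(1011)
print(span, protein_aa)
```

Under these assumptions, the computed span should agree with the Gene Length field and the computed protein length with the Protein Length field.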
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGTAC TTGGGATTGA AACTTCATGC GATGAAACTT CTGCTGCAAT TGTAGAAGAT GGAAGAAAGA TTCTGTCAAA TGTAATATAC TCCCAGATAG ATATTCATTA TCAATTTGGT GGTGTTGTCC CAGAGATAGC TTCTCGCAAA CATGTTGAAA AAATTTCATA CGTTGTTGAT ATGGCGTTCA AACAGGCAGG TTTGACTATA GACGACATTG ATGGTATTGC TGCAACGTAC GGACCTGGTC TTGTAGGTTC GCTTCTTGTT GGACTTTCTT TTGCAAAGGC ACTAAGCTAT GCAAAAAGGC TTCCGTTTGT TGCTGTGAAT CACATTGAAG GGCATATTTA TGCAAATTTT ATAACATACC CTCAGCTTAC ACCACCACTG ATTGTTCTGG TGGTCTCTGG AGGGCATACT AATCTAATTA TTTTAAAAGA TTTTGAAGAG TATGAGGTGG TTGGTAAAAC TAGAGATGAT GCGGCAGGCG AGGCTTTTGA TAAGATTGCA AGATACTTAG GGCTTGGATA TCCTGGCGGT CCTGCCATAG ACAAAATCGC AAAGCAAGGT GATGAGGACA AATACAAATA TCCTGTTGCT GATGTTGGTG GATACAATTT TAGCTTCAGC GGATTAAAAT CTGCCGTGAT AAACCATGTT CATGGGCTTT GGCAGAGGGG CGAAGAATTT AAAATAGAAG ATGTTGCTGC GTCATTTCAA AAAACAGTTG TGAGCATACT TGTTGAAAAA ACAATCAATC TATCACTTGA GACAAATATA AGAAAAATAG CAGTTGCCGG TGGCGTTGCT GCAAACTCAA AGCTAAGAAG TGAATTTTAT AAAAAATGTG CCGAGCACAA TATTGAATTT TTTGTGCCGG AATTTAAGTA CTGTACAGAC AATGCAGCTA TGATTGCTTC GTGTGGATAC TTTAAGCTGC AAAAAGGAAT AGTATCATCT TACCGTGAAA ATGCTGTCCC TTATATAAAT CTTGTAAGCA AAAAAAGTTA A
|
Protein sequence | MLVLGIETSC DETSAAIVED GRKILSNVIY SQIDIHYQFG GVVPEIASRK HVEKISYVVD MAFKQAGLTI DDIDGIAATY GPGLVGSLLV GLSFAKALSY AKRLPFVAVN HIEGHIYANF ITYPQLTPPL IVLVVSGGHT NLIILKDFEE YEVVGKTRDD AAGEAFDKIA RYLGLGYPGG PAIDKIAKQG DEDKYKYPVA DVGGYNFSFS GLKSAVINHV HGLWQRGEEF KIEDVAASFQ KTVVSILVEK TINLSLETNI RKIAVAGGVA ANSKLRSEFY KKCAEHNIEF FVPEFKYCTD NAAMIASCGY FKLQKGIVSS YRENAVPYIN LVSKKS
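
Both sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first two blocks of the gene sequence are used here as sample input.

```python
def clean_sequence(displayed: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(displayed.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (spaces are ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
sample = "ATGCTTGTAC TTGGGATTGA"
print(clean_sequence(sample))
print(gc_fraction(sample))
```

Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field in the record.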