Gene Athe_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1833 
Symbol 
ID7408947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1908595 
End bp1909605 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content38% 
IMG OID643716210 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_002573699 
Protein GI222529817 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGTAC TTGGGATTGA AACTTCATGC GATGAAACTT CTGCTGCAAT TGTAGAAGAT 
GGAAGAAAGA TTCTGTCAAA TGTAATATAC TCCCAGATAG ATATTCATTA TCAATTTGGT
GGTGTTGTCC CAGAGATAGC TTCTCGCAAA CATGTTGAAA AAATTTCATA CGTTGTTGAT
ATGGCGTTCA AACAGGCAGG TTTGACTATA GACGACATTG ATGGTATTGC TGCAACGTAC
GGACCTGGTC TTGTAGGTTC GCTTCTTGTT GGACTTTCTT TTGCAAAGGC ACTAAGCTAT
GCAAAAAGGC TTCCGTTTGT TGCTGTGAAT CACATTGAAG GGCATATTTA TGCAAATTTT
ATAACATACC CTCAGCTTAC ACCACCACTG ATTGTTCTGG TGGTCTCTGG AGGGCATACT
AATCTAATTA TTTTAAAAGA TTTTGAAGAG TATGAGGTGG TTGGTAAAAC TAGAGATGAT
GCGGCAGGCG AGGCTTTTGA TAAGATTGCA AGATACTTAG GGCTTGGATA TCCTGGCGGT
CCTGCCATAG ACAAAATCGC AAAGCAAGGT GATGAGGACA AATACAAATA TCCTGTTGCT
GATGTTGGTG GATACAATTT TAGCTTCAGC GGATTAAAAT CTGCCGTGAT AAACCATGTT
CATGGGCTTT GGCAGAGGGG CGAAGAATTT AAAATAGAAG ATGTTGCTGC GTCATTTCAA
AAAACAGTTG TGAGCATACT TGTTGAAAAA ACAATCAATC TATCACTTGA GACAAATATA
AGAAAAATAG CAGTTGCCGG TGGCGTTGCT GCAAACTCAA AGCTAAGAAG TGAATTTTAT
AAAAAATGTG CCGAGCACAA TATTGAATTT TTTGTGCCGG AATTTAAGTA CTGTACAGAC
AATGCAGCTA TGATTGCTTC GTGTGGATAC TTTAAGCTGC AAAAAGGAAT AGTATCATCT
TACCGTGAAA ATGCTGTCCC TTATATAAAT CTTGTAAGCA AAAAAAGTTA A
 
Protein sequence
MLVLGIETSC DETSAAIVED GRKILSNVIY SQIDIHYQFG GVVPEIASRK HVEKISYVVD 
MAFKQAGLTI DDIDGIAATY GPGLVGSLLV GLSFAKALSY AKRLPFVAVN HIEGHIYANF
ITYPQLTPPL IVLVVSGGHT NLIILKDFEE YEVVGKTRDD AAGEAFDKIA RYLGLGYPGG
PAIDKIAKQG DEDKYKYPVA DVGGYNFSFS GLKSAVINHV HGLWQRGEEF KIEDVAASFQ
KTVVSILVEK TINLSLETNI RKIAVAGGVA ANSKLRSEFY KKCAEHNIEF FVPEFKYCTD
NAAMIASCGY FKLQKGIVSS YRENAVPYIN LVSKKS