Gene Athe_1053 details


Gene Information

Locus tag: Athe_1053
Symbol:
ID: 7409610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1147554
End bp: 1148510
Gene Length: 957 bp
Protein Length: 318 aa
Translation table: 11
GC content: 35%
IMG OID: 643715419
Product: phosphoesterase RecJ domain protein
Protein accession: YP_002572927
Protein GI: 222529045
COG category: [R] General function prediction only
COG ID: [COG0618] Exopolyphosphatase-related proteins
TIGRFAM ID:
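
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1147554
end_bp = 1148510
gene_length_bp = 957
protein_length_aa = 318

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop
# codon, which does not contribute a residue to the protein.
codon_count = span_bp // 3
residue_count = codon_count - 1

print(span_bp, codon_count, residue_count)  # prints: 957 319 318
```

Under these assumptions the span matches the reported gene length of 957 bp, and 319 codons minus one stop codon gives the reported 318 aa.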


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000135026
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGATAGAGA GTAAGATTAT TCAGCAGCTT TTAGAATCAA ATTCGATTGC TATTGTATCG 
CACGAGAATC CTGATGGGGA TTGTATCGGC TCAATGCTTG CGCTTTATAT GGCACTTAAA
AGAAAAGGTA AAAATGCAAG AATGTTCTTG AAAAATAATG TTCCAAAGAA TTTGAGGTTT
TTGCCTGCAG CAGAAAAAAT AGAGGTGGTA GACAGAATTG ACGAAAATTT TGATGTTCTT
GTCCTGCTTG ACACAGGTGA GCTTGAGAGG ACGGGAATTG AAAACATTGA AAATTGTTAT
TCAAAGCTAA TAAATATAGA CCACCATGTG ACAAGCGAAG GGATAGGAGA TCTGTTTTAT
ATAAATTCTT CCTCTGCTGC AACAGGTGAA ATTATATACC AGATTGTCAA ACTTATGGGG
ATTGATAATG ATAAAGAAAT TGCAACCTGT CTTTACACAA GTATTTTTAC CGACACAGGA
GGATTTAAAT ATTCAAACAC TACTTCAATA ACCCATCAGA TTGCAGGTGA TTTAATAAAC
ACTGGAATTG ACTTTGTGTA TATTATCAAC AAGGTATTTG ATGAGATGAG CCTTTCAAAG
TTTAATCTTT TGAAAGATGT TTTGCAAACA TTAGAACTTT TTGAGGGAAA CAAGATTGCT
TTTTTGACAG TGACAAAAGA GATGCTGAAG AAAAATGGTG CCTCACGAGA TGAGACAGAA
AACATTATAA ATTTTGCAAG AAACATTGAA GATGTTGAAA TAGCTGCAAT ATTTATTGAA
GAAGAAGACA AAATAAAGGT GAGTCTGAGG TCGAAATACT ATATTGACTG TGCCCAGATT
GCTAAGGAAT TTGGTGGAGG AGGGCATTTG AGGGCAGCTG GTTTTTCAAG TAGAAACGTT
TCTTTAGCTG CTGTAAAGGA AAACCTACTA AAAAGATTAA AAAGTGATCT GAGATGA
 
Protein sequence
MIESKIIQQL LESNSIAIVS HENPDGDCIG SMLALYMALK RKGKNARMFL KNNVPKNLRF 
LPAAEKIEVV DRIDENFDVL VLLDTGELER TGIENIENCY SKLINIDHHV TSEGIGDLFY
INSSSAATGE IIYQIVKLMG IDNDKEIATC LYTSIFTDTG GFKYSNTTSI THQIAGDLIN
TGIDFVYIIN KVFDEMSLSK FNLLKDVLQT LELFEGNKIA FLTVTKEMLK KNGASRDETE
NIINFARNIE DVEIAAIFIE EEDKIKVSLR SKYYIDCAQI AKEFGGGGHL RAAGFSSRNV
SLAAVKENLL KRLKSDLR
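
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. This is a minimal example, not the method used to produce the record. It assumes the standard amino acid assignments (which translation table 11 shares with the standard code) and treats only ATG, GTG and TTG as start codons, omitting the rarer initiators that table 11 also lists. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the amino acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplification: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as Met."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"                    # alternative start codons still encode Met
        if aa == "*":
            break                       # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence in this record.
first_line = "TTGATAGAGA GTAAGATTAT TCAGCAGCTT TTAGAATCAA ATTCGATTGC TATTGTATCG"
print(translate_cds(first_line))  # prints: MIESKIIQQLLESNSIAIVS
```

The gene begins with TTG, which would read as leucine inside a gene but is translated as methionine at the start, which is why the protein sequence begins with M.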