Gene Athe_0711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0711 
Symbol 
ID7407135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp799595 
End bp800599 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content38% 
IMG OID643715083 
Productdeoxyguanosinetriphosphate triphosphohydrolase-like protein 
Protein accessionYP_002572599 
Protein GI222528717 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0232] dGTP triphosphohydrolase 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTTAAAA TAAGAGAGTA TCAAGAAAAT TTAGAAGAAA AGATTCTTTC TCCCTATGCA 
ATGCTTTCAA AAAATACAAA AGGAAGACAA AAGCAAGAAC AGAAATGCGA TGTGAGAACA
GAGTTTCAAA GAGATAGAGA TAGAATAATA CATTCCAAGT CTTTCAGGAG GCTAAAACAC
AAAACCCAGG TGTTCATATC TCCAGAAGGT GACCATTACA GAACAAGGCT CACACACGCT
TTGGAGGTTG CACAAATTGC AAGGACAATT GCAAGAGCTC TGAGGCTCAA CGAAGATTTG
ACAGAGGCAA TAGCACTTGG TCATGATTTG GGTCACACAC CCTTTGGCCA TGCAGGTGAG
GATATCTTAA ATAAAATAAC CACAACTGGA TTTTCACACA ACGTTCAGAG CTTGCGGGTT
GTTGATTTTT TGGAAGGTGA AGATGGTCTT AACCTCACGT TTGAGGTCAG AGACGGGATT
TTGAATCATG TGTGGGGAAG GACACCTGCC ACTTTGGAGG GCAGAGTTGT CCAGTTTGCA
GACAGGATTG CATACATCAA CCATGACATT GACGACGCAA TAAGAGCAGG TATATTGAAG
GAAGATGACC TGCCAAAAGA CTGTCTTAAA ATACTGGGAT ATTCTAAGAG AGAGAGAATT
AATACATTAA TTAGGGATAT AATAAAAAAT AGTATGGACA AGCCAGAAAT CTCCATGAGT
GAAGATGTTT TTTATGCTAT GCAGACTTTA CGAAGTTTCA TGTTTGAAAA TGTGTATATT
GGTTCTGAAG CAAAAAAGGA TGAAAGTAAA GCCAAATATA TTATACAAGC TCTCTATGAA
TATTTTATGT CGAACTGTGA CGTCTTACCT GACGATGTGA AAAAGAATAT CGACAGATTT
GGAAAGGAAC AGGTAATAGT TGACTATATA GCTGGAATGA CAGACAGGTA TGCTATGCGA
AAGTTTTATG AATTATTCTT ACCGTCACCA TGGAACAAGC TTTGA
 
Protein sequence
MLKIREYQEN LEEKILSPYA MLSKNTKGRQ KQEQKCDVRT EFQRDRDRII HSKSFRRLKH 
KTQVFISPEG DHYRTRLTHA LEVAQIARTI ARALRLNEDL TEAIALGHDL GHTPFGHAGE
DILNKITTTG FSHNVQSLRV VDFLEGEDGL NLTFEVRDGI LNHVWGRTPA TLEGRVVQFA
DRIAYINHDI DDAIRAGILK EDDLPKDCLK ILGYSKRERI NTLIRDIIKN SMDKPEISMS
EDVFYAMQTL RSFMFENVYI GSEAKKDESK AKYIIQALYE YFMSNCDVLP DDVKKNIDRF
GKEQVIVDYI AGMTDRYAMR KFYELFLPSP WNKL