Gene Athe_1223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1223 
Symbol 
ID7409697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1312336 
End bp1313976 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content34% 
IMG OID643715588 
ProductDak phosphatase 
Protein accessionYP_002573096 
Protein GI222529214 
COG category[R] General function prediction only 
COG ID[COG1461] Predicted kinase related to dihydroxyacetone kinase 
TIGRFAM ID[TIGR03599] DAK2 domain fusion protein YloV 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTT TAACTGCAGA TGTATTAAAA GATATGTTAA AAGCTGCAAA TAATTATTTA 
AAATTGCACA TAGATAAGAT AAACTCATTA AACGTCTTTC CAGTACCAGA TGGTGACACA
GGCACCAATA TGTCTGCCAC TCTTGACAGC AGCATAAAAG AAATAAATGG AAAGACTTTC
GAAAATGTGG ACAAACTTAT GAATGCAGTT GCGTTTGGCA GCTTAAAAGG TGCACGCGGT
AATTCTGGTG TTATTCTTTC TCAGCTTTTA CGCGGATTTG CCAAAGAGCT AAAAGGCAAA
GATGTTATAG ATATACCAAC ATTTGTTGCT TGTTTAAAAT CTGCGTCTGC AAGTGCTTAC
AAAGCAGTGA TGAAGCCTAC AGAAGGCACT ATGCTCACAG TTGCACGCGG GATTGCAGAG
GATGTTGAAA AAGAAGTGGC AGAAGGCATT GTGAGTGAAA TAGAGGATTT GCTGGAAGTG
TGTGTTTCAA GCGGGAAGAA GTGGCTTGCA AAGACACCAG AGATGCTTTC TATTTTAAAA
GAGGCAAATG TAGTTGACAG TGGCGGTATG GGTCTTGTAA TAATTTTTGA AGGGATGTAT
AAATTCTTAA AAGAAGGAAT GGTATTTGAA GAGCCATCAC AGCAGGAAGT TTATACAGTC
CTCACTTTTA AACCTGAAGA TATTAAGTTT ACTTACTGTA CAGAGTTTTT TATTACCGGT
TTGAAAAAGA ATATTGAAAA AGAATTTAAG GAATATCTTG AGACAATTGG TGATTCAATT
ATTGTAATCC AAGATGGCGA CATTCTCAAA ACACACGTTC ACACAAATTC ACCTGGCAAG
GTAATAGAAA AGGCTTTGAA ATATGGTGAG CTTATAAATA TAAAGATTGA TAATATGAAA
TATCAGCACC AGGAGTTTAT AAGTAAAAGA GAAAACCATG AGACAGAACT TCAAACACAG
GCTGAAGTTA TTATAAAAGA ATATGGTTTT GTAGCTGTAT CACAAGGAGA AGGATTCAAT
GAAATATTAA AAGGCTTGGG TGTTGATTTT GTAATTGAAG GCGGACAGAC TATGAATCCA
AGCGCTGAGG ACTTTGTAAA TGCTATAAAG AATGTACCAG CCAAAAATGT ATTTATTTTC
CCGAACAATA AAAACGTGAT TATGTCAGCA GAGCTTTCTT TACAGCTTAT TAATACAAAT
AAAAATATAG TGATTATGAA GACAACCAAT ATTCCTGAGT GCATTACTGC AATGATAAAG
TTTGATTTGA ACAAGAGTAT TGAAGAAAAT ATAAAGCTCA TGCAGCAAGC TATAAACTCA
GTAAAGGTTG TAGAAATAAC TAAGGCAGTG AGAAATACAA AAATAAACGG GTTTGAGATT
GAAGAAGGCG ATTTTATAGG GATTTCCAAA AAGGAAATAA TTGCATGTGA CAAAGATATG
TTAAAAGTAG CTTTGGCTTG TGTCGAAAAG ATTGTTGATT CTACAACCCA GATTTTGAGT
ATTTACTATG GCAAAGGTGT AGCCTTAGAA GATATAGAGG TGCTTGTTAA AAACATACAA
GAAATATACC CGAAAATTGA CATTGAGAGC TATGAAAGTG GAAATGAAAT TTATCAATTA
ATTATTGTAG CTGAGATGTG A
 
Protein sequence
MKFLTADVLK DMLKAANNYL KLHIDKINSL NVFPVPDGDT GTNMSATLDS SIKEINGKTF 
ENVDKLMNAV AFGSLKGARG NSGVILSQLL RGFAKELKGK DVIDIPTFVA CLKSASASAY
KAVMKPTEGT MLTVARGIAE DVEKEVAEGI VSEIEDLLEV CVSSGKKWLA KTPEMLSILK
EANVVDSGGM GLVIIFEGMY KFLKEGMVFE EPSQQEVYTV LTFKPEDIKF TYCTEFFITG
LKKNIEKEFK EYLETIGDSI IVIQDGDILK THVHTNSPGK VIEKALKYGE LINIKIDNMK
YQHQEFISKR ENHETELQTQ AEVIIKEYGF VAVSQGEGFN EILKGLGVDF VIEGGQTMNP
SAEDFVNAIK NVPAKNVFIF PNNKNVIMSA ELSLQLINTN KNIVIMKTTN IPECITAMIK
FDLNKSIEEN IKLMQQAINS VKVVEITKAV RNTKINGFEI EEGDFIGISK KEIIACDKDM
LKVALACVEK IVDSTTQILS IYYGKGVALE DIEVLVKNIQ EIYPKIDIES YESGNEIYQL
IIVAEM