Gene Athe_0531 details


Gene Information

Locus tag: Athe_0531
Symbol:
ID: 7408656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 600110
End bp: 601774
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 35%
IMG OID: 643714913
Product: AAA ATPase
Protein accession: YP_002572430
Protein GI: 222528548
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0572] Uridine kinase
TIGRFAM ID:
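
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length_bp(600110, 601774)
print(length)                     # 1665
print(protein_length_aa(length))  # 554
```

Both values agree with the Gene Length and Protein Length fields of this record.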


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.777332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC AAGATAATAC AGTAAAAGTC TTTTTTGAAG ATGCAAATGT ATATGAAGAT 
GTAGAGGTTG GGACAAATCT TTTGAGCTTT GTTCCAAGGT TTGAAAGTTA TTTTAAATCT
TCGATAGTTG CAGCAAAGGT AGACAATGAG ATTAAAGAAC TGAAGTATGT TATTCACAGA
GATTGCAAGG TAAAATTTAT TGACATGACC CAAGAAGATG GTATGAGGAT TTACAGAAGA
AGTCTCATTT TTGTTTTAAT TGTTGCAACA AGAATGCTTT TTAAAGAAAC TGTAAATGTT
CAGCATTCTC TTTCAAAAGG ACTTTATTGT GAGATTGAAA ACAGAAAGCT GACAGCTGAA
GATATAAATC TTATAAAACA AAAAATGAAA TGGATAGTAG ACCAAGATTT TCAATTTAGA
AGAGAAAAGG TTTCAAAAGA TGATGCAGTT AAACTTTTTG AAGAAAAAGG CTTTTACGAT
AAAGCAAGAA CAATAAAGTT TTCAGAAAAC GACTATGTTT ACATTTATTA CTGTGGGGAT
TATGTTGATT ATTTTTATGG TCATCTTGTG CCCTCTACAG GATATCTAAA AATATTTGAT
CTAATTCAAT ACCACGACGG TATGGTGCTT TTGTACCCAG ACAAGTCAGA CCCATTTAAG
CTTCAAGAGT TTGTTGAGAA CAAGAAACTG TTTGCAGTAT ACCATGAGTA CAAAAACTGG
GGCAAGATAC TTGGGGTAAG CAGCATCGGT GAGCTCAATG AGGTGATAGC AAGTGAAAAG
ATAAGAGAAT TTATAAGAGT CTCAGAAGCT CTACACGAAA AAAAGATAGC ATATTTGGCT
GACCAGATTT CGCAAAATCC GCTGATAAAA GTTGTTTTGA TATCCGGACC TTCATCATCC
GGAAAGACAA CCTTTGCACA AAGGCTTTCT ATTCAGCTTA AAGTAAATGG AAAAAACCCT
GTTTATATAG GGCTTGACGA TTATTTCTTT GAGGATAAAG TGCCACTTGA CGAAAATGGC
AAGCCTGACT ATGAATCGAT TGAAGCTATT GATGTTGAGC TTTTTAACAA ACAGCTAAAA
GATTTGATAG ATGGCAAAGA GGTTGTTCTG CCACGGTTTA ATTTTATAGA AAGAAAGCGA
ACATTTGAAA GACCAGTCAA GCTTGAAAAG AACGATATAA TAATAATTGA AGGAATACAT
GGACTAAACA GAAAACTTAC TCCAATGATA CCTGATGAAA GCAAGTTCAA AATATATGTA
AGTGCCTTGA CACATTTAAA TCTTGACAAA CACAATAGAA TACAGACAAC AGATTATAGA
ATTTTGAGAA GGATTGTCAG AGATGCCAGA ACAAGAGGCG CATCTGCTAA AAGAACAATT
TCTATGTGGC CGTCTGTTAG AAACGGTGAA GAAAAGAATA TTTTTCCATA CCAGGAGATG
GCAGATGCCA TGTTCAATTC AGCGCTAATT TATGAGCTGG CTGTGTTAAA GAAATATGCC
GTGCCGCTAC TTAGGACAAT CACAAGAGAA GATGAGGAAT ATAGCGAAGC GCAGAGGCTT
TTACACTTTT TGAGCTTTAT CCTCACAATT GAGGACGAAA GAGAAATCCC ACCACAATCA
ATCATAAGAG AGTTCATTGG AGGGTCTTGC TTTTATGACT TCTGA
 
Protein sequence
MKKQDNTVKV FFEDANVYED VEVGTNLLSF VPRFESYFKS SIVAAKVDNE IKELKYVIHR 
DCKVKFIDMT QEDGMRIYRR SLIFVLIVAT RMLFKETVNV QHSLSKGLYC EIENRKLTAE
DINLIKQKMK WIVDQDFQFR REKVSKDDAV KLFEEKGFYD KARTIKFSEN DYVYIYYCGD
YVDYFYGHLV PSTGYLKIFD LIQYHDGMVL LYPDKSDPFK LQEFVENKKL FAVYHEYKNW
GKILGVSSIG ELNEVIASEK IREFIRVSEA LHEKKIAYLA DQISQNPLIK VVLISGPSSS
GKTTFAQRLS IQLKVNGKNP VYIGLDDYFF EDKVPLDENG KPDYESIEAI DVELFNKQLK
DLIDGKEVVL PRFNFIERKR TFERPVKLEK NDIIIIEGIH GLNRKLTPMI PDESKFKIYV
SALTHLNLDK HNRIQTTDYR ILRRIVRDAR TRGASAKRTI SMWPSVRNGE EKNIFPYQEM
ADAMFNSALI YELAVLKKYA VPLLRTITRE DEEYSEAQRL LHFLSFILTI EDEREIPPQS
IIREFIGGSC FYDF
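
The sequence blocks above are laid out in space-separated groups of ten characters. As a rough sketch of how they could be processed, the snippet below strips that layout, computes a GC fraction, and translates a nucleotide string. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignment, which translation table 11 shares with table 1 for all internal codons; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Codon table in the conventional T, C, A, G base order (assignments of NCBI table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAAAAAAC AAGATAATAC AGTAAAAGTC"
print(translate(first_bases))  # MKKQDNTVKV
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.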