Gene Athe_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2229 
Symbol 
ID7407648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2362287 
End bp2363957 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content41% 
IMG OID643716595 
Productribulokinase 
Protein accessionYP_002574074 
Protein GI222530192 
COG category[C] Energy production and conversion 
COG ID[COG1069] Ribulose kinase 
TIGRFAM ID[TIGR01234] L-ribulokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00201608 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGT TCAGTATAGG AATTGATTTT GGAACACAGT CAGGAAGGGC AGTGCTTGTT 
AATGTTGAGA CAGGTGAAGA GGTTGCAACA AGTGTGAAAG AGTACACTCA TGGAGTTATG
GACGAGAGCT TGCCAGATGG TACAAAACTT CCTCATGACT GGGCACTTCA ACATCCACAG
GATTATATAG AAGTCTTGGC AACAACTGTT CCAGATGTCT TGAAAAAAGC AGGAGTTTCC
AAAGACGATG TGATTGGAAT AGGAATTGAC TTTACAGCCT GTACAATGCT TCCTATCAAA
AAGGATGGGA CACCGCTTTG TGAGCTTCCC AAGTTCAAAT CAAATCCTCA TGCCTATGTT
AAGCTGTGGA AACACCATGC TGCTCAAAAG TATGCAAATA GGCTAAATAG AATAGCTCAA
GAAAGAGGAG AGAAGTTTTT ACAAAGATAT GGCGGAAAGA TTTCATCCGA ATGGCTATTC
CCCAAAATCA TGCAGATTTT AGAAGAAGCA CCAGAAGTCT ATGAAGAGGC AGACAAATTT
ATAGAAGCAG CTGACTGGAT TGTGTTTAAG ATGACAGGGG TTGAGAAGAG AAACTCATGT
ACAGCAGGAT ATAAAGCTAT CTGGAGCAAG AGGGAAGGGT ATCCTTCAAA AGAGTTTTTC
AAAGCACTTC ATCCAAGGCT TGAAAATGTT GTTGATGAAA AGCTATCACG CGAGATATAC
CCGATTGGGC AAAAAGCTGG TGAGCTCACA GAAGAGATGG CAAAGCTTAT GGGATTAAAT
CCTGGGACAG CTGTTGCAAT TGCAAATGTT GATGCTCATG TGTCAGTTCC TGCTGTTGGG
ATTACTGATA TTGGCAAGAT GCTGATGATA ATTGGAACAT CTACATGCCA CATGCTTTTG
TGGAATGAAG AGAAGATGGT ACCAGGAATT TGCGGATATG TTGAAGATGG AATTTTACCT
GGTTTTTATG GGTATGAGGC AGGACAGAGC TGTGTTGGAG ATCATTTTGA GTGGTTTGTT
GAAAACTGCG TGCCAGCTCA ATATCACGAT GAGGCAAAAC AAAAAGGACT CAACATATAT
CAGCTACTCA AAGAAAAAGC AAAGGCTTTA AAGCCTGGCC AGAGCGGTCT TTTGGCGCTT
GACTGGTGGA ATGGCAACAG GTCAATTCTG GTTGATGCAG ACCTCACTGG TATGATGCTT
GGCATGACCT TGACTACAAA GCCAGAGGAG ATGTACAGAG CACTGATTGA GGCAACGGCA
TATGGCACAA AGATAATAAT TGATAACTTC AATGAGCACG GTGTTGAAGT AAGAGAGCTT
TATGCGTGTG GAGGAATTGC TGAAAAAGAT GAGCTTTTGA TGCAGATTTA TGCTGATGTG
ACAGGGCTTG AAATAAAGGT TTCTGCATCA CCTCAAACAC CGGCACTTGG TTCTGCAATG
TTTGGTGCTG TTGCTGCAGG AAAAGAAAGA GGTGGTTATG ATAGCATATT TGAAGCTGCA
AAGAAAATGG CAAAACTTAA AGACTATTCT TATAAACCAA ATCCGCAAAA CCATGAGATT
TACAAAAAGC TATACAGAGA ATACAGAATA CTTCATGACT ATTTTGGAAG AGGAGCAAAT
GATGTAATGA AGAGATTAAA AGAGATAAAA GATGAAGTTT CGAAAATATA A
 
Protein sequence
MAKFSIGIDF GTQSGRAVLV NVETGEEVAT SVKEYTHGVM DESLPDGTKL PHDWALQHPQ 
DYIEVLATTV PDVLKKAGVS KDDVIGIGID FTACTMLPIK KDGTPLCELP KFKSNPHAYV
KLWKHHAAQK YANRLNRIAQ ERGEKFLQRY GGKISSEWLF PKIMQILEEA PEVYEEADKF
IEAADWIVFK MTGVEKRNSC TAGYKAIWSK REGYPSKEFF KALHPRLENV VDEKLSREIY
PIGQKAGELT EEMAKLMGLN PGTAVAIANV DAHVSVPAVG ITDIGKMLMI IGTSTCHMLL
WNEEKMVPGI CGYVEDGILP GFYGYEAGQS CVGDHFEWFV ENCVPAQYHD EAKQKGLNIY
QLLKEKAKAL KPGQSGLLAL DWWNGNRSIL VDADLTGMML GMTLTTKPEE MYRALIEATA
YGTKIIIDNF NEHGVEVREL YACGGIAEKD ELLMQIYADV TGLEIKVSAS PQTPALGSAM
FGAVAAGKER GGYDSIFEAA KKMAKLKDYS YKPNPQNHEI YKKLYREYRI LHDYFGRGAN
DVMKRLKEIK DEVSKI