Gene Athe_2071 details


Gene Information

Locus tag: Athe_2071
Symbol:
ID: 7408780
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2187622
End bp: 2188755
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 37%
IMG OID: 643716438
Product: glycerate kinase
Protein accession: YP_002573921
Protein GI: 222530039
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1929] Glycerate kinase
TIGRFAM ID: [TIGR00045] glycerate kinase
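
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(2187622, 2188755)
print(length_bp)                     # 1134
print(protein_length_aa(length_bp))  # 377
```

Under these assumptions the computed values agree with the listed gene length (1134 bp) and protein length (377 aa).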


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000043752
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAATATT TGGTTGCACC GGATAAATAT AAAGGGTCAT TTGACGCTTT AGTCGCATCT 
GAGATAATAA AAGAAGCTAT TGTTGAGGTT GACAAAAGCG CAGAAGTTTT TCAGCTTCCG
CTTGCCGACG GTGGAGAAGG AACCTTGACA GCTCTATCTA AAATTTTTGG TGCCAAAATA
GAAGAGGTTG AGGTAAATGA CCCTCTTTTT AGGAAGATCA AAAGCAGAAT AGGATTTTTT
GAAGACAAAG CAATCATTGA AATAGCAGAA TGTTCAGGTC TTCTTATTTT AAAAGATGAA
GAAAGAAATC CTCTTTACAC AACAACATAT GGTGTTGGTG AGCTCATCAA ATACGCAATT
TCAAAGAAAG TTAAAGAAAT CCTCATAGGC ATTGGCGGCT CTTCAACAAA TGATGCAGGC
ACAGGAATGC TAAATGCACT TGGAATGAAA TTTTTAGATG AAAATGGAGA GGAATTAAAA
CCAATCGGAG AAAACCTGAT AAAAATAAAA AAGATAGATG ATTCAGAATT TTTAAAAGAT
GTTCGCAAAG TAAAATTTAC AGTTCTGTGC GATGTTACAA ATCCATTATA CGGAGAAAAC
GGTGCAGCGT ATGTGTTTGC ACCTCAAAAA GGGGCAGATG AAAATGCTGT AAAGCTTCTT
GATATGGGAC TTAGAAATTT TGCAAATGTC GCTAAAGAGT ATCTTGGAAA AGACATGTCG
CTATCCAGCG GAGCTGGTGC TGCAGGAGGC TTGGGATTTG CACTTTTGGC TTTTTTGAAC
GCTCAGTATG TATCGGGAAT AGATTATATA CTAAGCGCTT CGAACGCTGA AGAACACGTT
AAATGGGCAG ACATTATAAT CACTGGTGAA GGAAGATTTG ACAGGCAAAG CTTATCTGGA
AAATCTACAA TTGGAATTGC AAGACTTGGA GTTAAACTTG GCAAGATGGT AATTGTTATT
TCAGGGTCTA TTGATTGTCC TTTTGAAGAG TACACAAAAG AAGGAATAAC CTCAATTTTC
TCTATTGTTG ATATGGCATC ATCGCTTGAC AGATGTCTCA AAGAAGCACC ACGGCTTTTG
AAAGAAACTA CAAAGAGCAT TGTGAATTTG ATTTTAAGGG CAAAAAATTT TTAA
 
Protein sequence
MKYLVAPDKY KGSFDALVAS EIIKEAIVEV DKSAEVFQLP LADGGEGTLT ALSKIFGAKI 
EEVEVNDPLF RKIKSRIGFF EDKAIIEIAE CSGLLILKDE ERNPLYTTTY GVGELIKYAI
SKKVKEILIG IGGSSTNDAG TGMLNALGMK FLDENGEELK PIGENLIKIK KIDDSEFLKD
VRKVKFTVLC DVTNPLYGEN GAAYVFAPQK GADENAVKLL DMGLRNFANV AKEYLGKDMS
LSSGAGAAGG LGFALLAFLN AQYVSGIDYI LSASNAEEHV KWADIIITGE GRFDRQSLSG
KSTIGIARLG VKLGKMVIVI SGSIDCPFEE YTKEGITSIF SIVDMASSLD RCLKEAPRLL
KETTKSIVNL ILRAKNF
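
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11, TTG is one of the permitted alternative start codons, and an initiation codon is read as methionine. The sketch below illustrates this on the first 60 nucleotides of the gene sequence only. The helper names are hypothetical, the start-codon set is a common subset of those allowed by table 11 (an assumption made for brevity), and the codon assignments are those of the standard genetic code, which table 11 shares for non-initiation codons.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    clean = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(clean) - len(clean) % 3, 3):
        codon = clean[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
FIRST_60_NT = "TTGAAATATT TGGTTGCACC GGATAAATAT AAAGGGTCAT TTGACGCTTT AGTCGCATCT"
print(translate(FIRST_60_NT))  # MKYLVAPDKYKGSFDALVAS
```

The output matches the first 20 residues of the protein sequence listed above.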