Gene Athe_0151 details

Gene Information

Locus tag: Athe_0151
Symbol:
ID: 7408513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 187409
End bp: 189061
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 35%
IMG OID: 643714553
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_002572076
Protein GI: 222528194
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
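
The coordinates and lengths listed in the fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. Neither convention is stated on this page, and the variable names are made up for the example.

```python
# Values copied from the Gene Information fields.
start_bp = 187409
end_bp = 189061

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced and its last codon is a stop codon
# that does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)  # 1653 551 550
```

Under those assumptions the result agrees with the reported gene length of 1653 bp and protein length of 550 aa.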


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGATAA AAGGCATTCC TGTCTCAGAA GGGATTGGTT TGGGGAGAGC AGTTGTAATC 
AAAGAAAGTG AATATACAAT CAAAAAGACA AAAATAGAGG ATACTGATGC TGAGCTTAGA
CGCTTTTTGG ATAGCATAGA AAAAGCAAAA GAACAGATAA GGAAGATAAA AGCTGCAACT
CAGGAAAGTT TGGGCAAAAA AAATGCAATG ATTTTTGATG CCCATCTTTT AATCCTTGAT
GACCCAGAAT TTGTAAATAT GGTAAGAGGA AAGATAGAAG AAGGGATAAA TGCTGAGTTT
GCCATTGATG AGTCGGCAAG GTTTTTTGAA AATATGCTTT TGAGCTTAGA AGATGAATAT
ATGAGAGAGA GAACAAATGA TATAAAAGAT GTAGCTTTGA GACTAATTAA AAATTTGAAT
GGAGAAGAAC AAATAGACCT AAAAAATCTT CCTGAGGACA GTATTTTGAT TGCGCATGAC
CTTACTCCTT CACAAACAGC TCAAATAAAT AAACAAAATG TGCGGGGATT TGTCACAGAG
AAAGGTGGCA AAACTTCTCA TACAGCAATA ATTGCAAGAA CATACGAAAT TCCTGCAGTT
GTGGGTGTAG AAGGTATAGT CAATAGGATA AAAGATGGAG ATTTTTTGAT TGTGGATGGG
TATGAGGGGT TTGTTTATGT AAATCCTGAA GAAGATTTAA TAAAGGAATA TGAAAAAAAA
CTTGACGAAG AAAATAAGAG AAAAGAAGAG TTAAAAAGCT TTTTGTATGT TGAGTCCAAA
ACACAAGATG GGAAAAGGAT AAAACTGTTT GCAAATATTG CGCATATAGA AGAGATTGAC
GCTGCCCTGA AAAATGGAGC AGAAGGAATT GGGCTTTTCA GAACAGAGTT TTTGTTCATG
GATAGAAGCC AGCCACCATC AGAAGATGAA CAGTTTGAAG TTTATAAAAC TGTACTTGAA
AAGATGGAAG GCAAGCCGGT TATTATAAGA ACTTTGGATG TTGGGGGAGA CAAGAATATT
TCGTATTTGA ATATAGATAA AGAAGAAAAT CCTTTTTTGG GGTACAGAGC TATCAGGCTC
TGCTTAGGAA ATAAAGAGCT TTTTAAAACT CAGCTGAGGG CGCTTTTAAG AGCATCTATT
TATGGAAAAC TCAAAATAAT GTTTCCTATG ATAACCTGTA TTGATGAAGT GTATCAGGCA
AAATGGATTA TCCAGGAAGC CAAAGAAGAG CTCAAAAAAG AAAATATTCT TTTCTCACAA
AACATCGAAA TTGGTATAAT GATAGAAACT CCTGCTGCAG CAGTTATCTC AGATATTTTG
GATAAAGAAG TTGACTTTTT TAGCATAGGC ACAAATGACC TTATTCAATA TACACTTGCA
ATTGACAGGA CAAATGATAA AGTGTCGTAC CTGTACAATC CTTTGCATCC GGCTGTTTTG
AGACTTATTA AGATGACAGT TGAAAATGCT CACAAAAGAG GCATAGAGGT TGGGGTGTGC
GGAGAGATTG CATCAAACCA GGAATTTGTT CCTGTTTTGA TAGGACTTGG TGTTGATGAG
CTAAGCGTAA ATCCTTCTAA GATATTAAAT GTAAAGAAGA AAATTTTACA AACAAGGTTT
GAGGAAGAAA ACCTTCGTGT AAAAGAGCTC TGA
 
Protein sequence
MVIKGIPVSE GIGLGRAVVI KESEYTIKKT KIEDTDAELR RFLDSIEKAK EQIRKIKAAT 
QESLGKKNAM IFDAHLLILD DPEFVNMVRG KIEEGINAEF AIDESARFFE NMLLSLEDEY
MRERTNDIKD VALRLIKNLN GEEQIDLKNL PEDSILIAHD LTPSQTAQIN KQNVRGFVTE
KGGKTSHTAI IARTYEIPAV VGVEGIVNRI KDGDFLIVDG YEGFVYVNPE EDLIKEYEKK
LDEENKRKEE LKSFLYVESK TQDGKRIKLF ANIAHIEEID AALKNGAEGI GLFRTEFLFM
DRSQPPSEDE QFEVYKTVLE KMEGKPVIIR TLDVGGDKNI SYLNIDKEEN PFLGYRAIRL
CLGNKELFKT QLRALLRASI YGKLKIMFPM ITCIDEVYQA KWIIQEAKEE LKKENILFSQ
NIEIGIMIET PAAAVISDIL DKEVDFFSIG TNDLIQYTLA IDRTNDKVSY LYNPLHPAVL
RLIKMTVENA HKRGIEVGVC GEIASNQEFV PVLIGLGVDE LSVNPSKILN VKKKILQTRF
EEENLRVKEL
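
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It assumes that translation table 11 assigns amino acids exactly as the standard genetic code does, and that a start codon in first position (here GTG) is read as methionine. `START_CODONS` is a simplified subset of the initiator codons table 11 allows, and all function names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments and differs only
# in which codons may act as initiators.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified subset of table 11 initiator codons (an assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon in first position is read as methionine.
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 33 bases of the gene sequence above.
print(translate("GTGGTGATAA AAGGCATTCC TGTCTCAGAA"))         # MVIKGIPVSE
print(translate("GAGGAAGAAA ACCTTCGTGT AAAAGAGCTC TGA"))     # EEENLRVKEL
```

The two fragments reproduce the first and last ten residues of the protein sequence. Note that `translate` treats the first codon of whatever it is given as a potential initiator, so an internal fragment that happens to begin with GTG or TTG would incorrectly start with M. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.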