Gene Apar_0864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0864 
Symbol 
ID8413730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp958827 
End bp960191 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content49% 
IMG OID645022447 
Productthiamine pyrophosphokinase 
Protein accessionYP_003179884 
Protein GI257784667 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase
[COG1564] Thiamine pyrophosphokinase 
TIGRFAM ID[TIGR01378] thiamine pyrophosphokinase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000134495 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.492449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTGA CCGGTGCAAT TTTTGATTGC GATGGAACCC TCGTTGATTC AATGTGTGTT 
TGGCACAATG TGTTTAGTGC TGTTCTTCCT AAATATGGCA AGACCGTTGA TCCAGACATT
TTTAATCGCG TAGAGGCAGT TTCACTCATT GGTGGATGTC AGATTTGTGT TGATGAACTT
GCTTTGCCTG TCACAGCAGA AACCTTGTAT GAAGAGTTTT GCGCGTACGC AACTGACCAG
TATCAACATC ACGTTTCAAT CGTTCCCGGC GCAAAGGAGT TCTTGCAGGA ACTCTACGAT
GCGGGTATTC CTCTGGCTGT TGCATCATCA ACCCCCGTGC GAGAAGTACG TGCAGCCCTT
GCAGCTCAAG GTATTGAACA TCTCTTTAAA ACGGTAGTTT CAACGGAGGA TGTTGGGGGA
GTGGACAAGG TTGAACCAGA TGTTTACCTT GAAGCTCTCC GTCGTCTCGG CACCGATAAA
GCGACAACCT GGGTTTTTGA GGACGCTCCG TTTGGTGCTC AGACGGCTCA AAAAGCAGGT
TTCCCCGTGG TGGCACTTTA TAACGATCAT GACGGTAGAG ATCCCGTCTT TATGAGAGAA
CACTCTAATA TCTTTGCCCA CACTTACGGC GAGTTGTCGC TTCTGCGTCT TTGTGACTAC
GAGCGTCCTC TGACCAGTGC TCCTTCTGGC GAGAAGCCCC TTGAGGTTCT TATTGTGGGC
GGATCCCCAG AGGCTGTCTC AAAAACGACG CTTTCTACCT GTGTCCAAAG CGCTGATTAC
CTGATAGCGG TTGACCACGG TGCAGATGCA TGTCACGTTG CCGGTGTGGT TCCACAGCTT
GCGCTTGGAG ACTTTGACTC AGCTTCATTA GAAACGGTGA CTTGGCTCAA AGAACAGCAG
GTACCTTGTA TGAAGTTCAA TGCGGATAAA TATGATACCG ACCTTGCTCT TGCTCTTAAG
TCCGCTGAGC ACGAGGCAAT TCGCAGAAAT AGTAAACTCT CACTCACGGT CGTCTCTACG
TCTGGTGGCC ATCTGGATCA TCAGCTTGTA GTGCTCGGTC TTCTTGCCGC GTGGGCAAAG
ACGGGCAAGG CAAAGGTTCG TGTTGTTGAG AATGATTTTG AGATGCGCTT TTTAGCCGCG
GATCAAATTG ATTCTTGGCA GCTTGATGCA TCTGCAACAG GTAAAAAGAT TTCTCTTGTT
GCTTTGTCAG AGGAGTGCGA GGTTTCTGAG TCTGGCATGA GGTGGAATCT TAATCACGAG
AAGTTCACCT TACTTGGAGA TGACGGAATC TCAAATATTG TTGAAGCTGA CGGGGCTTGG
GTCAAGTGCG AGAAGGGCTG TCTTTTGGTG CAGCTTTGGA ATTAA
 
Protein sequence
MQVTGAIFDC DGTLVDSMCV WHNVFSAVLP KYGKTVDPDI FNRVEAVSLI GGCQICVDEL 
ALPVTAETLY EEFCAYATDQ YQHHVSIVPG AKEFLQELYD AGIPLAVASS TPVREVRAAL
AAQGIEHLFK TVVSTEDVGG VDKVEPDVYL EALRRLGTDK ATTWVFEDAP FGAQTAQKAG
FPVVALYNDH DGRDPVFMRE HSNIFAHTYG ELSLLRLCDY ERPLTSAPSG EKPLEVLIVG
GSPEAVSKTT LSTCVQSADY LIAVDHGADA CHVAGVVPQL ALGDFDSASL ETVTWLKEQQ
VPCMKFNADK YDTDLALALK SAEHEAIRRN SKLSLTVVST SGGHLDHQLV VLGLLAAWAK
TGKAKVRVVE NDFEMRFLAA DQIDSWQLDA SATGKKISLV ALSEECEVSE SGMRWNLNHE
KFTLLGDDGI SNIVEADGAW VKCEKGCLLV QLWN