Gene Information |
Locus tag | Apar_0864 |
Symbol | |
ID | 8413730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 958827 |
End bp | 960191 |
Gene Length | 1365 bp |
Protein Length | 454 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 645022447 |
Product | thiamine pyrophosphokinase |
Protein accession | YP_003179884 |
Protein GI | 257784667 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase [COG1564] Thiamine pyrophosphokinase |
TIGRFAM ID | [TIGR01378] thiamine pyrophosphokinase [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
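
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS length includes the stop codon; neither convention is stated in the record itself, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Apar_0864
length_bp = gene_length(958827, 960191)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1365 bp) and Protein Length (454 aa).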
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000134495 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.492449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTGA CCGGTGCAAT TTTTGATTGC GATGGAACCC TCGTTGATTC AATGTGTGTT TGGCACAATG TGTTTAGTGC TGTTCTTCCT AAATATGGCA AGACCGTTGA TCCAGACATT TTTAATCGCG TAGAGGCAGT TTCACTCATT GGTGGATGTC AGATTTGTGT TGATGAACTT GCTTTGCCTG TCACAGCAGA AACCTTGTAT GAAGAGTTTT GCGCGTACGC AACTGACCAG TATCAACATC ACGTTTCAAT CGTTCCCGGC GCAAAGGAGT TCTTGCAGGA ACTCTACGAT GCGGGTATTC CTCTGGCTGT TGCATCATCA ACCCCCGTGC GAGAAGTACG TGCAGCCCTT GCAGCTCAAG GTATTGAACA TCTCTTTAAA ACGGTAGTTT CAACGGAGGA TGTTGGGGGA GTGGACAAGG TTGAACCAGA TGTTTACCTT GAAGCTCTCC GTCGTCTCGG CACCGATAAA GCGACAACCT GGGTTTTTGA GGACGCTCCG TTTGGTGCTC AGACGGCTCA AAAAGCAGGT TTCCCCGTGG TGGCACTTTA TAACGATCAT GACGGTAGAG ATCCCGTCTT TATGAGAGAA CACTCTAATA TCTTTGCCCA CACTTACGGC GAGTTGTCGC TTCTGCGTCT TTGTGACTAC GAGCGTCCTC TGACCAGTGC TCCTTCTGGC GAGAAGCCCC TTGAGGTTCT TATTGTGGGC GGATCCCCAG AGGCTGTCTC AAAAACGACG CTTTCTACCT GTGTCCAAAG CGCTGATTAC CTGATAGCGG TTGACCACGG TGCAGATGCA TGTCACGTTG CCGGTGTGGT TCCACAGCTT GCGCTTGGAG ACTTTGACTC AGCTTCATTA GAAACGGTGA CTTGGCTCAA AGAACAGCAG GTACCTTGTA TGAAGTTCAA TGCGGATAAA TATGATACCG ACCTTGCTCT TGCTCTTAAG TCCGCTGAGC ACGAGGCAAT TCGCAGAAAT AGTAAACTCT CACTCACGGT CGTCTCTACG TCTGGTGGCC ATCTGGATCA TCAGCTTGTA GTGCTCGGTC TTCTTGCCGC GTGGGCAAAG ACGGGCAAGG CAAAGGTTCG TGTTGTTGAG AATGATTTTG AGATGCGCTT TTTAGCCGCG GATCAAATTG ATTCTTGGCA GCTTGATGCA TCTGCAACAG GTAAAAAGAT TTCTCTTGTT GCTTTGTCAG AGGAGTGCGA GGTTTCTGAG TCTGGCATGA GGTGGAATCT TAATCACGAG AAGTTCACCT TACTTGGAGA TGACGGAATC TCAAATATTG TTGAAGCTGA CGGGGCTTGG GTCAAGTGCG AGAAGGGCTG TCTTTTGGTG CAGCTTTGGA ATTAA
|
Protein sequence | MQVTGAIFDC DGTLVDSMCV WHNVFSAVLP KYGKTVDPDI FNRVEAVSLI GGCQICVDEL ALPVTAETLY EEFCAYATDQ YQHHVSIVPG AKEFLQELYD AGIPLAVASS TPVREVRAAL AAQGIEHLFK TVVSTEDVGG VDKVEPDVYL EALRRLGTDK ATTWVFEDAP FGAQTAQKAG FPVVALYNDH DGRDPVFMRE HSNIFAHTYG ELSLLRLCDY ERPLTSAPSG EKPLEVLIVG GSPEAVSKTT LSTCVQSADY LIAVDHGADA CHVAGVVPQL ALGDFDSASL ETVTWLKEQQ VPCMKFNADK YDTDLALALK SAEHEAIRRN SKLSLTVVST SGGHLDHQLV VLGLLAAWAK TGKAKVRVVE NDFEMRFLAA DQIDSWQLDA SATGKKISLV ALSEECEVSE SGMRWNLNHE KFTLLGDDGI SNIVEADGAW VKCEKGCLLV QLWN
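
The sequences above are displayed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not a tool used by the source database, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence from the record
prefix = clean_sequence("ATGCAGGTGA CCGGTGCAAT")
print(translate(prefix))  # first residues of the protein
```

Applied to the full Gene sequence field, `translate` should reproduce the Protein sequence field once both have been passed through `clean_sequence`, and `gc_content` can be compared with the listed GC content.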