Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4694 |
Symbol | |
ID | 4070744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5554479 |
End bp | 5555435 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637986739 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_593768 |
Protein GI | 94971720 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.711974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.226637 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCTTT CCGAACGAGC TTTGATTGAG CGCATCCGGC GGCGAGCGTC ATCGCGCAAG TTCTCGGCGA TCACGCGCGG CATTGGCGAC GATTGCGCCG TGCTCGATCC GCCACCGGGA CATGAACTAC TGGTGACGAC GGATTTCTGC CTGGAGAACG TGCACTTCCG TCGCGAATGG CATCCGGCGA AAGCTGTCGG ACACCGGTGT CTCGTGCGCG GGCTGAGTGA CATCGCGGCA ATGGGTGGCG ATCCGCTGGC TGCGTTCCTC TCGCTCGCGT TGCCGGCGGA GATTCCGCAG AAGTGGGTGG ATGGATTCTT CGATGGCTTG TTAGCGCTGG CGGAGAGATG GAAGGTGCCG CTTGCAGGTG GTGATATTGC GCAGTCGCCA CAAGGCGTGA TGGCGGACAT TATGGTGCTG GGCTCGGTGC CGCGCGGCAA AGCGATTCTA CGTTCCGGCG GGAAGCCCGG CGATGTGCTG TATGTCACGG GAACGCTGGG ATTGTCGGTC GCAGCGTTGC AGGCGTTTCG CGCGGGAAAG AAGCCGACAG CGAAGTCTAC GCCGCGACAT TTCTTCCCGG ACCCGCGAAT TGATATCGGC CGCCTACTGC GCGAACGCAA GCTGGCTACG GCGATGATCG ACCTGAGCGA TGGACTATCA ACCGATCTTT CGCATATTTG CGATGAGAGC GGCGTGGGGG CAGTGGTGTA TGCCACATCG GTGCCGTACG TGGGCGGGGA AAACGGATTG GAATTCGCGC TGAATGGTGG TGAGGACTAT GAACTGCTTT TCACCGCGAA CCCGCGGGCG CGAGTGCCGA AAGAAATCGA CGGCGTACCT GTGACTGCGA TTGGCGAGAT CGTGCAGCGG CGCGGCATGT GGCTGGAAGA CAAGCGCGGT AAAGTCGCGA AATTGAGGCC GCGCGGCTGG GAACACTTCC GCGGAGAAAA GAAATAG
|
Protein sequence | MPLSERALIE RIRRRASSRK FSAITRGIGD DCAVLDPPPG HELLVTTDFC LENVHFRREW HPAKAVGHRC LVRGLSDIAA MGGDPLAAFL SLALPAEIPQ KWVDGFFDGL LALAERWKVP LAGGDIAQSP QGVMADIMVL GSVPRGKAIL RSGGKPGDVL YVTGTLGLSV AALQAFRAGK KPTAKSTPRH FFPDPRIDIG RLLRERKLAT AMIDLSDGLS TDLSHICDES GVGAVVYATS VPYVGGENGL EFALNGGEDY ELLFTANPRA RVPKEIDGVP VTAIGEIVQR RGMWLEDKRG KVAKLRPRGW EHFRGEKK
|
| |