Gene Information |
Locus tag | Cpha266_0336 |
Symbol | |
ID | 4570535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 367091 |
End bp | 368158 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639764934 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_910819 |
Protein GI | 119356175 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
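The coordinate and length fields in this record agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (as the numbers imply) and that the gene length counts one stop codon that the protein length does not. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(367091, 368158)
print(length)                     # 1068
print(protein_length_aa(length))  # 355
```

Both values match the Gene Length and Protein Length fields above.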
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCATATA AAGCTATTTC CGATCTTGGT GAATTCGGAC TCATCGATCG GATATCCTCG CTTGTCGGGC CAACGCTTGA TGCTTCACCA AACCTGCTTA CAGGAATCGG AGATGACTGC GCCGTTTATC AGCCGACCGC CGGTATGCTC GAAGTAACGA CAACCGACCT GCTTGTGGAA AAGGTACATT TTGACCTGCT TACAACTCCT CTTAAACATC TCGGAAGTAA ATCAATCAGC GTCAATGTCT CCGACATCTG CGCCATGAAT GCAACACCTC AATATGCGTT GATCGGCATC GCTGTTCCGC CATCATTTTC GGTAGAGATG ATAGAAGAAC TCTACAAGGG CATGAGCCAT GCCGCACGCA TCTATGGAGT CGCCATTGCA GGAGGCGACA CATCTGTCTC CCGATCAGGT CTCTTTATTT CGGTGACCAT GACCGGTGAG GTTTCCGGGG AGCGACTCAC CAGACGATCC GGAGCAAAAC CGGGTGAAAT GATCTGTGTT ACCGGCACTC TTGGCGGAGC GGCTGCAGGA CTGCGCTTGC TTATGCGTGA AAAGAACATC ATGCTGGAGC ATATTGAACA CCATGAACCG TATAACAAAA GCCTCATGGT TGATCTCGAA GAGTACGCTG ATGCCATAAA AGAGCAGCTT CTTCCCGCTG CACGCATTGA TATCATCCGG TTCTTTCACT CCAGGAACAT CAATCCGACA GCCATGATTG ATATTTCGGA TGGCTTGAGC TCTGACCTGA GACATCTCTG CAACAGTTCG GGCACCGGAG CGGTAATCCA CGAAGGAAGA ATACCGGTTC ATTCCGGAGC AAGAAGAATT GCCGATGAGC TGCGCGATGA TGCGCTCGGC TGGTCATTGA CCGGCGGAGA GGACTACCAA CTGCTTTTTA CGCTGCCTAA AGAACGGTTT CCTGATATTG CAGAACACGA CGATATTTCG ATCATCGGAG AAATAACAGA AAAGGATGCC GGCATCTCAT TGGTCGACAT CTACGGCATG AGCATAGACC TCGAAAAACT GCCCGGATAT GACCATTTCC GCTCATAA
|
Protein sequence | MPYKAISDLG EFGLIDRISS LVGPTLDASP NLLTGIGDDC AVYQPTAGML EVTTTDLLVE KVHFDLLTTP LKHLGSKSIS VNVSDICAMN ATPQYALIGI AVPPSFSVEM IEELYKGMSH AARIYGVAIA GGDTSVSRSG LFISVTMTGE VSGERLTRRS GAKPGEMICV TGTLGGAAAG LRLLMREKNI MLEHIEHHEP YNKSLMVDLE EYADAIKEQL LPAARIDIIR FFHSRNINPT AMIDISDGLS SDLRHLCNSS GTGAVIHEGR IPVHSGARRI ADELRDDALG WSLTGGEDYQ LLFTLPKERF PDIAEHDDIS IIGEITEKDA GISLVDIYGM SIDLEKLPGY DHFRS
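The sequences above are printed in space-separated blocks of ten residues, so they need light cleaning before any analysis. The sketch below shows one way to strip the grouping, compute GC content, and run a rough coding-sequence sanity check. All helper names are hypothetical. The start-codon test is deliberately simplistic: translation table 11 also allows alternative start codons such as GTG and TTG, which this check would reject.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces/newlines used to group residues in blocks of ten."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(dna: str) -> bool:
    """Rough check: length divisible by 3, ATG start, TAA/TAG/TGA stop."""
    return (
        len(dna) % 3 == 0
        and dna.startswith("ATG")
        and dna[-3:] in {"TAA", "TAG", "TGA"}
    )


# Only the first two blocks of the gene sequence, for brevity.
excerpt = clean_sequence("ATGCCATATA AAGCTATTTC")
print(len(excerpt))         # 20
print(gc_content(excerpt))  # 0.3
```

Applied to the full gene sequence, the cleaned length should equal the 1068 bp reported in the Gene Length field, and the GC fraction should be close to the reported 50%.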