Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0358 |
Symbol | |
ID | 6374020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 371849 |
End bp | 372916 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642682877 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_001958806 |
Protein GI | 189499336 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00127511 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTCA AACCTATTTC CGATATCGGC GAATTCGGGC TTATAGACCG TATCGCCTCG ATCACCGCCC CGACACTTGA GACAACACCC GGCATTACTG AAGGTATCGG TGATGATTGT GCCGTCTATG AAATTTCAAG ATCGATGGTG CAGGTTACCA CAACCGATCT TCTGGTTGAA CATGTTCATT TCGATCTGCT GACCACCCCC CTGCACCACC TGGGCAGCAA AGCGATAAGC GTCAACGTTT CGGATATCTG TGCGATGAAT GCAAAGCCGC GTTACGCTCT TGTGTCGATC GCCATCCCGT CGAAAACCTC GGTAGACCTT GTTGAAACCC TTTATCGTGG TATGAGCGAG ACTTCCCGTA TCTACGGACT GGCCATTGCT GGCGGAGACA CCTCTCTTTG CCCGGGGGCA ATGGTCATAT CCGTTACCGT TGCAGGCGAC ATCGAAAAAG AGAACATCAC CTATCGAAAA GGAGCGGAAC CCGGTGATAT GATCTGCGTG AGCGGGACTC TTGGAGGCGC TGCGGCCGGG CTCAGGGTAC TCATGCGGGA AAAATCGGTC ATGATGGAAC ACATCAGGCA TGGAGAAACC TACGACAAGG ATGTCATGAG TAATCTGAGC GATTACGATG ACGCCATCCG CCAGCAGCTT CTTCCTTCAG CTCGCATGGA TATCATCGGC TTTTTCGAAA AAGAAAACAT CGTCCCGACC TCCATGATCG ACATTTCAGA CGGGCTTGCA TCCGACCTCG CTCACATCTG TAAACGTTCA GGTGTCGGGG CACAGATCGA GGAGAGCCGA ATTCCGATCC TGTCGCAGAC CAGGCATATC GCCGACGAAT TTCAGGAAGA CGCCATGAAC TATGCGCTGA CCGGCGGAGA AGATTACCAG CTTCTTTTCA CGCTGAAACC GGAAAATTTT TCTGCCATCT CTTCCCATCC TGATATTTCT GTCATCGGGA AAATCACGCC GAAAGATGAC GGATTGCTGC TTCGTGACAT CTATGGAATG ACTGTTGACA TACAATCCTT GTCAGGGTTC GACCACTTTT CCGGCTGA
|
Protein sequence | MSFKPISDIG EFGLIDRIAS ITAPTLETTP GITEGIGDDC AVYEISRSMV QVTTTDLLVE HVHFDLLTTP LHHLGSKAIS VNVSDICAMN AKPRYALVSI AIPSKTSVDL VETLYRGMSE TSRIYGLAIA GGDTSLCPGA MVISVTVAGD IEKENITYRK GAEPGDMICV SGTLGGAAAG LRVLMREKSV MMEHIRHGET YDKDVMSNLS DYDDAIRQQL LPSARMDIIG FFEKENIVPT SMIDISDGLA SDLAHICKRS GVGAQIEESR IPILSQTRHI ADEFQEDAMN YALTGGEDYQ LLFTLKPENF SAISSHPDIS VIGKITPKDD GLLLRDIYGM TVDIQSLSGF DHFSG
|
| |