Gene Information |
Locus tag | Cag_1213 |
Symbol | |
ID | 3748247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1612181 |
End bp | 1613290 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637773747 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_379518 |
Protein GI | 78189180 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00911735 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACTGA ATGGAGAATT CACCCTTATT GATACCATAG CGCACCTTGT GCAACCAACG CTTGCCAATG CCCCCACACT GCTGCAAGGT ATTGGCGACG ATTGCGCTAT TATGCAACCA ACCGCCGGCA TGGTAGAAGT TGCCACTACC GACCTTTTAG TTGAATCGGT ACACTTCGAC CTCTTAACCA CGCCACTCTC GCATCTCGGC AGCAAAGCCA TAAGCGTTAA CGTCTCCGAC ATTTGCGCTA TGAATGCCCT ACCACGCTAC GCGTTAGTGA GCCTTGCCCT TCCTCCTACC TTTTCTAAAA AAATGGTGGA AGAACTCTAT GGCGGTATGG TACACGCCGC TCAAGCCTAC GGTATTGCAA TTGCAGGCGG CGATACCTCC GCCTCTCGTT CGGGACTGAT GATCTCCATT ACTGCAATTG GCGAAGCATT GCCCACGCAA CTCACGCGTC GTAGCGGTGC TCAACTTGAC GACTTGCTTT GCGTTACGGG CACCTTAGGT GGTTCAATGG CTGGGCTCAA GCTCTTAATG CGCGAAAAAG AGATTATGTT AGAACACTTG CGCAATAACG AACCTGTTAA CCGCAATCTC TTAGCGGATT TAGATGAGTA TCGAGAGTTA ATGCAGCGCC ACCTACTACC AACCGCCCGC CTCGACGTTG TGCGCCTTTT CCACCGCCTT GGCATACAAC CCACCGCCAT GATTGATATT TCCGATGGAC TCAGCTCCGA AGTGCAACAC ATCTGCCGCC ATTCCAACTG CGGAGCGTTG CTACACGAAA GCCGCATTCC CATCCACGCC ACCACACGCC AACTTGCCGA CGAAATGCAA GAAGAGCCGC TAACATGGGC ACTAACGGGC GGCGAAGAAT ACCAGCTCCT TTTTACGCTC CCCGAAGCCA CCTACCAGCA ACTTGCCCAC GAGCGCGACA TACACGTTAT TGGCACCATC ACGCCCACCA ATGAAGGCAT GGTGCTTGAA GAGATGTTTG GCATTCGCAT TGACCTTACC ACCATTCACG GCTTTGACCA TTTTGCTCCA TCAGGTAATG ACGATGGTAA CACGGAAAAT GAGGAAGAGG AGTTTGAGGA TGGCGTGTAA
|
Protein sequence | MPLNGEFTLI DTIAHLVQPT LANAPTLLQG IGDDCAIMQP TAGMVEVATT DLLVESVHFD LLTTPLSHLG SKAISVNVSD ICAMNALPRY ALVSLALPPT FSKKMVEELY GGMVHAAQAY GIAIAGGDTS ASRSGLMISI TAIGEALPTQ LTRRSGAQLD DLLCVTGTLG GSMAGLKLLM REKEIMLEHL RNNEPVNRNL LADLDEYREL MQRHLLPTAR LDVVRLFHRL GIQPTAMIDI SDGLSSEVQH ICRHSNCGAL LHESRIPIHA TTRQLADEMQ EEPLTWALTG GEEYQLLFTL PEATYQQLAH ERDIHVIGTI TPTNEGMVLE EMFGIRIDLT TIHGFDHFAP SGNDDGNTEN EEEEFEDGV
|
| |