Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1653 |
Symbol | thiL |
ID | 3104761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1756835 |
End bp | 1757812 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637170815 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_114096 |
Protein GI | 53804264 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0108412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGGC CGGGTGAGTT CGAGATCATC CGGCGCTTTT TTTCCGGACA GCGGGTAGCG GACGTGAACA CCGAGATCGG CATCGGCGAC GACTGCGCGG TCCTGAACTT CACGGCAGCT GCGCGGTTAG CCCTGACCAC CGACACACTG GTCGCGGGCA TCCATTTTTT TTCCGATACC GATCCGACAA GGCTCGGCCA TAAGGCGCTC GCGGTCAATC TCAGCGACCT GGCCGCCATG GGAGCCAGAC CGAGATGGGC TTTGCTGGCC TTGACCCTGC CGGAAAACGA TCCCGAATGG ATCGCGGCTT TTGCGCAAGG CTTCTTCCGC CTCGCCGAGC GCCACAGGGT ACAGCTGATC GGGGGGGACA CCACGCGGGG GCCGCTGGCC ATCACGGTCC ACGCCCTGGG AACCGTGGAC GGCAGGCCTG GAGTAAAGCG TGCGGGTGCC AAACCCGGAG ATGCCATTTA TCTCACCGGC AGCATCGGCC TCGCCGGTCT GGGGCTGAGG ATCAGGCAGG GGGCCTATCT TGTCCCGGAC GAGGAGGCGC TGGATCGGCT GGAAGCCCCC CAGCCCCGGA CCGACTTCGG CATACGCCTC GGAGAATTTG CCAGCGCCGG CATCGATGTC TCGGATGGTC TGGCGGCAGA CCTCGGGCAC ATCCTCGAGC AAAGTTCCGT TGGCGCCGAG GTGGACTGGG AGTCTCTGCC GTTGTCGGCA GGCGTCGGCC GTTACGTGGC CGACTCCGGC GACTGGCAGA TGCCGTTGGT CGCCGGTGAC GACTACGAAC TGTGTTTCAC CGCGTCCCCC GCCTATGCCC AGGCGATCGC GTCTGCCGCC GATGCAACCG CCACCCCGGT GAGGCGCATC GGGACGATCC GTACCGGATC GGGTCTGATC ATCCGGCGGC GCGGCCGTCC GGTCGAGCTG TCACGCTCTG GGTATCTGCA CTTCGTTTCC GGGGAGGCGC GAAGATGA
|
Protein sequence | MQRPGEFEII RRFFSGQRVA DVNTEIGIGD DCAVLNFTAA ARLALTTDTL VAGIHFFSDT DPTRLGHKAL AVNLSDLAAM GARPRWALLA LTLPENDPEW IAAFAQGFFR LAERHRVQLI GGDTTRGPLA ITVHALGTVD GRPGVKRAGA KPGDAIYLTG SIGLAGLGLR IRQGAYLVPD EEALDRLEAP QPRTDFGIRL GEFASAGIDV SDGLAADLGH ILEQSSVGAE VDWESLPLSA GVGRYVADSG DWQMPLVAGD DYELCFTASP AYAQAIASAA DATATPVRRI GTIRTGSGLI IRRRGRPVEL SRSGYLHFVS GEARR
|
| |