Gene Information |
Locus tag | Mpe_A0210 |
Symbol | |
ID | 4783994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 227595 |
End bp | 228566 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640088760 |
Product | thiamine-phosphate kinase |
Protein accession | YP_001019407 |
Protein GI | 124265403 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
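The length fields in this record can be cross-checked against the coordinates. The sketch below is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(227595, 228566)
print(length_bp, protein_length(length_bp))
```

With the coordinates listed for Mpe_A0210, this reproduces the stated 972 bp gene length and 323 aa protein length.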
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.631674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCG GCGAGTTCGA CCTGATCGCG CGCTACTTCG GCCGCCCGAC CCGCCGTGCG GCGCTCGGGC CGGGCGACGA TTGCGCGCTG CTCGCACTGC GGCCCGGCAT GCAGCTCGCG GTGTCCTGCG ACATGCTGGT CGAAGGCCGG CACTTCCTGC CCACCGTGGA TCCGCGCCGC CTGGGCCACA AGGCGCTGGC GGTCAATCTC AGCGATCTCG CCGCCTGCGG CGCACAGCCG CTGGCCTGCA CGCTGGCGCT GGCGCTGCCG CGGGTCGACG AGGCCTGGCT CGACGGCTTC TCGGCCGGGC TGTACGAGCT GGCCGATGCA CACGGCTGCG AACTGATCGG CGGCGACACC ACGGCCGGCC CGCTCAACAT CTGCATCACA GTGTTCGGTG AGGTCCCAGC GGGCGAGGCG CTGCTGCGCA GCAGCGCGGA GCCGGACGAC GAGATCTGGG TCAGCCACGC GATCGGTGGG GGCATCGGTG ACGCGCGGCT CGCGCTCGAG GCCTTTCGCG GTCAGGTGTC GCTGCCCGGC GCGGTGTTCG AGCGGGTGCG GCGTGCGATG GAACAGCCGC AGCCGCGCGT GGCGCTGGGC CTTGCACTGC GTGGCGTGGC CCGTGCCGCC GTCGACGTGT CGGACGGCCT GCTGGGCGAC CTGTCCCACC TCCTGCACCG CAGCGACGTC GGTGCCAGCA TCGATGTCGG TCGCGTGCCG CGCAGCGCCG ACCTCGCGGC CCAGCCGCTG GCCTTGCAGC GCCTGTGCAC GCTGGCCGGA GGCGACGACT ACGAGCTCGT CTTCACGGCG CCGCCAGCGG CCCATGCGCG GGTGCTCGAG GCCGCGCGCG GGGCCGACGT GGCGGTCAGC CGAATCGGCC GCATCGAGCA CGAGGCGGGC CTGCGCCTCG TCGACGCCGA CGGCCAGGCC GTGGATCGCG TCTGGGCCTC GTTCGATCAC TTCTCCGCAT GA
|
Protein sequence | MSLGEFDLIA RYFGRPTRRA ALGPGDDCAL LALRPGMQLA VSCDMLVEGR HFLPTVDPRR LGHKALAVNL SDLAACGAQP LACTLALALP RVDEAWLDGF SAGLYELADA HGCELIGGDT TAGPLNICIT VFGEVPAGEA LLRSSAEPDD EIWVSHAIGG GIGDARLALE AFRGQVSLPG AVFERVRRAM EQPQPRVALG LALRGVARAA VDVSDGLLGD LSHLLHRSDV GASIDVGRVP RSADLAAQPL ALQRLCTLAG GDDYELVFTA PPAAHARVLE AARGADVAVS RIGRIEHEAG LRLVDADGQA VDRVWASFDH FSA
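The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not the database's own pipeline. It assumes the sequence uses only the letters A, C, G, and T, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record above.
prefix = "ATGAGCCTCG GCGAGTTCGA CCTGATCGCG"
print(translate(prefix))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the listed protein sequence and the GC content field.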
|
| |