Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1157 |
Symbol | |
ID | 3833125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1189739 |
End bp | 1190752 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829088 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_430014 |
Protein GI | 83590005 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000470435 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAACTTA AAGAAGTAGG GGAATTTGGA CTTATAGAAC GCTTAAAGGC CGGGGCCATA GTGGACCCGG CCAGGGTAGT TATGGGGATC GGTGATGACG CCGCCGTATT AAAGGGAGAA CCTGGTTTCC TGACCCTGGC TTCGACGGAT ATGCTGGTGG AGGATATCCA CTTTACCTTG GCCACAGCTA CCCCCCGGGA AATAGGTTAT AAAACCATGG CTGTTAATGT CAGCGATATT GCCGCCATGG GGGGGATACC AGAACAGGCC CTGGTATCCC TGGCTTTAAG GCCGGAGCAG CAGGTCGAGT TTGTGGACGA GCTCTACGCC GGCCTGCGGG AATGTGGCCA GCGTTTTGGA GTAAATATAA TTGGTGGCGA CACGGTCTCT TCCCCCAGGG CCATGGTAAT CAACCTGGCC ATATTGGGAC GGGTGGAAAA TGATGCCTGT CTCTACCGCC ACGGTGCCCA GCCCGGGGAT ATTCTGCTAG TAACCGGCGA CCTGGGGGGC TCTGCCGCAG GTCTGGATAC CCTGCTCAGT CCCCGCCCGG CCCCGGCGGA GGTCATCGCC TGGGCCCGGG CACGCCACTT CCGGCCTACC CCGCGGGTAG TGGAAATCCG TGCCGCCCTG AAGGCCGGTG GGCTCACGGC GGCCGATGAC ATCAGCGACG GCCTGGTAGC GGAGGTCTAT ACCCTGGCGA CGGCTTCCAG GGTGGGAATA GTCCTGGAGG CCGGGGCCAT CCCCATCGCT CCGGCCACCC GGCAATTGGC GGCTATATAC CATAAGGAGC CCCTGGATTA CGCCCTTTAT GGCGGCGAAG ACTTTGAGCT CCTGCTGGCC TGCCGGCCGG ATAAAGTAGA CGCCGTTCGG GAAGCTGTGA ACCGGGCCTG CAGGACGCCG GTGACGGTCA TCGGCAGGGT CGTCCCGGCA GAAGAGGGTA TTACCATAAA CCACAGCGGC CGAGTGCTAC CCTTGACGCC AGGGGGTTAT AACCACTTCC AGCTGGATAG ATAA
|
Protein sequence | MELKEVGEFG LIERLKAGAI VDPARVVMGI GDDAAVLKGE PGFLTLASTD MLVEDIHFTL ATATPREIGY KTMAVNVSDI AAMGGIPEQA LVSLALRPEQ QVEFVDELYA GLRECGQRFG VNIIGGDTVS SPRAMVINLA ILGRVENDAC LYRHGAQPGD ILLVTGDLGG SAAGLDTLLS PRPAPAEVIA WARARHFRPT PRVVEIRAAL KAGGLTAADD ISDGLVAEVY TLATASRVGI VLEAGAIPIA PATRQLAAIY HKEPLDYALY GGEDFELLLA CRPDKVDAVR EAVNRACRTP VTVIGRVVPA EEGITINHSG RVLPLTPGGY NHFQLDR
|
| |