Gene Information |
Locus tag | Hhal_0895 |
Symbol | |
ID | 4709945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 971853 |
End bp | 972827 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639855364 |
Product | thiamine-monophosphate kinase |
Protein accession | YP_001002473 |
Protein GI | 121997686 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
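The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length counts the stop codon; both conventions are inferred from the numbers in the record rather than stated in it, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 971853
END_BP = 972827


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 975 324
```

Both results agree with the Gene Length (975 bp) and Protein Length (324 aa) fields.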
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0369638 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATCGGCG GAGGCGAAGC GGCGCTCATC CAGCGCTATT TCCACGGTCT GACCGCGCAG CGCGACGGGG TCGAGCTGGG GGTCGGGGAC GACGCGGCAC TGCTCCAGAC CGAGACGGGG CTGCTGGCTG CCTGTTGCGA CACCTTGGTC GAGGACATCC ACTTCCCTGG GGATATCCCA CCGGAGGCGC TGGGTCACCG GGTCCTGGCG GTCAATCTCA GCGATCTGGC CGCCGTCGGT GCCCGACCGG CCTGGACCCT GCTCTCCCTG ACCCTGCCAG AGAAAGACCC GCAGTGGTTG GAGCGGTTCA GCGCCGGATT CAAGGCGCTG GCCGACCGCT ACGGCGTCGC CCTGGTCGGC GGAGACACCA CCCAGGGTCC GCTGTCGGTC TCGGTCACCG CCCTCGGCCA GGTGGCCGGC GATCACGGCC TTCGGCGCGG TGGCGCGCGG CCCGGCGACG GCGTCTGGGT GACCGGTACC CTGGGGGATG CGGCCCTGGG GCTGGAGCTG TGGCAGGAGC GCGAGGAGGC GACCGCACTG GCCGGCGATC CGGCCTACCT GGCGGGCCGG CTGTTCCGCC CCGAGCCGCG GGTGGCGGCC GGTACCGCGC TGCTGGGGCG CGCGAGTGCT GCTATCGACG TCTCCGACGG GCTCGCCGCC GACCTGTCGC GGGTGCTCGA TGAGAGCGGC GTCGGCGCCA CCCTGGAGCT GGAGGCATTG CCCCGCTCCC AGGCGTTCAT CGATGAGCAG GGCGACCTCC GCCACCTGCT CCACGGCGGC GACGACTACG AACTCTGCTT CACCCTGCCG GCGGAGCGGG AGGAGGAGAT GGCCTGCCTG CGCGAGCACG CCGCCACGCC GGTGACGCGC ATCGGCACCG TGGAGGAGAC CCCCGGGCTG CGCGGGGTGG ACGCCGGCGG CGTGGTCTGC GCCCTGGAGC CGGGTGGCTA CGACCACTTC GCGGAGGGGT CGTGA
|
Protein sequence | MIGGGEAALI QRYFHGLTAQ RDGVELGVGD DAALLQTETG LLAACCDTLV EDIHFPGDIP PEALGHRVLA VNLSDLAAVG ARPAWTLLSL TLPEKDPQWL ERFSAGFKAL ADRYGVALVG GDTTQGPLSV SVTALGQVAG DHGLRRGGAR PGDGVWVTGT LGDAALGLEL WQEREEATAL AGDPAYLAGR LFRPEPRVAA GTALLGRASA AIDVSDGLAA DLSRVLDESG VGATLELEAL PRSQAFIDEQ GDLRHLLHGG DDYELCFTLP AEREEEMACL REHAATPVTR IGTVEETPGL RGVDAGGVVC ALEPGGYDHF AEGS
|
| |
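The protein sequence follows from the gene sequence under translation table 11. A minimal sketch of that step is given below, assuming the gene sequence is already in coding orientation (it begins with GTG and ends with TGA, although the gene lies on the minus strand). Table 11 uses the same codon assignments as the standard code, and an initiator codon such as GTG is read as methionine. The set of start codons handled here (ATG, GTG, TTG) is a simplifying assumption, and the function name `translate` is illustrative rather than part of any database tool.

```python
from itertools import product

# Codon table in TCAG order; assignments are shared by the standard
# code and bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used above
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")         # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("GTGATCGGCG GAGGCGAAGC GGCGCTCATC"))  # MIGGGEAALI
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function gives a way to compare the translation against the whole 324-residue entry.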