Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0942 |
Symbol | |
ID | 6314919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 998557 |
End bp | 999516 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642643315 |
Product | GHMP kinase |
Protein accession | YP_001917115 |
Protein GI | 188585570 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG4542] Protein involved in propanediol utilization, and related proteins (includes coumermycin biosynthetic protein), possible kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.977974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.388216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTC AGGTAAAAGT TCCTTTGACC TGTGGAGAAT GGGTTCAAGG CACAATTGAT GGTCAAGATT TTTTGGTGTC ATGTCCAATC AATGCATTTA GCATAGTTAC TTTAGAATTA TTTGCAGTTG ATGAAAGAAC CGAAGTAAAA CCAATTAGTG GTGTAAGATA TGGCCCCCAT GGGTTTCCCC AAATACCTGG GAAAGCAATA CAAGCCCTGC AAAGATTTGT TTCTGAACCT AACAATGACT TAAATATACC TAGTAAAATT AAAATTGCAG ATAAATATGG GATGAATCTA ACTATTAAAT CTGACCTTCC TTCAGCACAA GGTTTTGGAA CCAGCAGTGC AGATATCTAC GGAACATTAT ACAACTTATA TAACTTGATA GGAGCTTCTT TTCATGATCA ATATATCTTA TCAAGTTCCT TGGCCAGGCA AGCAACTAAA ATAGAACCTA GTGATACTAA TTGTTTTCGT CAGCTGACTG CTATGAATCA CAGAACAGGG AAAGGGGCCA CCTATTTAGG AACTGTACCC AAGGGTCAAG TGGCTATTTT AAATTTTTAT GGTTCAGTTG ATACGGAGAA GTTTAATCAA CAAAAAAATC TTAGGGAATT AAATAAAATA AAAGAACCAC AGGTAGAAGC GGCTTACCGC TATTTGCTCC ATGGAATACG CATCAAAGAT TTAAAACTTA TGGCAAAAGG CAGTACTTTA GGTGCTCGTA GTCATCAAGA GGTTTTGCAC CGAGAAGAAG TAGAAAAAAT ATTGGAGCTA TATCCAAAGG CTGGAGCCCT GGGAGTTATT AGAGCGCACA GTGGCACTGC TCTGGGCCTT ATCTACCCAG AGGGCGATTT ATATAAAAAA TACTTTCGAG AATGGTATCT CAAATATCAA ATTGAGGATA CGGCCGAATT TTTAGGGTTT TATAATTTGG TTAATGGCGG GATAGATTAA
|
Protein sequence | MKSQVKVPLT CGEWVQGTID GQDFLVSCPI NAFSIVTLEL FAVDERTEVK PISGVRYGPH GFPQIPGKAI QALQRFVSEP NNDLNIPSKI KIADKYGMNL TIKSDLPSAQ GFGTSSADIY GTLYNLYNLI GASFHDQYIL SSSLARQATK IEPSDTNCFR QLTAMNHRTG KGATYLGTVP KGQVAILNFY GSVDTEKFNQ QKNLRELNKI KEPQVEAAYR YLLHGIRIKD LKLMAKGSTL GARSHQEVLH REEVEKILEL YPKAGALGVI RAHSGTALGL IYPEGDLYKK YFREWYLKYQ IEDTAEFLGF YNLVNGGID
|
| |