Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_1437 |
Symbol | |
ID | 6314860 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1503317 |
End bp | 1504387 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642643817 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001917608 |
Protein GI | 188586063 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.198015 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0526767 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAGAC GTCATAGAAC TTTTCCTGTA AAAGTTGGAT CTGTCACTAT TGGAGGAACT GCTCCGGTTA CCATCCAATC AATGACCAAT ACAGATACCA AAGATACTGA AAAAACTTTA GCCCAAATCA AGGATTTGGT TGAAGCTGGT TGCCAATTAG TTAGAGTGGC GATTCCTGAT GAAGAATCTG TTAACAGTTT TAAAACTTTA ACACAGTTTT CACCAGTCCC TTTGATTGCG GACATCCACT TTAGTTATCA ATTAGCAATC AAAGCAATAG AGGCAGGTGC CAGCAAAATT AGAATAAACC CCGGTAATAT TGGGTCACGT CAAAGAGTGG CAAAGGTTGT CGAAAAAGCC AAGACTCACA ATGTGCCAAT TCGAGTGGGC ATTAATTCAG GATCAGTTGA AAAAAACCTC CTCCAAAAAT ATGGTGGACC TACTCCAAGT GCATTGGTGG AAAGCGCTGT CAATAATGTC ATGATGTTAT CTGAGATGGG TTTTGGCGAT GTAGTAGTAT CCATTAAAGC TTCGGATGTC AATACTACAG TTAAAGCTAA CCAAGAGTTT GCCACAAGAT TACCTAATCC TTTGCATTTA GGCATAACAG AGGCAGGGAC TATTAAACAA GGTACTATTA AGAGTTCTGT AGGAATTGGA ACTTTACTAT CTCATGGAAT TGGAGATACA CTGCGAGTTT CATTATCGGG TTCTCCTATA GAAGAAGTTT CTGTTGCGCG CGGAATATTA TCTTCTTTGA ATTTAGCTGA AGGGCCTCGG ATTGTATCAT GCCCTACTTG TGCCAGATCA AATATATCTG TCGAGGACTT GGCCAGTACC GTAGAAGACA GGCTAAAAGA TCTCAACACT TCTCTCACTG TTGCTGTTAT GGGTTGTGAA GTCAATGGTC CAGGTGAAGC CAAGGAAGCT GATATAGGAA TTGCTGGCAG TAAAGAATAT GGGGTACTGT TCAAAAAGGG AAAGATTATA GATAGAGTAC CTAAAAATCA ATTACTTGAA GTCCTGTCTC GAGCCATCGA TGAATATATT GATGAGATTC ACAGCAAGTA A
|
Protein sequence | MSRRHRTFPV KVGSVTIGGT APVTIQSMTN TDTKDTEKTL AQIKDLVEAG CQLVRVAIPD EESVNSFKTL TQFSPVPLIA DIHFSYQLAI KAIEAGASKI RINPGNIGSR QRVAKVVEKA KTHNVPIRVG INSGSVEKNL LQKYGGPTPS ALVESAVNNV MMLSEMGFGD VVVSIKASDV NTTVKANQEF ATRLPNPLHL GITEAGTIKQ GTIKSSVGIG TLLSHGIGDT LRVSLSGSPI EEVSVARGIL SSLNLAEGPR IVSCPTCARS NISVEDLAST VEDRLKDLNT SLTVAVMGCE VNGPGEAKEA DIGIAGSKEY GVLFKKGKII DRVPKNQLLE VLSRAIDEYI DEIHSK
|
| |