Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2377 |
Symbol | ispG |
ID | 3784968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2703718 |
End bp | 2704968 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637812466 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_413058 |
Protein GI | 82703492 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTATGG TTCAAAGTGC TTTTCCCCCA CGCCGGAACA GCGTGGGCGT TCAGGTAGGT TCGATCCGGA TCGGCGGGGG CGCGCCCATC GTAGTGCAGT CCATGACCAA TACCGATACG GAAGATGAAA TCGCTACCAC CGTGCAAGTG GCCCAGCTTG CGCGTGCCGG ATCCGAACTC GTGCGCATCA CCGTCAATAC GGCCGAAGCA GCCAGGGCGG TGCCGGGTAT CAGGGCGCGA CTCGACGATA TGGGTTGCCA AGTTCCCCTG ATAGGCGATT TTCACTTCAA TGGCCATAAA CTCGTGACTG AGTATCCCGG TTGCGCCCGT GCGCTGGCGA AATACCGTAT CAATCCCGGT AATGTCGGGC ACGGAAAGAA ACGTGACGAA CAGTTTTCCA TTCTGATCGA AGCGGCCTGC AAGTATGAAA AACCGGTGCG CATCGGGGTC AACTGGGGAA GCCTCGATCC AGAGCTGCTG GCGCGCATGA TGGACGAGAA TGCCCGCTCC GGGGACCCGA GGGATGCCTC CGCGGTAATG TACGAAGCCT TGATTACCTC TGCGCTTCAA AGCGCTGAGC GTGCGGAGGA GATCGGGCTG GGGCGTGACA GAATCATATT GTCATGCAAG ATGAGCGGCG TGAGAGACCT CATTACCGTT TATCGCGCCC TTGCGGCCCG CTGCGATTAT GCGCTGCACC TGGGGCTCAC CGAGGCGGGC ATGGGTTCGA AGGGGATTGT TGCTTCCACG GCGGCATTGT CGGTACTGCT TCTCGAAGGT ATCGGCGATA CGATACGGAT ATCGTTGACG CCCGAACCAG GCGGAGACCG CGCGCGCGAA GTGGTCGTGG CCCAGGAGAT ACTGCAAACC ACCGGTTTGC GCGCTTTTGT GCCTCTGGTT GCCGCCTGTC CCGGCTGCGG CCGTACCACC AGCACCTATT TTCAGGAGCT GGCGGAAAGC ATCCAGGGCT ACGTGCGCGA GCAGATGCTG GTATGGCGCG AGGAATACGA AGGTGTGGAA AATATGACCC TCGCTGTGAT GGGGTGCGTG GTCAATGGTC CCGGCGAAAG CAAGCATGCC AATATCGGCA TCAGCCTGCC GGGCTCGGGG GAACGGCCTG TGGCGCCGGT ATTTGTGGAT GGCCAGAAGG CTGTAACGCT GAAGGGCGAT AATATTGCAG GAGAGTTTCG CCAGATAGTC GATGAATATG TGCAGATGAA ATACCCCAAG AAAGCAGTCG ATGCCCACTA G
|
Protein sequence | MSMVQSAFPP RRNSVGVQVG SIRIGGGAPI VVQSMTNTDT EDEIATTVQV AQLARAGSEL VRITVNTAEA ARAVPGIRAR LDDMGCQVPL IGDFHFNGHK LVTEYPGCAR ALAKYRINPG NVGHGKKRDE QFSILIEAAC KYEKPVRIGV NWGSLDPELL ARMMDENARS GDPRDASAVM YEALITSALQ SAERAEEIGL GRDRIILSCK MSGVRDLITV YRALAARCDY ALHLGLTEAG MGSKGIVAST AALSVLLLEG IGDTIRISLT PEPGGDRARE VVVAQEILQT TGLRAFVPLV AACPGCGRTT STYFQELAES IQGYVREQML VWREEYEGVE NMTLAVMGCV VNGPGESKHA NIGISLPGSG ERPVAPVFVD GQKAVTLKGD NIAGEFRQIV DEYVQMKYPK KAVDAH
|
| |