Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_07341 |
Symbol | ispG |
ID | 4780409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 676176 |
End bp | 677399 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640084009 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001014557 |
Protein GI | 124025441 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00679645 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTGCGA CCCTAGAAAA AAATATGGAA GGAAATAATC TCTCTCAGCG ATATAGCACC AGAATTATTC GTAGAGATAC TAGGCCAGTA ATGGTCGGTG ATATTGGCAT CGGTGGAGAT AATCCAGTGC GTGTTCAATC AATGATTAAT GAAGATACGA TGGATATCGA GGGTTCAACG GCCGCAATAA GGAGATTGCA TGAAGTTGGA TGTGAGATCG TCAGATTAAC AGTGCCAACT CTTGCAAGTG CAAAAGCTGT GGGGGAAATC AAGAAACTTC TAGCTAGCAC TTATCAACCA GTTCCTTTAG TAGCCGATGT TCATCATAAT GGAATGAAAA TAGCCTTAGA AGTAGCTAAG CATGTAGATA AAGTTCGTAT CAACCCTGGA TTATTCGTTT TTGAAAAACC TGATCCAAAT AGAACTGAAT TTACTAAAGA TGAAATTGAT GTAATTAAAG AGAAGATCAT ACAAAAATTC AAACCAATTG TTAATACTTT AAAAGAGCAA AATAAGGCAC TCAGAATAGG CGTTAACCAT GGATCTTTGT CTGAAAGAAT GTTATTTGCT TATGGAGATA CTCCATTTGG AATGGTTGAA TCAGCTATGG AATTTATTCG AATATGTCAT TCATTAGATT TTCATAATAT TGTAATTTCG ATGAAGGCTT CTCGAGCTCC CGTGATGCTT GCAGCTTATA GAATGATGGC TGACACAATG GACAAAGAGG GATTTAATTA TCCTCTGCAT TTAGGTGTAA CGGAAGCGGG AGACGGGGAT TATGGAAGAA TTAAAAGTAC GGTAGGGATA GGGACATTAT TATCCGAAGG TATTGGAGAT ACCATTAGAG TTTCTTTAAC AGAGGCGCCC GAAAAAGAAA TACCAGTTGC ATATTCAATT TTACAAGCGG TTGGTTTGAG AAAAACCATG GTTGAATATA TTAGTTGTCC TAGTTGTGGT AGAACATTAT TTAATTTAGA GGAAGTTGTA GCAAGAGTTA GAGACGCTAC TCAACATTTA ACCGGTTTGG ATATTGCTGT AATGGGTTGC ATCGTTAATG GGCCTGGAGA GATGGCAGAT GCTGATTATG GTTATGTAGG AAAAGGTGTT GGAACCATTG CTCTTTATAG GAATAGAGAT GAAATTAAGA GGGTACCTGA GGATGAAGGC GTTCAGGCAT TGGTTGATTT AATTAAAGAG GATGGTAAAT GGGTAGATCC TTAA
|
Protein sequence | MIATLEKNME GNNLSQRYST RIIRRDTRPV MVGDIGIGGD NPVRVQSMIN EDTMDIEGST AAIRRLHEVG CEIVRLTVPT LASAKAVGEI KKLLASTYQP VPLVADVHHN GMKIALEVAK HVDKVRINPG LFVFEKPDPN RTEFTKDEID VIKEKIIQKF KPIVNTLKEQ NKALRIGVNH GSLSERMLFA YGDTPFGMVE SAMEFIRICH SLDFHNIVIS MKASRAPVML AAYRMMADTM DKEGFNYPLH LGVTEAGDGD YGRIKSTVGI GTLLSEGIGD TIRVSLTEAP EKEIPVAYSI LQAVGLRKTM VEYISCPSCG RTLFNLEEVV ARVRDATQHL TGLDIAVMGC IVNGPGEMAD ADYGYVGKGV GTIALYRNRD EIKRVPEDEG VQALVDLIKE DGKWVDP
|
| |