Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07901 |
Symbol | ispG |
ID | 5730096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 694776 |
End bp | 696014 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285154 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_001550675 |
Protein GI | 159903331 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.301169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00960397 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTTCAG TTTCTTCCCT AGAATTGAAC TCCGAATTGC TCTCTCGTCG GTACAGCACT CAGATTCACC GGCGCCCAAC AAGGACTGTA ATGGTAGGTG ACATACCTAT TGGTAGTGCG CATCCTGTTA GTGTCCAGTC AATGATTAAC GAAGATACAT TAGATATAGA AGCTTCTACT GCTGCCATTC GCAGATTGCA TGAAATTGGC TGTGAAATCG TTCGATTAAC TGTCCCATCA CTTTCTCATG CGAAGGCGGT TGGGGAAATC AAGCAGAAAC TAGAAAGAAA TTATAAACCT GTCCCTTTGG TCGCAGATGT TCACCATAAC GGTATGAAAA TAGCTCTGGA AGTTGCTAAG CATGTAGACA AAGTAAGAAT TAACCCTGGC CTTTTTGTTT TTGAAACCCC CGATCCTAAT AGAACAGAAT TCTCTGAGGA AGAGATGCAG TTAATAAAAG AAAAGATAGT TTCTAAATTT GAACCAATAG TAAATACTCT TAAAGCTCAA AATAAAGCCT TAAGAATAGG AGTTAACCAC GGTTCCTTGG CTGAAAGGAT GTTGTTTGCT TATGGCGATA CCCCTTTAGG GATGGTTGAA TCAGCAATGG AGTTTGTTCG AATTTGTGAT TCTTTAGACT TCCATAATAT TGTTATATCT ATGAAAGCTT CTAGGCCGCC AGTAATGCTT GCTGCATATA GAATGATGGC TGACAGAATG GATAAAGAAG GCTTTAATTA TCCATTACAT CTAGGAGTAA CTGAAGCAGG TGATGGGGAT TATGGAAGAA TTAAAAGTAC AGTAGGTATA GGTACTCTTT TGTCTGAGGG TATAGGTGAT ACGATCAGAG TTTCTCTCAC CGAAGCTCCT GAGAAAGAGA TACCAGTAGC ATATTCAATA TTACAAACTG TCGGTTTAAG GAAGACGATG GTTGAATATA TAAGCTGTCC TAGTTGTGGA AGAACATTGT TTAATCTTGA AGAAGTAGTA GCTAAAGTTA GAGATGCTAC TTCTCATTTA ACAGGCTTAG ATATTGCTGT TATGGGATGT ATTGTTAATG GTCCTGGAGA GATGGCAGAT GCAGACTATG GTTATGTAGG AAAGGGGAAG GGAGTTATTG CTCTTTATCG TGGTAGAGAC GAAATTCGTA AAGTTCCCGA GGAGGATGGT GTTAGTGCTT TAGTTGACTT AATAAAGCAA GATGGTAAAT GGCTTGAACC TGAAGAAGCG AAATTATGA
|
Protein sequence | MTSVSSLELN SELLSRRYST QIHRRPTRTV MVGDIPIGSA HPVSVQSMIN EDTLDIEAST AAIRRLHEIG CEIVRLTVPS LSHAKAVGEI KQKLERNYKP VPLVADVHHN GMKIALEVAK HVDKVRINPG LFVFETPDPN RTEFSEEEMQ LIKEKIVSKF EPIVNTLKAQ NKALRIGVNH GSLAERMLFA YGDTPLGMVE SAMEFVRICD SLDFHNIVIS MKASRPPVML AAYRMMADRM DKEGFNYPLH LGVTEAGDGD YGRIKSTVGI GTLLSEGIGD TIRVSLTEAP EKEIPVAYSI LQTVGLRKTM VEYISCPSCG RTLFNLEEVV AKVRDATSHL TGLDIAVMGC IVNGPGEMAD ADYGYVGKGK GVIALYRGRD EIRKVPEEDG VSALVDLIKQ DGKWLEPEEA KL
|
| |