Gene NATL1_07341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_07341 
SymbolispG 
ID4780409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp676176 
End bp677399 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content38% 
IMG OID640084009 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_001014557 
Protein GI124025441 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00679645 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGCGA CCCTAGAAAA AAATATGGAA GGAAATAATC TCTCTCAGCG ATATAGCACC 
AGAATTATTC GTAGAGATAC TAGGCCAGTA ATGGTCGGTG ATATTGGCAT CGGTGGAGAT
AATCCAGTGC GTGTTCAATC AATGATTAAT GAAGATACGA TGGATATCGA GGGTTCAACG
GCCGCAATAA GGAGATTGCA TGAAGTTGGA TGTGAGATCG TCAGATTAAC AGTGCCAACT
CTTGCAAGTG CAAAAGCTGT GGGGGAAATC AAGAAACTTC TAGCTAGCAC TTATCAACCA
GTTCCTTTAG TAGCCGATGT TCATCATAAT GGAATGAAAA TAGCCTTAGA AGTAGCTAAG
CATGTAGATA AAGTTCGTAT CAACCCTGGA TTATTCGTTT TTGAAAAACC TGATCCAAAT
AGAACTGAAT TTACTAAAGA TGAAATTGAT GTAATTAAAG AGAAGATCAT ACAAAAATTC
AAACCAATTG TTAATACTTT AAAAGAGCAA AATAAGGCAC TCAGAATAGG CGTTAACCAT
GGATCTTTGT CTGAAAGAAT GTTATTTGCT TATGGAGATA CTCCATTTGG AATGGTTGAA
TCAGCTATGG AATTTATTCG AATATGTCAT TCATTAGATT TTCATAATAT TGTAATTTCG
ATGAAGGCTT CTCGAGCTCC CGTGATGCTT GCAGCTTATA GAATGATGGC TGACACAATG
GACAAAGAGG GATTTAATTA TCCTCTGCAT TTAGGTGTAA CGGAAGCGGG AGACGGGGAT
TATGGAAGAA TTAAAAGTAC GGTAGGGATA GGGACATTAT TATCCGAAGG TATTGGAGAT
ACCATTAGAG TTTCTTTAAC AGAGGCGCCC GAAAAAGAAA TACCAGTTGC ATATTCAATT
TTACAAGCGG TTGGTTTGAG AAAAACCATG GTTGAATATA TTAGTTGTCC TAGTTGTGGT
AGAACATTAT TTAATTTAGA GGAAGTTGTA GCAAGAGTTA GAGACGCTAC TCAACATTTA
ACCGGTTTGG ATATTGCTGT AATGGGTTGC ATCGTTAATG GGCCTGGAGA GATGGCAGAT
GCTGATTATG GTTATGTAGG AAAAGGTGTT GGAACCATTG CTCTTTATAG GAATAGAGAT
GAAATTAAGA GGGTACCTGA GGATGAAGGC GTTCAGGCAT TGGTTGATTT AATTAAAGAG
GATGGTAAAT GGGTAGATCC TTAA
 
Protein sequence
MIATLEKNME GNNLSQRYST RIIRRDTRPV MVGDIGIGGD NPVRVQSMIN EDTMDIEGST 
AAIRRLHEVG CEIVRLTVPT LASAKAVGEI KKLLASTYQP VPLVADVHHN GMKIALEVAK
HVDKVRINPG LFVFEKPDPN RTEFTKDEID VIKEKIIQKF KPIVNTLKEQ NKALRIGVNH
GSLSERMLFA YGDTPFGMVE SAMEFIRICH SLDFHNIVIS MKASRAPVML AAYRMMADTM
DKEGFNYPLH LGVTEAGDGD YGRIKSTVGI GTLLSEGIGD TIRVSLTEAP EKEIPVAYSI
LQAVGLRKTM VEYISCPSCG RTLFNLEEVV ARVRDATQHL TGLDIAVMGC IVNGPGEMAD
ADYGYVGKGV GTIALYRNRD EIKRVPEDEG VQALVDLIKE DGKWVDP