Gene OSTLU_39392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39392 
Symbol 
ID5004887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp423448 
End bp425028 
Gene Length1581 bp 
Protein Length526 aa 
Translation table 
GC content62% 
IMG OID640420308 
Productpredicted protein 
Protein accessionXP_001420686 
Protein GI145352721 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase
[COG0710] 3-dehydroquinate dehydratase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase
[TIGR01093] 3-dehydroquinate dehydratase, type I 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.541352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTGC GCGCGCGCGC GGCGTGCAAA CTGACCACGT CCGTCATCGC GCCGAACCTG 
GAGCTCGCGC TGCGGGACGT GGACGACGCC GTGAGGAAGG GCGCGGACGT CGTGGAGCTC
AGGGTGGATT TCTTGCGCGA CGACGCGGCG CGCGGGACGC TGGGCGACGC GATCGAAGCG
CTGATTAAAG CGTCGCCGGT GCCGGTGATC GTGACGAATC GACCGACTTG GGAGGGAGGG
CAAGATGATG GGGACGAGAG CGCGCGGTTG GAGGCGCTTT GGAGGGCGCA CGAGTGCGGG
GCGGCGTACG TGGATTGCGA GGCGCTCGCG GCGGAGCGAT TCTTCGCGGC GAAACCGGCG
ACGGCGACGC TGAGGAATGG GAAGACTAAG ATCATTTTGA GCTCGCATAA TTACGATGAG
ACGCCGAACG ATGAGATTTT GGCGGAGATT CACGCGAAGT GCGTGCGCTT GGGGGCGGAT
ATCGTCAAGA TGGCGTCGGT GTGTAACGCG GTGGAGGACG TGGCGCGATT GGAGAAGCTC
TTGCGCACGA AGGGAAGGGA GATCGAGACG ATCGTTTTGG GCATGAGCGA GCACGGACAA
GTGTCTCGAT TGCTCGCGGC GAAGTTTGGA AGCTTTTTGA CGTTCGGGGC GATTCGAAGA
GGGGAAGAGA GCGCACCGGG GCAGCCGTTG CTCGAAGAGC TGCGAGATTT GTATCGCGTG
CCGACGCAGA CGGCGGCGAC AAAGGTGATG GGCGTGATCG GGAACCCGAT CGGACACAGT
AAGTCACCGG CGTTACACAA TCCTTGTCTC GCCGCCGCGG GCGTGGATGC GTGCTACGTC
CCGTTACTCG TCAAGGATAT CAAAACCTTT CTCGCGTCGC CGCTCTTCGG GTCGAAGGAC
TTTGTGGGGT TTAGCGTGAC GATTCCGCAT AAGGAAGACG CCTTGGAGTG CTGCGCCGAG
GTCGACCCCG TGGCGAAGCA AATCGGCGCG GTGAACACGT TGGTTCGTCA ACCGGATGGA
TCGTTAAAGG GTTACAACAC CGATTATGTC GCCGCGATCG AGGCTATCGA AAACGCGATG
GAGAAGAAAA CGGGCGTCGC CGCCGCGAAG TCCTTGGCTG GGAAAACAGT CGTCGTCATC
GGCGCCGGCG GTGCGGGCAA GGCTTTGGCG TTCGGCGCTA AGTTTAAGGG CGCCAACGTC
GTCATCGCCA ATCGCAGCGT CGAGCGCGCA CAGGCGCTCG CTGACGCGTG CGGCGGCGTC
GCGGTGTCGC TCGAAGATTT AGCGAGCGGT AGCGTCGTCG GCGACGTCTT GGCCAATAGC
ACCTCGGTCG GGATGCAACC GAACGTTGAA GACACGCCGA CGCCAGCGTC TGTACTCGGA
GGGTTTTCCG TCGTCTTCGA CGCGGTGTAC ACCCCGCTCG AGACGCGGCT CTTGCGCGAA
GCCAAGGCGA GTGGGTGCGA AATCGCGAGC GGGCTGGACA TGTTCGTCGG GCAAGCGGCG
AGGCAGTTCG AGCTCTTCAC CGGGAAAGAG GCCGAGGTTG AGCTCATGCG CGACGCCGTG
TTGTCGAGCA TAAAAAGGTA A
 
Protein sequence
MRVRARAACK LTTSVIAPNL ELALRDVDDA VRKGADVVEL RVDFLRDDAA RGTLGDAIEA 
LIKASPVPVI VTNRPTWEGG QDDGDESARL EALWRAHECG AAYVDCEALA AERFFAAKPA
TATLRNGKTK IILSSHNYDE TPNDEILAEI HAKCVRLGAD IVKMASVCNA VEDVARLEKL
LRTKGREIET IVLGMSEHGQ VSRLLAAKFG SFLTFGAIRR GEESAPGQPL LEELRDLYRV
PTQTAATKVM GVIGNPIGHS KSPALHNPCL AAAGVDACYV PLLVKDIKTF LASPLFGSKD
FVGFSVTIPH KEDALECCAE VDPVAKQIGA VNTLVRQPDG SLKGYNTDYV AAIEAIENAM
EKKTGVAAAK SLAGKTVVVI GAGGAGKALA FGAKFKGANV VIANRSVERA QALADACGGV
AVSLEDLASG SVVGDVLANS TSVGMQPNVE DTPTPASVLG GFSVVFDAVY TPLETRLLRE
AKASGCEIAS GLDMFVGQAA RQFELFTGKE AEVELMRDAV LSSIKR