Gene OSTLU_40076 details


Gene Information

Locus tag: OSTLU_40076
Symbol:
ID: 4999383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 1106926
End bp: 1107939
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table:
GC content: 59%
IMG OID: 640414804
Product: predicted protein
Protein accession: XP_001416028
Protein GI: 145341875
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0447] Dihydroxynaphthoic acid synthase
TIGRFAM ID: [TIGR01929] naphthoate synthase (dihydroxynaphthoic acid synthetase)
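
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes one terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1106926
end_bp = 1107939
gene_length_bp = 1014
protein_length_aa = 337


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(cds_length: int) -> int:
    """Amino-acid codons in an unspliced CDS, assuming one stop codon is included."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)      # True
print(coding_codons(gene_length_bp) == protein_length_aa)   # True
```

Because the gene is listed as unspliced, the whole 1014 bp span is treated as coding sequence here; a spliced gene would need its exon coordinates instead.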


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGACG CGAGACGTCG CGTGGCGCAA ATTGCGAATC ACGTCACGGC TGCCGACGAC 
GGCTTCTCGC GACAAACCTA CGCCCGCGCC GACGCCGCGC GCGTGAGCTC GTACGAGCGC
GTGCACGGTG ACGTCAGTCG TGATCCCGTT TCGTGGGTGA AGTGCGCGCC GGGCGCGGAT
GAACGCTCGG TGGCGTCGCA TTACGAATTG CGCGACGTGA TTTATGAAAA GTCTCCAGAG
GGCATCGCGC GGGTGACCAT AAATCGACCC GAACGTAGAA ACGCGTTCAC GCCGCGAACG
GTGAAGGAGA TGCGATGGTG CATGGACGAC GCGAGAGATG ATATGACGAT CGGGGTCGTG
GTGATGCGCG GGATGGGAGA TCTGGCGTTT TGTAGCGGCG GCGATCAGAG CGCGAGGGGC
GACGGCGGAT ACGTCGACGC CAAGGCGGGA GGAGCGGAGG AGACGCCGAG ATTGAATGTG
TTGGACTTAC AGATGCAGAT ACGAAGGATG CCGAAACCCG TGATCGCGAG CGTGGCGGGG
TACGCGGTCG GAGGAGGACA CATTCTGCAC ATGGTGTGCG ATCTGACCAT CGCCGCGGAT
AACGCCGTGT TCGGCCAGAC GGGGCCAAAG GTGGGATCGT TCGACGCCGG TTACGGAAGT
ACGCACATGG CGCGGTTGAT AGGTCAAAAG AAGGCGAGAG AGATGTGGTT CTTAGCGCGT
TTATACAACG CGAGCGATGC GTTGAAGATG GGATTGGTGA ACACGGTGGT ACCTTTAGCC
GAACTCGAGA CGGAGACGGC GGTGTGGTGT CGAGAGATTT TGCGCAATTC GCCGACGGCA
ATTCGACTGT GTAAAAATGC ATTGAATGCG GCCGAGGACG GGCAAGCGGG CATTCAAGAT
CTCGGTGGAA GCGCAACGCT GCTATTTTAT CAATCAGAAG AGGGTAACGA AGGTCGACGA
GCGTTCTTAG AAGGGCGCAA GCCAGACTTT TCCAAGTTTA AACGATTTCC GTAG
 
Protein sequence
MDDARRRVAQ IANHVTAADD GFSRQTYARA DAARVSSYER VHGDVSRDPV SWVKCAPGAD 
ERSVASHYEL RDVIYEKSPE GIARVTINRP ERRNAFTPRT VKEMRWCMDD ARDDMTIGVV
VMRGMGDLAF CSGGDQSARG DGGYVDAKAG GAEETPRLNV LDLQMQIRRM PKPVIASVAG
YAVGGGHILH MVCDLTIAAD NAVFGQTGPK VGSFDAGYGS THMARLIGQK KAREMWFLAR
LYNASDALKM GLVNTVVPLA ELETETAVWC REILRNSPTA IRLCKNALNA AEDGQAGIQD
LGGSATLLFY QSEEGNEGRR AFLEGRKPDF SKFKRFP
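
The gene and protein sequences above are related by translation, and the GC content figure is derived from the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. The Translation table field is blank in this record, so the standard genetic code (NCBI table 1) is assumed; the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Standard genetic code (NCBI translation table 1), assumed because the
# record's Translation table field is blank. Codons are ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGACGACG CGAGACGTCG CGTGGCGCAA"))  # MDDARRRVAQ
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the listed protein sequence and the reported GC content. The sketch does not handle ambiguous bases such as N.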