Gene OSTLU_30717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30717 
Symbol 
ID5000764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp639526 
End bp641346 
Gene Length1821 bp 
Protein Length533 aa 
Translation table 
GC content57% 
IMG OID640416185 
Productpredicted protein 
Protein accessionXP_001416719 
Protein GI145344396 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.990452 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CTCGGACGCG CGCGTCGACG ACGACGCGGA CGATGACGAC GGCGACGATG ATCTCGACCG 
CGCGCGCGCC CGCGTCGGCG ACGCGACGAC GCGCGCGCGG CGTCGCGACG ACGACGGCGA
CGCGACGCGC GGTGGCGACC GCGGCGGCGA ACGAAAAGGC GAGCGTGAAC GCGGCGACGG
CTGAGCGCGA CGCGACGACG ACGATGGAAG CGAAGAACTG GGATCTTCCG CGCGCGCCGT
ACGAGGCCGG GAACGCACCG GCGAATGGGA CGTGGGCGCC GACGAATCAA ACGATTAAAG
ATTGGCGCTC GGTGTATCGC AACACGACGC TGAGCGTCAC GGGCGAGGGC GAACCGAAGG
CGACGATCGT CGGGGAGATC CCGAAAGAGT TGAAAGGGAC GCTGTTTCGC AATGGACCGG
GGTTGTTGGA GATTTACGGG AAGAAGCTGA ATCAATTCTT CGATGGCGAT GGGTTGGTGT
ACGCGGCGGC GTTTGAGAAT GGCGACGTGA AATTCAGGCA CAATTTTGTC GGCACAAAGG
GATTCACCGA AGAGCAAGCG GCGCAAAAGA TGATTTACAA GGGCGCTTTC GCGATTGGTA
ACCCCAAGGG TGATTCGCTT TACAACCCTT TCGATTTCGA TGTTAAGAAC GTGGCTAACA
CTGGCGTCGT CAACTGGGGC GGCGAGACGT ACGCGTTGTG GGAAGGTGGC AAACCGCACA
AGATGCGCTC GGAGGATTTG AAGACTGTCG GCGAGGTTGA TGAAGTTCTC GGCACGAAGC
TTCGTGTGCC GCAAATGGCC GCGCACTACA GAATTTGGGA AGGTGCAACC GAGGAAGAGA
AACGCCTGAT CGGCTTTAGC ATCGAAAGCC AAGCGTCTTT TCCGCAAAAC GTCAACCGCG
CCGCGTTTTA CGAGTGGGAC TCCCAAGGCA AGGCGACGGC TGTGAATAAC TTTGACGTCC
CGAACGCCAC TTTTGGATTC TTTCACGATT GCCTCGTCAC GGAGAACTGG TACTTTTTGT
TCCAAAATCC CACGAGTTTG AACCCGCAAA AGCTTCTCAC CGAGTACATG TTTGCCAAGT
GCGCGTTGGC GGAGACGATT TGCATGGTGG ATGGAAAGAG CGCAATTTTG CACTGCTTGC
CTCGCACGGA GAAGGCGCGT AAGATTGGAC AAAAGGCCAT CGAGCTCAAA CCGCAATTCT
TCTTTCACCA CATCAACGCA TTCGAGCGCG AGAACGGCGA CGTCGTCTTG GACTCCTTGC
CTTGGGTCTT TATGGACTTT TCGATGAATC TCGACTCGGT GAATCCGGCG TTTTTCAACG
GTGGTTCCAG AGTCGAGTAC ACGCGATTCG TTGTCAATCC AGTGCGTGAG ACGTGCAAGC
AGACGGTATT GAACAATCGT GTTGGGGAAT TTCCGACGAT GAACAACAAG TTTGTTGGTC
GTAAATACAC GCACGCTTAC AGCGCGGGAA CCATCGTCAA GGATGACGTC AAGTGGGGCC
CGAATCAGTG CTTGGTGAAA CATACCACGG ACCCGAACAG CGAGGCGCCG GCGCAAGAGC
AAGTTTGGTA CTATGGCGAA CGGTGTTTTG TGCAGGAACC GATCTTTGCG CCACGTCCTG
GAGCGACAAA GGAAGACGAG GGATGGATTC TGCAAGTCGT GAACGACGCC GGCGCAGAGA
CGTCGACGCT CCACATCTTC GACGCGTTGG ACATCACGAA GGGACCGGTT GCAAGCGTGC
TGTTCGAGGG CGAGCACTTG CCGCCAGGAC TCCACGGCAT GTGGGCGCAA GACGCTGTGT
ATTAGGATTT AGCAGTACAC C
 
Protein sequence
MEAKNWDLPR APYEAGNAPA NGTWAPTNQT IKDWRSVYRN TTLSVTGEGE PKATIVGEIP 
KELKGTLFRN GPGLLEIYGK KLNQFFDGDG LVYAAAFENG DVKFRHNFVG TKGFTEEQAA
QKMIYKGAFA IGNPKGDSLY NPFDFDVKNV ANTGVVNWGG ETYALWEGGK PHKMRSEDLK
TVGEVDEVLG TKLRVPQMAA HYRIWEGATE EEKRLIGFSI ESQASFPQNV NRAAFYEWDS
QGKATAVNNF DVPNATFGFF HDCLVTENWY FLFQNPTSLN PQKLLTEYMF AKCALAETIC
MVDGKSAILH CLPRTEKARK IGQKAIELKP QFFFHHINAF ERENGDVVLD SLPWVFMDFS
MNLDSVNPAF FNGGSRVEYT RFVVNPVRET CKQTVLNNRV GEFPTMNNKF VGRKYTHAYS
AGTIVKDDVK WGPNQCLVKH TTDPNSEAPA QEQVWYYGER CFVQEPIFAP RPGATKEDEG
WILQVVNDAG AETSTLHIFD ALDITKGPVA SVLFEGEHLP PGLHGMWAQD AVY