Gene OSTLU_3050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3050 
Symbol 
ID5003496 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp664241 
End bp665770 
Gene Length1530 bp 
Protein Length467 aa 
Translation table 
GC content57% 
IMG OID640418917 
Productpredicted protein 
Protein accessionXP_001419451 
Protein GI145350080 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.601284 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AACTTCGCGC CGGTGGAGGG CGAGCTCGAG ACGCCCGTCG CGTGCGTCGT CGCGCGCGGG 
CGGTTGCCGG ACGATTTGGA CGGGTTGTAC CTGCGGAACG GCCCGAACGC GCGATTTCGA
CCGGCGCTCG GGACGAATCG GTATCACTGG TTCGACGGCG ACGGCATGGT GCACGCGATT
CGTTTGCGCG GCGAGGGCGA ACGAGCGGAG TATACAAGGC GGTACGTGCG CACGCGCGGT
TTCGAGCGAG AAGAAAAGGC GAACGCGGCG CTGTACACGG GGCTCCGGGA TATAAATCCG
ATTTGGCGGT ACTTGCTGCC GAGGTTGTTG GAAAAGATGA CCTTGGACGT TCGGCAGCCG
GATTCGGCGT TCTTCGTCAT TCAGTCCAAG AATACGAGTA GTAACGGATT GACGCATCAT
GCGGGGCGAT TGTTAGCGAC GTACGAGAGC GGTTCGCCGT ACGAGATCGC GTTAGAGCCG
ACGCTGCGCA CGAAAGGGCT GTGCGATTTC AATCAGACGT TTGGCACGAT GGATTATTGG
CTGGACAATT TCACCGCGCA TTCGAAGACG TGCCCGATGA CGGACGAGTT AATTTACATC
GGGTACAATC TCGTGGCGCT GAGCGGCGAG CAGGATGGGC AGACGACGAT CACGGTTGGC
GTGATCGACG GCGAGACGGG GAAACGCACG CACCGGCGGC AATTTAAAGT GCCTCGACCC
TCGATGCAAC ACGACGTCGC CATCACGCCG ACAAAGACGG TGCTGATCGA TGGGCCGTTG
ATCTTCAACT TGCCGCGCGT CATCGAAGGC GGACTGCCGT TTAGCTTTGA AAGAGAATGC
ACGTTGCGTA TCGGATATCT CCCACGAAGA GGTGAGGAAG GGCCGTTTTG GATTGACACT
GGCGAGACGT GCTTTGCGTA TCACGTCGTG AACGCGTACG AAGAAGGAAA TATTCTGACG
TTGGATGTGT GCAAAGCCGA CGAAACGAAC GCGTTGGGGA TGTGCCAAGA GTCGAACGTG
CCGCGTTCAA CGCCGGCGAA GAATCCAGTG AACGCCGGTC GCGACGTTGC GGCGTTGTGG
AGATGGCAAA TCGACACCGA CGCTAACGCG ATAATATCGA GCAAGCGCCT ATGCGAACAG
ACTTCCGACT TTCCGTGTAT TAACCGCAAG TACACTGGCT TAAAGTACCG CTTCGCGTAC
TCGGTGGCGT ACAAATTGGG CACCGAACCA AAGTCGCGCA TGGACATTCC TCTGTTCGAC
GCCGTACTCA AGCACGACCT ACAGTCAGGA GTGACGACTC GATACGAATT AGGTGAAGGT
GTCACGTGTG GTGACATTAT TTTCGTTCCT TCGAAAGATG CCGCGCGCGA AGACGACGGT
TATTTGCTCG TGTTGACACA CCTCGACGTC GACGGTGAAG AGCCTCGAGC AGAGTTATTA
ATTTTAGACG CCTCGGGCGA CGAACTCACG ACGCAGTGCG TCGTGCATAT TCCAATGCGC
GTACCGTACG GATTTCATTG CGAATATGTA
 
Protein sequence
NFAPVEGELE TPVACVVARG RLPDDLDGLY LRNGPNARFR PALGTNRYHW FDGDGMVHAI 
RLRGEGERAE YTRRYVRTRG FEREEKANAA LYTGLRDINP IWRYLLPSSN GLTHHAGRLL
ATYESGSPYE IALEPTLRTK GLCDFNQTFG TMDYWLDNFT AHSKTCPMTD ELIYIGYNLV
ALSGEQDGQT TITVGVIDGE TGKRTHRRQF KVPRPSMQHD VAITPTKTVL IDGPLIFNLP
RVIEGGLPFS FERECTLRIG YLPRRGEEGP FWIDTGETCF AYHVVNAYEE GNILTLDVCK
ADETNALGMC QESNNPVNAG RDVAALWRWQ IDTDANAIIS SKRLCEQTSD FPCINRKYTG
LKYRFAYSSR MDIPLFDAVL KHDLQSGVTT RYELGEGVTC GDIIFVPSKD AAREDDGYLL
VLTHLDVDGE EPRAELLILD ASGDELTTQC VVHIPMRVPY GFHCEYV