Gene OSTLU_33533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33533 
Symbol 
ID5003573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp369803 
End bp371437 
Gene Length1635 bp 
Protein Length544 aa 
Translation table 
GC content58% 
IMG OID640418994 
Productpredicted protein 
Protein accessionXP_001419574 
Protein GI145350354 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.175528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGC CGGATTGGAT GACCCAGTTG AACCGACTGT GGGGTGGGGC GTCGGAGATT 
CCCGTGGCGG ACGCGAAACT GGAGGATATC ACGGGATTGC TCGGAGGCGG GTTGTTTCAG
CCGCTGTTCA AGTGGATGCG AGAGAGCGGG CCGGTGTATT TGCTTCCCAC GGGACCCATC
ACGTCGTACG TCGTCGTGAG CGATCCCGAC TGCATTAAAC AGGTGTTATT CAACTACGGA
AGCCGGTACA TCAAGGGCAC AATCGCGGAG GCGGGTGAAT TTTTGTTTGG ATTGGGCGTC
GCGTTGCAAG AGTTGGAGCC GTGGAAGATT CGACGGAAGG CGGTGGCGCC CTCTTTGCAT
CGCAAATACG TCGAAGCGAT GGTGGATAGA TGCTTTGGTC CGTGCGCCGA TCGGATGGTG
AGTATTTTGG AGGGCGAAGC TGGCGCTGGC GGCGTCGGTG GCGTGAACAT GGAGAGTCGA
TTTAGCAAGA CGGCGCTCGA CATCATCGGC ATCTCCGTGT TCAATTACGA CTTTGAGGCG
CTCACGACGG CGGCGCCGGT GATTCAAGCC ACGTATACTG CGCTCAAGGA GGTCGAGACA
CGATCGATGG ATCTTTTGCC CACGTGGCGC TTGCCTGAGA AGTTCTTGCG CGTCGTTAGC
CCGCGCCAAC GCGACGCGCA AGACGCCGTC ACCGTCATTC GCGACGTCAC CCAGCGACTC
GTGGACGACT GCAAGCGCAT GGTCGAGGAG GAAGAAAAGG TCGGTGGCGC TGAAGAGTGG
GCTCGCGATT ACTTGAACGA GTCAAATCCT TCGGTGTTGC GCTATCTCAT CGCTGCGCGC
GAAGAAGTGT CTTCGACGCA GCTCAGAGAC GATTTGCTCT CGCTTCTCGT CGCCGGTCAC
GAAACCACCG CATCGGTGCT CACGTGGGGC ACGTACGAGC TTCTCAAGCC TGAAAACGCG
GAGCAGCTTC GCTTACTTCG AGCCGAGCTC GATGAAGTGC TAGGCACGCG TCCGTTCCCG
ACGTTTGCGG ATTTACCAAA GATGCCCTAC CTCGAGCGAT GCTTTCACGA GTCCATGCGG
CTTTATCCGC AACCTCCCGT GTACACGCGA CGCGCCGTCG TGGAGGATGT CTTGCCCAAC
GGCATGACGA TACCAAAGAA CCAAGATTTA TTGGTATCGA TTTACAACCT CCACCGTTCG
CCCACGAGTT GGGGGCCGAC ATCGCAAGAG TTCGAGCCCA TGCGCTTCGG ACCGCTGGCG
AACGGGCAAC CGAACGAGTT AAACACGGAC TACCGCTACG TGCCGTTTAG CGCCGGACCG
AGGCGATGCC CCGGGGATAA GTTTGCCGTC TACGAGGGCA TCGTCATTTG GGCCACGATG
TTCAGGCGAT TAGACTTGGA GTTAAAGGCT GGTCACGACG TTGTCATGAC GTCTGGGGCG
ACGATTCACA CGAAGAGCGG TTTGTTAGCC ACCGTCAAAG CGCGCGCCAT GCGAGAAGTC
GCCGAGGCGG ACCGTGTCGA CTGGGCTAAC TTGAAGCCTG CGAAGGATAT CGGCGAGGAG
TGGATGGAAA AGGCACTCTT CAATAGCGAA GCGACCGGTG CGGTGAGCGC GGGTAAATGC
CCCATGGGTC ATTAA
 
Protein sequence
MAAPDWMTQL NRLWGGASEI PVADAKLEDI TGLLGGGLFQ PLFKWMRESG PVYLLPTGPI 
TSYVVVSDPD CIKQVLFNYG SRYIKGTIAE AGEFLFGLGV ALQELEPWKI RRKAVAPSLH
RKYVEAMVDR CFGPCADRMV SILEGEAGAG GVGGVNMESR FSKTALDIIG ISVFNYDFEA
LTTAAPVIQA TYTALKEVET RSMDLLPTWR LPEKFLRVVS PRQRDAQDAV TVIRDVTQRL
VDDCKRMVEE EEKVGGAEEW ARDYLNESNP SVLRYLIAAR EEVSSTQLRD DLLSLLVAGH
ETTASVLTWG TYELLKPENA EQLRLLRAEL DEVLGTRPFP TFADLPKMPY LERCFHESMR
LYPQPPVYTR RAVVEDVLPN GMTIPKNQDL LVSIYNLHRS PTSWGPTSQE FEPMRFGPLA
NGQPNELNTD YRYVPFSAGP RRCPGDKFAV YEGIVIWATM FRRLDLELKA GHDVVMTSGA
TIHTKSGLLA TVKARAMREV AEADRVDWAN LKPAKDIGEE WMEKALFNSE ATGAVSAGKC
PMGH