Gene OSTLU_47300 details

Gene Information

Locus tag: OSTLU_47300
Symbol:
ID: 5004998
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 403083
End bp: 404570
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table:
GC content: 59%
IMG OID: 640420419
Product: predicted protein
Protein accession: XP_001420992
Protein GI: 145353380
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
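
The listed gene length can be reproduced from the coordinates if Start bp and End bp are read as 1-based, inclusive positions. That convention is an assumption here; the record does not state it. A minimal Python sketch of the arithmetic, using a hypothetical helper named `span_length`:

```python
# Coordinates copied from the record above.
# Assumption: both positions are 1-based and inclusive.
start_bp = 403083
end_bp = 404570


def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate interval."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)

# Split the span into whole codons; a non-zero remainder would mean
# the span is not a multiple of three.
codons, remainder = divmod(gene_length, 3)

print(gene_length)        # 1488
print(codons, remainder)  # 496 0
```

Under that assumption the span is 1488 bp, matching the Gene Length field, and it divides evenly into 496 codons. That count is arithmetically consistent with a 495 aa product plus one stop codon, although the record itself does not say how the two lengths are related.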


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value: 0.227145
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.672503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAGATGCCGG TGGCGAGCGG GGACATTCGC GAGATCGCGG GACAACCGGT TTTTGTGCCG 
CTGTATAAGC TGTTCTTGGC GTACGGGGAG ATGTTCGTCC TGGCGATCGG GCCGAAGAAA
TTCGTCGTCG TGAGCGACAA CGCGGTGGCC AAGGAGATGT TGCTCACGCA GGCGAAGAGC
TTCTCCAAGG GATTGCTGTC GGAGATTTTG GACTTTGTCA TGGGTCAGGG GTTAATCCCG
GCGAACGGTG AGGTGTGGAA GATTCGACGC AAGGTGATCG TGCCGAGCTT GCACAAAAAG
TACGTCACGT CCATGGTGGA CATGTTCGGC GACTGCGGGT TGAAGGGGAT GTCGCAGCTC
GCGCGCGCGG AGAAGGCGAA CGAGTCGGTG GAGATGGAGA ACTTTTACTC GCGATTCGCC
TTGGATATCA TAGGCAAGGC GGTGTTCAAT TACGATTTCG ACTCCTTGTC CACGGACGAC
CCCGTGATCA AAGCCGTGTA CACGGTTTTG CGCGAAGCCG AGTACCGGAG CGTGACGTTT
ATTCCCTATT GGAAGGTTCC CCCGCTTCGC TGGCTCGTGC CGAGGCAGCG TCAGTGCCAG
GAGGCGCTGC AAGTGGTGAA CGACACCTTG GATGACCTCA TCAACCGATG CAAAGCCGTG
GTGGAGGAAG AGGATGAGGA ATTCGTCGAG GAGTACATGA ACACGGACGA TCCGAGCATT
TTGCACTTTC TCATCGCGAG CGGCGACGAC GTGACGTCCA AGCAACTTCG CGATGATTTA
ATGACGCTCC TGATCGCCGG CCACGAAACC ACCGCCGCGG TGCTGACGTG GACGACATTT
TTGCTCGCCA AGCACCCCGA AGTGAAGGCC AAGGTATTCG AGGAGGTTGA CCGCGTCGTC
GGCGACCGCA ACCCGACGGT GGCGGATATG CGCGCGCTCG TGTACACGAC GCGCGTCATC
AACGAGTCCA TGCGACTTTA CCCGCAACCT CCGGTGTTAA TCAGGCGCGC GTTAGAGCCC
GTCACCCTCG GAGGGTACAA CATCGACGCC GGAACCGACT TCTTCATTTC GGTTTGGAAC
TTGCACAGAA ACCCGCGGAT TTGGGACGAA CCCGACGCGT TCAAGCCCGA ACGCTTCCCG
ATCGACGCCC CGATGCCGAA CGAGTACACC GAAGAGTACG CGTACTTGCC CTTCGGCGGT
GGCCAGCGCA AATGCGTGGG CGATCAGTTT GCTATTTTTG AGTCAATCGT GTCGCTCGCC
ATGCTCATGC GACGATTCGA CTTTGAACTC GACGAGTCCA AGCACCCCGA CGGCGAATGC
GGCATGACGA CGGGCGCGAC GATTCACACC ACGAACGGCT TGCACGTCAA GCTCAAGCGC
CGCGATGGGC GAGGTGGGCG AGAGATGGAC GGCACGTACG TCACCGGTAT GGCGTTGAGC
AACCTGGAAG ACGTCGACGT CGTTCGAGGG TCCATCGACG CCCCGACG
 
Protein sequence
MPVASGDIRE IAGQPVFVPL YKLFLAYGEM FVLAIGPKKF VVVSDNAVAK EMLLTQAKSF 
SKGLLSEILD FVMGQGLIPA NGEVWKIRRK VIVPSLHKKY VTSMVDMFGD CGLKGMSQLA
RAEKANESVE MENFYSRFAL DIIGKAVFNY DFDSLSTDDP VIKAVYTVLR EAEYRSVTFI
PYWKVPPLRW LVPRQRQCQE ALQVVNDTLD DLINRCKAVV EEEDEEFVEE YMNTDDPSIL
HFLIASGDDV TSKQLRDDLM TLLIAGHETT AAVLTWTTFL LAKHPEVKAK VFEEVDRVVG
DRNPTVADMR ALVYTTRVIN ESMRLYPQPP VLIRRALEPV TLGGYNIDAG TDFFISVWNL
HRNPRIWDEP DAFKPERFPI DAPMPNEYTE EYAYLPFGGG QRKCVGDQFA IFESIVSLAM
LMRRFDFELD ESKHPDGECG MTTGATIHTT NGLHVKLKRR DGRGGREMDG TYVTGMALSN
LEDVDVVRGS IDAPT