Gene OSTLU_1824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_1824 
Symbol 
ID5006872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp124004 
End bp125386 
Gene Length1383 bp 
Protein Length461 aa 
Translation table 
GC content58% 
IMG OID640422293 
Productpredicted protein 
Protein accessionXP_001422903 
Protein GI145357392 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.0946641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0013207 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
AAGATGCCGG TGGCGAGCGG GGACATTCGC GAGATCGCGG GACAACCGGT TTTTGTGCCG 
CTGTATAAGC TGTTCTTGGC GTACGGGGAG ATGTTCGTCC TGGCGATCGG GCCGAAGAAA
TTCGTCGTCG TGAGCGACAA CGCGGTGGCC AAGGAGATGT TGCTCACGCA GGCGAAGAGC
TTCTCCAAGG GATTGCTGTC GGAGATTTTG GACTTTGTCA TGGGTCAGGG GTTAATCCCG
GCGAACGGTG AGGTGTGGAA GATTCGACGC AAGGTGATCG TGCCGAGCTT GCACAAAAAG
TACGTCACGT CCATGGTGGA CATGTTCGGC GACTGCGGGT TGAAGGGGAT GTCGCAGCTC
GCGCGCGCGG AGAAGGCGAA CGAGTCGGTG GAGATGGAGA ACTTTTACTC GCGATTCGCC
TTGGATATCA TAGGCAAGGC GGTGTTCAAT TACGATTTCG ACTCCTTGTC CACGGACGAC
CCCGTGATCA AAGCCGTGTA CACGGTTTTG CGCGAAGCCG AGTACCGGAG CGTGACGTTT
ATTCCCTATT GGAAGGTTCC CCCGCTTCGC TGGCTCGTGC CGAGGCAGCG TCAGTGCCAG
GAGGCGCTGC AAGTGGTGAA CGACACCTTG GATGACCTCA TCAACCGATG CAAAGCCGTG
GTGGAGGAAG AGGATGAGGA ATTCGTCGAG GAGTACATGA ACACGGACGA TCCGAGCATT
TTGCACTTTC TCATCGCGAG CGGCGACGAC GTGACGTCCA AGCAACTTCG CGATGATTTA
ATGACGCTCC TGATCGCCGG CCACGAAACC ACCGCCGCGG TGCTGACGTG GACGACATTT
TTGCTCGCCA AGCACCCCGA AGTGAAGGCC AAGGTATTCG AGGAGGTTGA CCGCGTCGTC
GGCGACCGCA ACCCGACGGT GGCGGATATG CGCGCGCTCG TGTACACGAC GCGCGTCATC
AACGAGTCCA TGCGACTTTA CCCGCAACCT CCGGTGTTAA TCAGGCGCGC GTTAGAGCCC
GTCACCCTCG GAGGGTACAA CATCGACGCC GGAACCGACT TCTTCATTTC GGTTTGGAAC
TTGCACAGAA ACCCGCGGAT TTGGGACGAA CCCGACGCGT TCAAGCCCGA ACGCTTCCCG
ATCGACGCCC CGATGCCGAA CGAGTACACC GAAGAGTACG CGTACTTGCC CTTCGGCGGT
GGCCAGCGCA AATGCGTGGG CGATCAGTTT GCTATTTTTG AGTCAATCGT GTCGCTCGCC
ATGCTCATGC GACGATTCGA CTTTGAACTC GACGAGTCCA AGCACCCCGA CGGCGAATGC
GGCATGACGA CGGGCGCGAC GATTCACACC ACGAACGGCT TGCACGTCAA GCTCAAGCGC
CGC
 
Protein sequence
KMPVASGDIR EIAGQPVFVP LYKLFLAYGE MFVLAIGPKK FVVVSDNAVA KEMLLTQAKS 
FSKGLLSEIL DFVMGQGLIP ANGEVWKIRR KVIVPSLHKK YVTSMVDMFG DCGLKGMSQL
ARAEKANESV EMENFYSRFA LDIIGKAVFN YDFDSLSTDD PVIKAVYTVL REAEYRSVTF
IPYWKVPPLR WLVPRQRQCQ EALQVVNDTL DDLINRCKAV VEEEDEEFVE EYMNTDDPSI
LHFLIASGDD VTSKQLRDDL MTLLIAGHET TAAVLTWTTF LLAKHPEVKA KVFEEVDRVV
GDRNPTVADM RALVYTTRVI NESMRLYPQP PVLIRRALEP VTLGGYNIDA GTDFFISVWN
LHRNPRIWDE PDAFKPERFP IDAPMPNEYT EEYAYLPFGG GQRKCVGDQF AIFESIVSLA
MLMRRFDFEL DESKHPDGEC GMTTGATIHT TNGLHVKLKR R