Gene OSTLU_43938 details


Gene Information

Locus tag: OSTLU_43938
Symbol:
ID: 5004365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 282429
End bp: 283844
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table:
GC content: 55%
IMG OID: 640419786
Product: predicted protein
Protein accession: XP_001420297
Protein GI: 145351898
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
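
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(282429, 283844)
print(length, protein_length_aa(length))  # 1416 471
```

Because the record lists the gene as unspliced, the gene length and the CDS length are treated as the same number here.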


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.270818
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.169117
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTGG TTATTTTCAT CTTCTTGCTC GATCAGCTCT CGCACGGACC GCTGAAGGGA 
CGAAAGTCGC CGCCGGTGAT CGACGTGGCG CCGGTGTGGG GCGGCATGTT GGCCTTTTTG
GCGGGACCGA TGAAACTCAT GCGCGAGGCG ACGCCGAAGT ACGGTGAGGT GTTTACCGTG
CCGGTGTTTC ACAAGCGGAT CACGTTTCTG ATCGGGCCCA AGGTGAGCGA GTTTTTTTTC
AAGGCGAAGG ATACGGAGAT GTCGCAAAAG GAGGTGTACG AGTTCAACGT GCCGACGTTC
GGTAAGGGCG TGGTGTTCGA TGTAGATCAC ACGACTCGTG CGGAACAGTT TAGATTTTTC
GCGGATAGTC TCAAGAGTAA CCGATTGAGG ATGTACGTGG GGATGATGGT GAAGGAGGCG
GAGGATTTCT TCAGCAAGTG GGGAGACGCA GGCGAGGTGG ATTTGCTCGA GCAACTCTCG
GAGTTGATCG TACTCACGGC TTCCAGATGC TTGCTCGGAA GAGAGATTCG CGAGACGCTC
TACTCTGAAG TTACCGATCT GGTGCACGAT TTGGATAAGG GTATGGTGCC GTTGTCGGTA
TTTTTCCCGT ACGCGCCGAT CGAGGCGCAC CGCAAGCGAG ACGCGGCGCG CAAAAACTTG
GCCAAGATTT TCGACAAAGT TATCCAAGCT CGTCGCGAGA GCGGCGCGAG TGAACCGGAT
GTCTTACAAA CGTTCATCGA CGCCCGGTAC AAGGATGGTA GCAGGCTCAC GAACGACCAA
GTCCTCGGTA TGTTGATTGC CGTGCTCTTC GCCGGTCAAC ACACGTCCTC GATCACGTCC
ACGTGGACTG GTTTGCTGGC CATCGCGAAC AAGGAGCGCG TGATGCCTGC GCTCGAAAAG
GAGCAAAAGG ATATCATGAA GAAGCACGGC AAGGATTTGG ATTTCGACAT CTTAGCGAAA
ATGGATGAGT TGCATTTTGC TGTGAAGGAG GCGCTTCGAA TGCACCCGCC TCTCATCATG
CTCCTTCGCA TGGCGCAAGT GCCGTTCGAG GTCGAAACCT CTACGGGTAA GAAGTACACC
GTCCCCAAGG GCCACATCGT CGCCACCTCT CCCGCGTTCT CGCACCGCTT GGACAATGTC
TACAGCGACC CGAACGAGTA CAAGCCTGAA CGATTCCGCG AACCGAACCC CGAAGACAAG
GCCCAGTTCG CCTCCTTCAT CGGTTTCGGC GGCGGACGTC ACGGTTGCAT GGGGGAAACC
TTTGCGTACA TGCAAATCAA AACCATTTGG TCCATCCTTT TGCGAAACTT TGAGTTCGAA
ATGGTTGGAA AAGTTCCCGA ACCCGATTAC ACCGGCATGG TCGTCGGTCC CACCGCGGGC
CAATGCAAAA TCCGCTACAA GCGCCGCGTT CTGTGA
 
Protein sequence
MTVVIFIFLL DQLSHGPLKG RKSPPVIDVA PVWGGMLAFL AGPMKLMREA TPKYGEVFTV 
PVFHKRITFL IGPKVSEFFF KAKDTEMSQK EVYEFNVPTF GKGVVFDVDH TTRAEQFRFF
ADSLKSNRLR MYVGMMVKEA EDFFSKWGDA GEVDLLEQLS ELIVLTASRC LLGREIRETL
YSEVTDLVHD LDKGMVPLSV FFPYAPIEAH RKRDAARKNL AKIFDKVIQA RRESGASEPD
VLQTFIDARY KDGSRLTNDQ VLGMLIAVLF AGQHTSSITS TWTGLLAIAN KERVMPALEK
EQKDIMKKHG KDLDFDILAK MDELHFAVKE ALRMHPPLIM LLRMAQVPFE VETSTGKKYT
VPKGHIVATS PAFSHRLDNV YSDPNEYKPE RFREPNPEDK AQFASFIGFG GGRHGCMGET
FAYMQIKTIW SILLRNFEFE MVGKVPEPDY TGMVVGPTAG QCKIRYKRRV L
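
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the formatting, compute a GC fraction, and translate the gene sequence to compare it with the listed protein. The record leaves the translation table field empty, so the use of the standard genetic code (NCBI table 1) is an assumption; all function names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered over T, C, A, G.
# Using this table is an assumption: the record does not name one.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGACGGTGG TTATTTTCAT CTTCTTGCTC"
print(translate(clean_sequence(first_block)))  # MTVVIFIFLL
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` or `translate` gives values that can be compared against the GC content and protein sequence listed in the record.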