Gene OSTLU_37938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37938 
Symbol 
ID5004252 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp156286 
End bp159105 
Gene Length2820 bp 
Protein Length866 aa 
Translation table 
GC content59% 
IMG OID640419673 
Productpredicted protein 
Protein accessionXP_001420098 
Protein GI145351467 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases
[COG2041] Sulfite oxidase and related enzymes
[COG5274] Cytochrome b involved in lipid metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.590439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.879833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGACG TCGACGTGCA GCGCGCGCTG CTGGCGTGCG ATGGACATCG CGGCGCGCTG 
ACGAGGGCCA AAGCGGAGGG CGCGCGCGCG GACGGACGCG ACCTGAACAC GCCCGATCAC
TGGGTGGCGA GACACCCGAG CCTGGTGCGC CTGACGGGGA CGCACCCGTT CAACTGCGAG
CCTCCGCTGC GAGCGCTGAT GGAGGCGGGG AGCGTGACGC CGGCGGAGCT GCACTTCGTC
AGGAATCACG GGGCGGCGCA GAACATCGCG TGGGACGCGC ACGAGGTGAG GGTGGAGGTG
AGCGACTCGG GCAAGGGGAG GACGTACGGG ATGGACGCGT TGTTGCGCAA GCCGGCGATG
ACGATGCCGT GCTTGCTCGT GTGCGCGGGA AATCGGAGAA AGGAGCAGAA TATGTATGAA
AAGACGATTG GATTCGGATG GGGCGCGGCT GGATTGTCGA ACTCGCTCTG GACCGGGGTG
CCGCTGCGGA TTTTGCTCGC CGAGTGCGGG GTGACCGAGG TGACGGCGAA GCGACGCTTT
GTGTGCTTTG AGGGACCGAA GGGGGAGCTG CCGAAGGGTA AGGATGGAAC GTACGGAACG
AGCATTCCCT TGAGCAAGGC GCTCGACCCG GCGCAAGATG TCATGGTGTG TTACGCGCAA
AACGGTGAGG CACTTCGACC CGATCACGGG TTTCCCGTGC GCTTGATCAT CCCCGGCTAC
ATCGGCGGCA GAATGATCAA GTACCTCACG ACGATTCGCG TCACGGAGCA TCCGAGCGAC
AACTTTTATC ATTTCCGCGA TAACCGCATC ATGCCTCCAG GCGTGGACGC GGCGATGGCG
GATAAGCAAA ACTGGTGGGA GAAGCCGGAG TACATTTTCA ACGAGCTCAA CATCAACAGC
GCCATCGCGT CGCCCGCGCA CGACGAGTGC GTGCCGCTCG ACCGCGAGCG GTACGAAGTG
AAAGGGTACG CGTACACGGG CGGAGGACGC GCGATCACGC GCGTTGAGGT GAGCCTTGAC
GGTGGTTACA CCTGGAGATT GGCCAACATT CGTCGTCCAT TCGCACCGAC GATGTACGGC
AAGCACTGGT GCTGGATCTT TTGGGATTTG GACGTCGCCA CCGTCGAACT CGCGAGCGCC
AAGGAGGTGA TGTGTCGGGC GTGGGACGAA GCGAACAACA CGCAGCCGCG AGATTTTACG
TGGAATCTCA TGGGTATGGG CAATAATTGC TACTTCCGCG TGAAGCTGTC GGTCGCGGCG
CTGAGCGATA GAGCACAAAA GTACATTCGA TGCGAACATC CGACGGAGCC GGGCGCGCTC
AAGGGTGGCT GGATGGGTAA CGAAGCCGGC GGATGGAAGC CAGTGATCGA AGCGCTCGAA
GCCGCGCGAG CGGGTGAAGA ACGGCATGTG GGTGCGAGTG CCTCAGAAGT GCAAGTGACG
ATCGTAAACA CGGAGAAAGT AGAAGTTTCG CAGAAGAATG ACGCCGCCGA CGAAGCGCCC
GTCGTCGCCG CCGCGCCGAA GAAAAATGTC AGATACATCA CCATGGAGGA GGTGGAGAAA
CACAACACTG AAGACGACTG CTGGATTGTC GTTAAAGGTA AGGTGTATGA CGTCAACGCG
TACTTGAAGG AAGGTCTGCA TCCGGGAGGT AATGCGAGTA TTACCATGAA TGCGGGCGAA
GACACCACGG AGGATTTCGA AGCCGTGCAC AGCGCCAAGG CTTGGAAGCA GCTCGAACCT
TTTTATATCG GCGACGTCGG AAGTGCGGAA GATCAGGCGG CTTCAAGTGA TGTGTCGGTC
GAAATTGAGC GAGTCGAAAC CGAGCCCAGG CGTGAGTTCC CGGCGATGCC CAAACGTGGC
CCGGTGCATC TCGTCAAGTT TTACGAAGAA CACAAGGAGG CGTTCGGCCA GATGTTACTC
GGTGAAGCCG CGGAAGCGGC GGCTTTCGAT CGCATGTGGG CGGGTGCAAA GCATATCGTG
CCCGAAAACG CTCCACTCGG GCTGAACCCG AAGAAATGGT TACAGCTCAA GATTGAAAAC
AAAATTCCGC TGTCGCACGA TTCCATTCTT TTGCGTTTGA AGCTCGAAAG CGACGAGCAT
CAGTGTGGTA TGCCCGTGGG TTATCATGTA TATTTACGAG GCGAGTGGAA TGGCAAGAAG
GTGATGCGCG CGTACACGCC TTCCTCGCTC AACGGCACCC TTGGGGCTGT GGAACTGGTG
ATTAAGATTT ATTATTCCGA CGTCCACGAA GCGTACCCCG AGGGCGGGGC GTTGACACAA
TACCTTCACC ATCTCAACGA GGGTGACAAG ATTGACGTCA AAGGCCCAGT TGGTCACATC
AAGTATCTCG GTCAGGGATT GTTTAGCATC GACAAAAAAG ACTTGCCGCC GGTGAAGAAG
ATGACGCTCT TGGGCGGCGG CACGGGCGTC GCGCCGATGT TGCAACTCAT CGTCGCCGTT
CTCGCGGACG AGAAGGACGA AACGGAACTT TCCTTCATTT ACGCCAACAA AACGGAAGAC
GATGTGCTAT TGAAGTATAC CCTCGATCGT CTCGAGCGCG AGCATAAGGG GCGGTTTAAA
GTGCACTACA TGATTTCCAA GGAGACGTGG GCGGCGGACA GGAAGACTGG CCCGGAGTGG
AGCTCGGATC GCGTCACGTA CGGGCGCATA AGTTTGCCCA TCATCCAACA ACACGGCTTC
CCGTCCAACG GTTCGTCGCA CATCGCCGTG ATGTGTGGGC CACCTGCGTT CGAGGAAGAC
ACGTGCATTC CAGCGCTGAA AGCGCTCGGT TATCCCGAAG ACGCCATCAT CCGCTATTAG
 
Protein sequence
MDDVDVQRAL LACDGHRGAL TRAKAEGARA DGRDLNTPDH WVARHPSLVR LTGTHPFNCE 
PPLRALMEAG SVTPAELHFV RNHGAAQNIA WDAHEVRVEV SDSGKGRTYG MDALLRKPAM
TMPCLLVCAG NRRKEQNMYE KTIGFGWGAA GLSNSLWTGV PLRILLAECG VTEVTAKRRF
VCFEGPKGEL PKGKDGTYGT SIPLSKALDP AQDVMVCYAQ NGEALRPDHG FPVRLIIPGY
IGGRMIKYLT TIRVTEHPSD NFYHFRDNRI MPPGVDAAMA DKQNWWEKPE YIFNELNINS
AIASPAHDEC VPLDRERYEV KGYAYTGGGR AITRVEVSLD GGYTWRLANI RRPFAPTMYG
KHWCWIFWDL DVATVELASA KEVMCRAWDE ANNTQPRDFT WNLMGMGNNC YFRVKLSVAA
LSDRAQKYIR CEHPTEPGAL KGGWMGNEAG GWKPVIEALE AARAGEERHN DAADEAPVVA
AAPKKNVRYI TMEEVEKHNT EDDCWIVVKG KVYDVNAYLK EGLHPGGNAS ITMNAGEDTT
EDFEAVHSAK AWKQLEPFYI GDVGSAEDQA ASTAAFDRMW AGAKHIVPEN APLGLNPKKW
LQLKIENKIP LSHDSILLRL KLESDEHQCG MPVGYHVYLR GEWNGKKVMR AYTPSSLNGT
LGAVELVIKI YYSDVHEAYP EGGALTQYLH HLNEGDKIDV KGPVGHIKYL GQGLFSIDKK
DLPPVKKMTL LGGGTGVAPM LQLIVAVLAD EKDETELSFI YANKTEDDVL LKYTLDRLER
EHKGRFKVHY MISKETWAAD RKTGPEWSSD RVTYGRISLP IIQQHGFPSN GSSHIAVMCG
PPAFEEDTCI PALKALGYPE DAIIRY