Gene OSTLU_36317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_36317 
Symbol 
ID5000077 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp198814 
End bp200718 
Gene Length1905 bp 
Protein Length588 aa 
Translation table 
GC content63% 
IMG OID640415498 
Productpredicted protein 
Protein accessionXP_001416100 
Protein GI145342029 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTCGT TCGCGACGCG CGCGCGACGG ACGTTCGAGG TGGAGTACGG CGCGCGGCGA 
GGGACGACGG AGAACGCGTA CGCGACGCGA GGCGAAAAGG ATGCGTGCCT GATCGATTGC
GTCGACGGGA GACACGCGGA GGGGTACGCG CGCGAGATCG AGGGGCTGGG GGCGTGGGCG
CGCGACGCGG CGTATCACGC GGTGCTGCAC GTGAGTCCGC GGCGGTTGGA CGCGCTCGCG
GCGGCGATCG CGAACAGGAG CGAGGGCGCG GCGTGCGTGG AGGTGCTGTG CTCGAATCCG
GGGGCGCAGT TGATTCAACA GGCGCTGAAA CCGAATAGTC CTCTGACGAA TGAAGCGCTG
TGTGCGGCGT GGAAGGGGAC GGATGGGAAA CTGAGGGCGC GGTTGCGCGT CGTGCGCAAC
GGCGAGCGGT TGGATTTGGG CGGGCGGACT TTGAGATTTA CGCTCGCGCC CACGCCGCGT
TGGCCGGATT TGATCTTCGC GCGCGATGAG AAGTCGCAAA CTCTGTTTAC ATCGAAGTTT
TTCTCCGCAC ACGTCGGCAC GATGGAAGGA TACGGCGATG AGGGCGGGTT GGAGACGTTC
GGGGAGGATT GGAGGTTTTA CTTTGACTGC CTCCTCGCGC CGATGGCGAG GCAAGTGTCG
CCTTTGCTCG AGAAGCTTAC CGTGAAAGAA GAGAACGCGT ACGACGAACG GATGGGACGA
CGGATGGAGG AGTGGGAAAA GAGCGGCGCG GTGAAAAAGG TTTTGCTGCG CGCGATGGGC
AAGTCCATGG CGGGGAAAAC GAGCTCGCAA GCGTTCGAGG GCGTCGCGAA GACTATTTGC
CCGGCGCACG GCCCGGTCGT CGCGTCCTCG GTCACGGAGC TGTATCGAGA GTACGTAGAG
TGGTGCAAAA TGCAAACTTC CGCTGGGGAT AACTTGTCCG TGGCCGTCAT CTACGCCAGT
GCGTACGGGA ACACCGGCGC GATGGCGCAA GCCATCGCGC GAGGCGTCGC CAAGACAGGC
GTCGGCGCGG AAATGTTCAA CTGCGAGCTC GCGTCACCGA TCGAAGTGGA AGAAGTGTTG
AAACGTAGCG CGGGGTTCGC CCTCGGTGCA CCGACGCTCG GTGGCACTTT ACCGACGCCC
GTGCAGACCG CACTCGGGGC GATCGTGAAG GAGGGCGATT TGGAGAAGCC GTGCGGCTCC
TTTGGATCCT TTGGCTGGTC TGGCGAAGCG GTGGCGATGA TCGACAAACG CTTGACAGAC
GCGGGCTTTA AAAGTGCGTT CGAGCCGTTG CGGTGTAAAT TCAAACCTAC CGCCGAGACG
CTGCAGCTTT GTGAAGAGAG CGGGACGGAT CTCGCGCAAG CGGTGCGCAA GATCGAGCGC
CGCAAACAAG TGCTCGAGCG TAAATCCGTC GGCCAAGCCG CGGACGGCGT CAGTGACACC
GCCGCCGCCG TCGGGCGCAT CGTCGGCTCG CTATGCGCGG TGACGACGAA GAACGAAGAC
ACGCAAAGTG CCATGTTAGC GTCTTGGGTA TCTCAAGCGA GCTTCAATCC GCCGGCGCTC
ACCGTCGCCG TCGCCAAGGA GCGCGCCGTC GAGAGTTTCC TCATGACCGG CGGCAAGTTC
AACCTCAACG TCCTCAAGTC CGGCGGCGAA AAGGACGTCA TGAAGGCCCT ACTCAAACCG
TTCGCTCCCG GTGAGAACCG TTTCGGCGCG CTCGACGTAG ACATCTCCGA AACCAACGGC
TGCGCCGTGG TGAAACAGGC CCTCGCGTGC GTCGAGTGCA CCGTCACGAA GCGAATGGAG
GCCGGCGACC ACTGGGTCGT CCTCGCCGAA GTCGAGCGCG GGACTCTCTT AGACGCCGAA
GGCGTGACGA GCATCCACCA CAGAAAGACT GGTAGTTCTT ATTAA
 
Protein sequence
MVSFATRARR TFEVEYGARR GTTENAYATR GEKDACLIDC VDGRHAEGYA REIEGLGAWA 
RDAAYHAVLH VSPRRLDALA AAIANRSEGA ACVEVLCSNP GAQLIQQALK PNSPLTNEAL
CAAWKGTDGK LRARLRVVRN GERLDLGGRT LRFTLAPTPR WPDLIFARDE KSQTLFTSKF
FSAHVGTMEG YGDEGGLETF GEDWRFYFDC LLAPMARQVS PLLEKLTGVA KTICPAHGPV
VASSVTELYR EYVEWCKMQT SAGDNLSVAV IYASAYGNTG AMAQAIARGV AKTGVGAEMF
NCELASPIEV EEVLKRSAGF ALGAPTLGGT LPTPVQTALG AIVKEGDLEK PCGSFGSFGW
SGEAVAMIDK RLTDAGFKSA FEPLRCKFKP TAETLQLCEE SGTDLAQAVR KIERRKQVLE
RKSVGQAADG VSDTAAAVGR IVGSLCAVTT KNEDTQSAML ASWVSQASFN PPALTVAVAK
ERAVESFLMT GGKFNLNVLK SGGEKDVMKA LLKPFAPGEN RFGALDVDIS ETNGCAVVKQ
ALACVECTVT KRMEAGDHWV VLAEVERGTL LDAEGVTSIH HRKTGSSY