Gene OSTLU_38397 details


Gene Information

Locus tag: OSTLU_38397
Symbol:
ID: 5002109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 507413
End bp: 508387
Gene Length: 975 bp
Protein Length: 324 aa
Translation table:
GC content: 60%
IMG OID: 640417530
Product: predicted protein
Protein accession: XP_001418029
Protein GI: 145347128
COG category: [R] General function prediction only
COG ID: [COG0300] Short-chain dehydrogenases of various substrate specificities
TIGRFAM ID:
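
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the record does not state either convention, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = span_length(507413, 508387)
    print(length, expected_protein_length(length))
```

Under these assumptions the listed start and end positions give 975 bp, and 975 bp corresponds to 324 residues plus a stop codon, matching the values in the record.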


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.504345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00204259
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGCGAT GTTGGTGGCC TATCGGCCAC TTGCGAGGAT GGCTCGGACC TTCGGGCTTC 
GGCTCCAAAT CGTCCTGGCG CGACGTCATC GCGGACCTCG ACCCTGGATC TGCCAGCCGA
AATTGCGTTC TCATAACCGG CGCCACGGCT GGCATAGGCT TCGAGACGCT CAAAGCATTT
TGTAGCACGG GTGCGACGGT AGTGGTCGGT GCGCGTGACG AAGCGCGAGC AAAGGCGCTC
GCGTGCGAGC TCATGTCTAA GACGACGTCC ATCGTTCGCG TCCTTCGTCT CGACTTGTCG
TGTTCAAAGT CGGTTCACGC CTTTGTTGAC GCATTTCTCG CGCTCAATCT CAAGCTCACC
GTCCTCGTGA ACAACGCTGG TATAATGCCT TGTCCGTTTG ACGCAGATTC ACACCGAGAC
CTCGCGTTTC ACGTGAAGTT TCTCAACCAC TTTGTTTTAA CGCAGTTGCT CCTGGAATCG
TTTGATCCGG CGGGCGCGCG CGTGGTGAAC GTCACCAGCG AAGTCTATCG CTTCTCTTAT
CCGGAAGGTA TTCGGTTCGG CAAAATAGAC GACGACCGAG CGTACGACAG CGTGAAATCC
TACGCCCAAT CGAAACTCGC GCTGCTCTTG TGGACTCGGT ACCAAGGCGA AGCGCTTCGC
GAGCGCGGCG TGCAATTTTT CGCCGTGCAT CCGGGCTCGG TCGCCACGCA AGGCAGCGCG
CGCGCGCGAA AATCCAGCGG TTGGCGCGGA GCCTTGCTCC ACTGCGTCGG CGCACCGTTC
GTCAAATCCG TCGAGTGCGG CGCGGCGACG ACGATTTATT GCGCGCTTCA TCCCGGCGCG
TCGATGTACA ACAGATTCGG CGAGTATTAT TTCGCGTCGT GCAATCCGAG AGGCGTGCGC
GAGATTTCGC GCGACGCAAC GCTCGCTCGA CGTCTCGTCG AGTACGCCGC GCGCGAGCTC
GACGCGAGCG CGTGA
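
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks need to be removed before any length or GC content calculation. A minimal sketch, with hypothetical helper names and using only the first line of the sequence as sample input:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence as displayed in this record.
    first_line = "ATGCCGCGAT GTTGGTGGCC TATCGGCCAC TTGCGAGGAT GGCTCGGACC TTCGGGCTTC"
    seq = clean_sequence(first_line)
    print(len(seq), round(100 * gc_fraction(seq)), "% GC")
```

Passing the full sequence text through the same two functions gives a value that can be compared with the length and GC content reported in the record.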
 
Protein sequence
MPRCWWPIGH LRGWLGPSGF GSKSSWRDVI ADLDPGSASR NCVLITGATA GIGFETLKAF 
CSTGATVVVG ARDEARAKAL ACELMSKTTS IVRVLRLDLS CSKSVHAFVD AFLALNLKLT
VLVNNAGIMP CPFDADSHRD LAFHVKFLNH FVLTQLLLES FDPAGARVVN VTSEVYRFSY
PEGIRFGKID DDRAYDSVKS YAQSKLALLL WTRYQGEALR ERGVQFFAVH PGSVATQGSA
RARKSSGWRG ALLHCVGAPF VKSVECGAAT TIYCALHPGA SMYNRFGEYY FASCNPRGVR
EISRDATLAR RLVEYAAREL DASA
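
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Translation table field in this record is empty, so the sketch below assumes the standard genetic code; the helper names are hypothetical, and only the first line and the final fragment of the gene sequence are used as input.

```python
from itertools import product

# Standard genetic code (an assumption: the record leaves the table unspecified).
# Codons are ordered with bases T, C, A, G at each position; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = "ATGCCGCGAT GTTGGTGGCC TATCGGCCAC TTGCGAGGAT GGCTCGGACC TTCGGGCTTC"
    print(translate(clean_sequence(first_line)))  # MPRCWWPIGHLRGWLGPSGF

    last_fragment = "GACGCGAGCG CGTGA"
    print(translate(clean_sequence(last_fragment)))  # DASA
```

The two outputs match the first twenty and the last four residues of the protein sequence shown above, which is consistent with the standard code having been used for this gene.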