Gene OSTLU_31498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31498 
Symbol 
ID5002096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp55370 
End bp56552 
Gene Length1183 bp 
Protein Length356 aa 
Translation table 
GC content67% 
IMG OID640417517 
Productpredicted protein 
Protein accessionXP_001417657 
Protein GI145346360 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.532767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGCGCGTT CGCCGCGCGA CGCTTCGACG GACGCGACGG ACGCGCGCGC GCGCGTCTTC 
AGCGCGCGCG TCGTCGTCGC GTGCGCGATC GCGATCGCGA TGGCGTCCGG ACGCGCCGGG
ACGCCGGCGG CGACGCTCGG GACGCTGCGC GCGCACGACG CGAAGCGCGC GCTCGCGGTG
GATCGGTTGA TCCTGGGGGC GTCGCCGCTG GCGGGAATAT ACCGCGGCGT CGACGCGAGC
GAGGCCGCGG CGACGGTGCG CGCGGCGCTC GACGTCGGGT TCACGCGGTT CGACACGGCG
CCGCACTACG GGTTGGGGCT GAGCGAGACG CGCCTGGGCG AGGCGCTGCG AACGCACGCG
AAGAAGGCGG TGAAGGTGTA CACCAAGGTG GGAAGGGTGA TGAAACCGAT GGATGAAGTG
ACGGAAAGCG AACGCGAGTC GGTGGTCGAG TGGGGGAACG TGCCGGGAAA CGACGGGTGC
ATATTTCCAG ACGCCCCGCG AGACGTGTTG CCCGTGCTCG ATTATTCCGC GAATGGGTTC
GTGCGATCGC ACGCCGATAG TTTAAACAGG CTTGGGATTG ATAGAATAGA GGGATTGAGG
ATTCACGACG CCGAGACGCC CGAAAGGTTC GAGGCGGCGA CGACCGGGGG CGGCGTGCGC
GCGTTGACGG CGCTTCGCGA CGCGGGAACC ATTTCAGAGG TATCGCTCGG GATGAACGAC
GCCTCGTACG TGTTGCGAAT GATTCGCGAG AATCCCCCAG GAACGTTCGA CTCGGTCATG
ATGGCCGGTT CGTGGAACTT GCTCGATCAA GACGGTCTCG AAGTCTTGCT CGAGTGCCAA
GCGCGAAACA TCAAGGTGCA CAACGCCGGC GTCTTCGCCA GCGGCGTCCT CGTCGGCGGT
TCGCACTACA AGTACGGCCC CGCTCCCGAC GAGATCAAGC GGCGCACCGA AAAGTGGAAC
GTCTTGGCGC GCGCGTACGA CATCCCTCTC CCCGCCATCG CCCTCGCCTT CGCCCTCACT
CCCGAAGTCG TCGACTCGTG CGCCGTCGGC GTCAAGTCCC CCGACGAGGT CGCCCAATCC
GTCGCCTGGC TCGCCGACGC CGCCCGCGTC CCGCGCCAGC TTTGGCTCGA CGCGTTTTCT
CAAGGTTTAC TCGCGTGGAT CCCGTCCTAG GCCCTCGTTC GTC
 
Protein sequence
MASGRAGTPA ATLGTLRAHD AKRALAVDRL ILGASPLAGI YRGVDASEAA ATVRAALDVG 
FTRFDTAPHY GLGLSETRLG EALRTHAKKA VKVYTKVGRV MKPMDEVTES ERESVVEWGN
VPGNDGCIFP DAPRDVLPVL DYSANGFVRS HADSLNRLGI DRIEGLRIHD AETPERFEAA
TTGGGVRALT ALRDAGTISE VSLGMNDASY VLRMIRENPP GTFDSVMMAG SWNLLDQDGL
EVLLECQARN IKVHNAGVFA SGVLVGGSHY KYGPAPDEIK RRTEKWNVLA RAYDIPLPAI
ALAFALTPEV VDSCAVGVKS PDEVAQSVAW LADAARVPRQ LWLDAFSQGL LAWIPS