Gene OSTLU_12848 details


Gene Information

Locus tag: OSTLU_12848
Symbol:
ID: 5003707
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 16072
End bp: 17220
Gene Length: 1149 bp
Protein Length: 220 aa
Translation table:
GC content: 63%
IMG OID: 640419128
Product: predicted protein
Protein accession: XP_001419466
Protein GI: 145350122
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0450] Peroxiredoxin
TIGRFAM ID:
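
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (which matches the listed gene length) and that the stop codon is counted as part of the gene. The helper names are illustrative and not part of any database API. The non-coding remainder it computes is consistent with the gene being flagged as spliced, but the record itself does not list intron coordinates.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 16072
END_BP = 17220
PROTEIN_LENGTH_AA = 220


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_bp = span_length(START_BP, END_BP)    # should agree with "Gene Length"
cds_bp = coding_length(PROTEIN_LENGTH_AA)  # coding nucleotides incl. stop codon
non_coding_bp = gene_bp - cds_bp           # remainder not accounted for by the CDS

print(gene_bp, cds_bp, non_coding_bp)
```
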


Plasmid Coverage information

Num covering plasmid clones: 64
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGACG CGCGCGCGCG CGTCCGGAGC GCCTCGTCGC CGCGAGCGCG CGGTGCGGTG 
GTGAAGGTGA AGGTGAATGC GCGTGCGGTG TGCGTGCGAT AACGTCAATC GGGCGTGCGT
CGCCGTCGTG TCGGTGTCCT CGGAGTGCGG TCGTCGTCCG TCGCAAGATT AAGATTGATT
TCCAGGTGTT GGCCGCGCGC GTCGAAAATC GGATTTCCCC GACCGCGCGG TCGGCGATCG
ACGTCGACGC GACGTCACTT TCGAGACGAC GACGACGACG ATGGCGAGCG CGATGACGAG
CACCTCTGCG TTCACCCCGA CCACGGCGGG GCTGAAGGCG CGGCGCGCGA ACAAAAACTT
CTCGCGATCG ACCGTTCGCG TGGTGCGCGC GAGGCGAGGC GAAGCGACGA CGGACGACGA
CGGCGCGAAC GACGATGCGA TCGGGGAAGG ATAAAGAAGC CCGCGCGAAC GGTCGAGCCG
GCCGAGGATC GCGGAGGGAG GCCGCGAGGA CACGGGCGAA GACTGACGAT CACCGTTACC
ACGATTTCCG ACTCGCAGCA AGCGCGCAAG CCGTTGGTCG GGTACGAAGC GCCGGACTTC
AGCGCCGAAG CCGTCTTCGA CCAAGAGTTC CAAGACATCA AGCTCAGCGA TTACCGAGGC
AAGTACGTGG TGTTGTTCTT CTACCCGCTC GATTTCACCT TTGTGTGCCC GACGGAAATC
ACCGCGTTCT CTGATCGTTA CGAAGAGTTC GCCAAGCTCA ACACCGAAGT TCTCGGCTGC
AGCGTCGACT CCAAGTTTTC CCACTTGGCG TGGTTGCAAA CGGACCGCAA CGACGGCGGT
CTCGGCGACT TGGCGTACCC GCTCGTGAGC GACCTTAAGC GCGAAATCAC CGAGGCTTAC
GACGTCCTTT ACGAAGACGG CACCGCGCTC CGTGGTTTGT ACATCATCGA TCGCGAAGGC
GTCATTCAGC ACAGCACCGT CAACAACGCT CCGTTTGGCC GCTCCGTCGA CGAAACGCTG
CGCGTGCTTC AAGCCATCCA GCACGTGCAA AACAACCCGG ATGAAGTCTG CCCGGCGGGC
TGGACCCCGG GTGCGGCGAC GATGAAGCCG GATCCGAAGG GTTCCAAGGA ATACTTCAAG
GCCATCTAA
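
The gene sequence above is displayed in space-separated blocks of ten bases. A minimal sketch for stripping that layout and computing GC content is shown here. The function names are hypothetical, and the demonstration uses only the last few blocks of the sequence, so its result is not the whole-gene figure reported in the record.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace from block-formatted sequence text and upper-case it."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final blocks of the gene sequence, used only as a short demonstration.
tail = normalize("GTTCCAAGGA ATACTTCAAG\nGCCATCTAA")
print(len(tail), round(gc_percent(tail), 1))
```

To check the record's GC content field, the full gene sequence text would be passed through `normalize` before calling `gc_percent`.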
 
Protein sequence
MRDARARVRS ASSPRARGAV VKVKQARKPL VGYEAPDFSA EAVFDQEFQD IKLSDYRGKY 
VVLFFYPLDF TFVCPTEITA FSDRYEEFAK LNTEVLGCSV DSKFSHLAWL QTDRNDGGLG
DLAYPLVSDL KREITEAYDV LYEDGTALRG LYIIDREGVI QHSTVNNAPF GRSVDETLRV
LQAIQHVQNN PDEVCPAGWT PGAATMKPDP KGSKEYFKAI