Gene OSTLU_3507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3507 
Symbol 
ID5004692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp374231 
End bp375220 
Gene Length990 bp 
Protein Length330 aa 
Translation table 
GC content54% 
IMG OID640420113 
Productpredicted protein 
Protein accessionXP_001420832 
Protein GI145353025 
COG category[O] Posttranslational modification, protein turnover, chaperones
[R] General function prediction only 
COG ID[COG0652] Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family
[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.22649 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTCC TGCTCGAGAC CTCGAAAGGG GACATCGTCA TCGACCTGTT CGTCGACGAC 
GCCCCGAAGA CGTGCCTGAA TTTCGTCAAG CTGTGCAAGA TGAAGTATTA CAACCACTGC
AAGTTTTTCG ACATTCAAAA GGATTTCATG GCGCAGACGG GCGATCCCAC GAACACCGGG
CGCGGTGGAG ACTCGGTGTT TAAGTTTCTG TACGGCGAGC AAGCGCGGTT CTTCGAGGAT
GAGATACACG CGACGCGGAA GCATGAAAAG TTTGGAACGG TGAGCATGGC GCCGGCGGCG
GAGAACGCGA ACGCGTCGCA GTTCATCATC ACGACGCGCG CGGGCGCGGT AGACGCGTTG
GATGGGACGC GAACGATATT TGGGGAGGTT TCAGAGGGGA TGGACGTGTT GAGGGCGATT
AATGAGGCGT ATTGCGATGA AAGTGGACGA CCGTGGCAGA ATATTCGCAT AAAGCACTGC
GTCGTGCTGG ACGACCCGTT CGACGATCCC CCGGGGTTCG CCGAACTCGT CCCGGACGCG
TCGCCGCCTA GAAAAGAGGA CCCCGATGAT GATAGGTTAG AGGATGATTT CGATGTCAAC
GCGGAGATGA ATAAGGATGA GGCACAGATT GAAGAGGAGA AGCGTGGGCG CGAGGCGCAC
AATCGAGCCG TCGTCCTTGA ACTCATAGGA GACTTGCCCG ATGCGGACGC AAAACCGCCG
GAAGAATCGC TCTTTGTGTG CAAATTGAAC CCAGTGACTA CGGATGAGGA TTTAGAAATC
ATTTTCAGTC GATTTGGTAA GGTATTGTCG TGCGACGTCA TTCGCGACTT CAAGACGGGA
GCGAGTCTTG GGTACGCCTT CGTTAACTTT GAGCACAAAC ACGAGGCCGA GCAGGCGTAC
TTCAAGATGG ATAATGTTTT GATCGACGAC AGACGCATTC ACGTCGACTT TTCGCAATCG
ATGCATCATC TTTGGAAGAA TTTCAAGCGT
 
Protein sequence
MAVLLETSKG DIVIDLFVDD APKTCLNFVK LCKMKYYNHC KFFDIQKDFM AQTGDPTNTG 
RGGDSVFKFL YGEQARFFED EIHATRKHEK FGTVSMAPAA ENANASQFII TTRAGAVDAL
DGTRTIFGEV SEGMDVLRAI NEAYCDESGR PWQNIRIKHC VVLDDPFDDP PGFAELVPDA
SPPRKEDPDD DRLEDDFDVN AEMNKDEAQI EEEKRGREAH NRAVVLELIG DLPDADAKPP
EESLFVCKLN PVTTDEDLEI IFSRFGKVLS CDVIRDFKTG ASLGYAFVNF EHKHEAEQAY
FKMDNVLIDD RRIHVDFSQS MHHLWKNFKR