Gene OSTLU_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3562 
Symbol 
ID5000677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp107340 
End bp108326 
Gene Length987 bp 
Protein Length329 aa 
Translation table 
GC content62% 
IMG OID640416098 
Productpredicted protein 
Protein accessionXP_001416558 
Protein GI145344062 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GAGCGCGAGA CGGTGCGATT GTTTAATAAC GCCAAGGCGA GCGTCGTGTA CATCACCAAT 
GTCGCCGTGC GCAGAGATGC GTTCACGTTG AATCTCACGG AACAGCCGCA AGGGGCGGGG
AGCGGCATCG TCTGGGACGA CAAGGGGCAC ATCGTCACCA ACTACCACGT CATCGACAAG
GCGAATCAGT TGAAGGTGTC GTTTTTGCCG AATAAAGGCG GGGTGCAGAA TCAGAAGACG
TACGACGCCG CAATCGTTGG GTTCGACGAC GATAAGGACA TCGCCGTGCT GCAGGTGAAC
GACCCAGAGG CGCTGCGGGA GATGAAACCG CTCGTCATCG GAACGAGCGG AGACTCCATG
GTGGGCCAGC GTGTCTTCGC GATCGGGAAC CCGTTTGGGC TCGATCACAC GCTCACAACC
GGCATCATCA GCGGCCTCGG ACGAGAGATT CAAAGCGGTA ACACCGGGCG CCCAATCGAC
GGCATCATTC AAACGGACGC GGCGATCAAT CCCGGCAACT CGGGGGGCCC TTTGTTGAAT
TCGTCGGGAC AGCTCATCGG CATCAACACC GCGATTTATT CCGCGTCCGG GACGTCCAGC
GGCGTGGGAT TCGCCCTCCC GAGCGACATG GTGAGCGGTA TCGTCGATCA AATCATTCGT
TACGGTCGCG TGACGCGTCC GATTCTCGGC GTCTCCTTCG CCCCCGACGG CGCGCTCGAC
CAGCTCGGCC TCGGCGGCGT GTTGGTGCTC GACGCTCGCG CGGGCGGTCC CGCCGCGCGC
GCCGGCGTCC GCAGCACCAC GCGCGACGAA TCCGGCCGTC TCATCCTCGG CGACATCATC
ATCGAGCTCG CGGGCGAGCA AATTCAAGAC TCCAGCGATT TATACCGCAC CCTCGACAAG
CTCTCCGTCG GCGAAACCGT CGACGTGACG CTCTTGCGAG GCGTCGACAA AGTCTCCGCC
CGCGTCACCC TCGACGACGT CAAGGAC
 
Protein sequence
ERETVRLFNN AKASVVYITN VAVRRDAFTL NLTEQPQGAG SGIVWDDKGH IVTNYHVIDK 
ANQLKVSFLP NKGGVQNQKT YDAAIVGFDD DKDIAVLQVN DPEALREMKP LVIGTSGDSM
VGQRVFAIGN PFGLDHTLTT GIISGLGREI QSGNTGRPID GIIQTDAAIN PGNSGGPLLN
SSGQLIGINT AIYSASGTSS GVGFALPSDM VSGIVDQIIR YGRVTRPILG VSFAPDGALD
QLGLGGVLVL DARAGGPAAR AGVRSTTRDE SGRLILGDII IELAGEQIQD SSDLYRTLDK
LSVGETVDVT LLRGVDKVSA RVTLDDVKD