Gene OSTLU_31659 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31659 
Symbol 
ID5001915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp301730 
End bp303438 
Gene Length1709 bp 
Protein Length550 aa 
Translation table 
GC content60% 
IMG OID640417336 
Productpredicted protein 
Protein accessionXP_001417982 
Protein GI145347029 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0143795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0189648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGATGACG CGCGCGACGA AGCGCGCGGG AAAGACGCGC GCGACGAAGG CGCCGGCGCG 
CGCGTCGAAG CGCGCGCGAG GGGAGACGCC GGTGAAGGAG GAGACGACGC GACGAAGGCG
AGGCGGCGCG CGAGACGCGG ACGAGGACGA CGAGACGGAC GCGACGGAGG CGCCGGCGAG
CGGACGGTCG TCGGAGGACG CGGTGGATTA CTTTGGGGAC GTCGATGGAT GGGCGAGCGA
CGACGAGGAC GCGCCGCAGT ACCCGCTGGA CGCGGTGGTG AAGGTGTTCG CGACGCACAC
CGAGCCGAAC TGGAGCCTGC CGTGGCAGCG AAAGCGACAG AGCTCGAGCA CGAGCACGGG
GTTCGTCATA GAGGGGAATA TGGTGCTGAC GAACGCGCAC AGCGTGGAAC ATCACACGCA
GGTGAAGCTG AAGAAGCGCG GGAGCGATAA AAAGTACGTG GCCAAGGTGT TGACCATCGG
CGTGGAGTGC GACTTGGCGC TGTTGACGGT GGAGGAGAAG GAGTTTTTCG AGGGCGTGGC
GCCGGTGAAG TTCGGCGTGT TGCCTCGCTT GCAAGACAGC GTCACTGTGG TTGGGTATCC
GGTGGGTGGG ATCGCGATTA GCGTCACGAG TGGGGTGGTT AGTCGGATAG AGGTGACGTC
GTATTCGCAC GGCGCGACGG AGCTTTTGGG GGTGCAAATC GACGCGGCTA TCAATTCTGG
CAACAGCGGG GGTCCGGCGT TCGGGCGCGA AGGGCAGTGC GTGGGTGTGG CGTTTCAATC
GCTCAAGGAC TCGGACACCG AAGGAATCGG CTACATCATA CCGACGCCCG TCGTCGACCA
TTTTATTAGC GACTTCAAGA GGACGGGCGT GTACAACGGT TTCCCGGCGC TGCAGTGCGA
GTTCCAGCGC TTGGAGAACC CATCGCTTCG GAAGAGTCTC GGGATGAAAC CCGCGCACAA
CGGCGTCTTG CTTCGACGTC TATCGCCTCT CGCGCCGGCT GCCAAGGTGT TGAAACGCGG
CGATGTGTTG ATGAAATTCG ACGGCGTCGA CGTCGCTTCC GACGGCACCG TGGTTTTTAG
AACGGGCGAG CGCATTAACT TTTCTTATTT AGTCTCTCGC AAGTACGTCG GCGATAGTGC
TGCGGTCACC GTCCTACGCG ATGGCAAGAT GATGAATTTC GACATTTCGT TGACGCCGCA
CGACCGCCTC GTTCCGGTGC ACATCGAGGG CAAGCCTCCG TCGTATTACA TTTGCGCGGG
CATCGTCTTC ACCGTGGTGT GTGTGCCGTA CTTGCGATCG GAATACGGTA AAGATTACGA
TTACGACGCT CCGCTGCGTT TACTGACGAA GATGATGCAC GGGCACAAGG AGAAGCCAGA
CGATCAAGTC GTCGTCGTGA GTCAAGTGCT CAACTCGGAC ATCAACATTG GCTATGAAGA
CATCGTCAAC GTCGTCGTGT GCGGCGTGAA CGGTAAATCC GTGAGAAACT TGCGCGAACT
CGTGAAAATC GTCGAAGGCT GCAAGCACGA GTACTTGAAA ATCGAGCTCG ATCAATCGAT
ACAAATCGTG CTCGAGACCA AGGCGGCGAA AAAATCCACG AAGGAAATTT TACACACCCA
CTGCATCCCG AACGCGTCGA GCGTGGACTT GCGCTGAGCG CGAGGCGAGC GCTTCAAAGA
CGACGATGAT GATTAGTATC GACTAGCAC
 
Protein sequence
MTRATKRAGK TRATKAPARA SKRARGETPV KEETTRRRRG GARDADEDDE TDATEAPASG 
RSSEDAVDYF GDVDGWASDD EDAPQYPLDA VVKVFATHTE PNWSLPWQRK RQSSSTSTGF
VIEGNMVLTN AHSVEHHTQV KLKKRGSDKK YVAKVLTIGV ECDLALLTVE EKEFFEGVAP
VKFGVLPRLQ DSVTVVGYPV GGIAISVTSG VVSRIEVTSY SHGATELLGV QIDAAINSGN
SGGPAFGREG QCVGVAFQSL KDSDTEGIGY IIPTPVVDHF ISDFKRTGVY NGFPALQCEF
QRLENPSLRK SLGMKPAHNG VLLRRLSPLA PAAKVLKRGD VLMKFDGVDV ASDGTVVFRT
GERINFSYLV SRKYVGDSAA VTVLRDGKMM NFDISLTPHD RLVPVHIEGK PPSYYICAGI
VFTVVCVPYL RSEYGKDYDY DAPLRLLTKM MHGHKEKPDD QVVVVSQVLN SDINIGYEDI
VNVVVCGVNG KSVRNLRELV KIVEGCKHEY LKIELDQSIQ IVLETKAAKK STKEILHTHC
IPNASSVDLR