Gene OSTLU_40456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40456 
Symbol 
ID5005780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp450459 
End bp452963 
Gene Length2505 bp 
Protein Length834 aa 
Translation table 
GC content60% 
IMG OID640421201 
Productpredicted protein 
Protein accessionXP_001421802 
Protein GI145355086 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGGCGTTCGT TGCTCGGAGA CGTGAGCATC GCGAAGAGCA TGGGCGCGCC GACGCTGTGG 
GAGCAAGGGT ACACCGGGGC TGGGGTGAAG ATGGCCGTCT TCGACACCGG TATCGCCGAA
GATCATCCGC ACTTTCGGCA CATCGTCGAA CGCACGAATT GGACCAACGA AAACCAACTC
CACGACGGGT TGGGACACGG GAGCTTCGTC GCGGGCGTCG TCGCGGGAAC GTCGAAACAG
TGCGCGGGAT TCGCGCCTGA TGCCCTCATA CACACGTTTC GCGTATTCAC GAACGATCAA
AACTCGTACA CGTCGTGGTT TCTCGACGGG TTCAACTACG CCATCGCCAC GGGCGTGCAC
GTCTTAAATC TTTCCATCGG CGGGCCTGAT TACTTGGACC GCCCGTTCAC GGACAAGATC
AACGAGATCA CGGCGGCGGG AATCATCATG ATTAGCGCGA TAGGAAACGA TGGGCCGCTG
TACGGAACGT TGAACAACCC GGCGGATAAC TTGGACATCA TCGGCGTGGG CGGCATCACC
GACGCCGACG CCATCGCGCA CTTTAGTTCG CGCGGGATGA CCACGATGGA GCTACCCAGT
GGATACGGCC GTGTGAAACC CGATATCGTG ACGTATGGCG ACGGTATCTG GGGAAGTAAG
GTTGGCGATG GCTGCCGCGC GCTTTCGGGG ACGTCCGTCG CGTCGCCCGT CGCCGCGGGC
GCGGCGGTGT TGCTCAGCTC CATCATACCC GAGAGCGAGC GATGGTCGGT GTTAAATCCA
GCGGTGATGA AACAGGCGCT CGTGGAAGGG GCGACGCGAA TACCGTTAGC ACATCGCTAC
GAGCAGGGGG CGGGGAAGCT GGATTTGTTG AAATCGGCGG AGATTCTGAA GACGTACACG
CCGCGCGCCT CGGTGATTCC GGCGACGTTT GACTTGACGG AGTGCCCGTA CGCGTGGCCG
CACTGCAAAC AAGGAATCTA CGCCACGATG ATGCCGCTGA TGTTGAACGC GACCATCGTT
AACGGTCTCG GCACGCACGG GGAGATTGTG AGCGGGCCCG ATTTCATCCC GAACGACGGC
GACTTGGGCG CGCACCTCGA CGTTCGCTTT GCGTTTTCGG AGACGCTTTG GCCGTACTCT
GGGTTCTTGG CGTTGTACGT GCGAGTCCAG GACGACGCGG CGACCAAGTC TGGCGTCGCG
AGTGGGCGCG TGCGATTCAC CGTCGCTTCG TCGGGCGCGC GCGGAGAGAC GAAATTGCGC
GTGTCGGAGG TTGAGATGAC GCTCAAGGTC AACGTCGTGC CGACGCCGAG TCGCGACAAG
CGATTGCTGT GGGACCAATT TCACAACGTG CGGTATCCAC CGGGATACAT TCCGAGAGAT
AACATCGACA TGAAGCAAGA CGTTTTGGAT TGGCACGGCG ATCACCCGCA CACGAACTTC
CACCAATGGT ACGATGATCT GACGCGGGCG GGTTACTTTG TCGAAGTCTT GGGATCGCCG
TTCACGTGCT TTGACGCGAA GAATTACGGG GCGCTACTGC TCATCGATCT CGAAGAGGAG
TACGCGAGCG ATGAAATCGC CAAACTCACT CGAGACGTTC GCGAAGAAGG ATTGGGGCTG
GTCGTCTTCG CCGATTGGTA CGATTTGACG ACGATGGAAA GCCTGAAATT TTTCGACGAC
AACACGCACT ACGAATGGCA CGCCGCCACG GGTGGAGCCA ACGTGCCGGC TTTGAATGAT
CTTTTGAAAG ACTTTGGCAT TCAGTTCGGG GGCGAGGTCA CGGAAGATAG CATCGTTTTC
GATGACGACC AAGTCATCGT CTCGAGCGGA ACCCACATCA CGCGCGCGCC GGCGGGCGCG
TATTTGCACA GCGAACGCAT GACGCAGCGC GGCAAGGCGG AGGGCGATTA CGCATTTTTA
TCGCTCTTTG AGGCGGGTAA AGGAAGAATC TTCGCGTTCG TCGACTCTAA CTGCGTCGAC
AGCTCGCACA TGCGCGGTCA ATGCTTCGGT TTCGCGCGAA AAGGTGTCGA GTTCGCCGTC
GGCGGTTCGT GCGCCGCCGC GCACTGCGAC GATCGAAAGC GTCTCGCCGA ATCGTGGTCG
GACTCGAAGC CTCTTCCCTC GCGAAGAACA GACGTGGACT TTTCACGGTT TAGCACCGTT
CTCGGAGGAC ACCCTGGGAA CGAAGGTTCG ATGACGTGCG GCCCCAACGC TCCTTTGGAC
CGTCACGACG CCAAGGCGGC GTACAGCGAT CTTCCGGGGC GATTGAAAGT GTCCGAGTCG
GAGAAGACGT CTTTGAAGAC GACCGTTCCA CAGCGCCCGA TGAAGGATTC ACGCACGACG
CTCGATGTGA AAACAGATTA CGTCGCGACG GATCGCGCGG ACGACGGGTC GAACGTTCGA
TATTACGTCG TCGCCTTTGG CGTCGTCGCC GCGTTGCTGG CGCCGCGCGC GAGACGGCGC
AGGGCGCGAC GCGCGCGTCT TCGAGCGAGC GGCGGCGCGT CTTAG
 
Protein sequence
RRSLLGDVSI AKSMGAPTLW EQGYTGAGVK MAVFDTGIAE DHPHFRHIVE RTNWTNENQL 
HDGLGHGSFV AGVVAGTSKQ CAGFAPDALI HTFRVFTNDQ NSYTSWFLDG FNYAIATGVH
VLNLSIGGPD YLDRPFTDKI NEITAAGIIM ISAIGNDGPL YGTLNNPADN LDIIGVGGIT
DADAIAHFSS RGMTTMELPS GYGRVKPDIV TYGDGIWGSK VGDGCRALSG TSVASPVAAG
AAVLLSSIIP ESERWSVLNP AVMKQALVEG ATRIPLAHRY EQGAGKLDLL KSAEILKTYT
PRASVIPATF DLTECPYAWP HCKQGIYATM MPLMLNATIV NGLGTHGEIV SGPDFIPNDG
DLGAHLDVRF AFSETLWPYS GFLALYVRVQ DDAATKSGVA SGRVRFTVAS SGARGETKLR
VSEVEMTLKV NVVPTPSRDK RLLWDQFHNV RYPPGYIPRD NIDMKQDVLD WHGDHPHTNF
HQWYDDLTRA GYFVEVLGSP FTCFDAKNYG ALLLIDLEEE YASDEIAKLT RDVREEGLGL
VVFADWYDLT TMESLKFFDD NTHYEWHAAT GGANVPALND LLKDFGIQFG GEVTEDSIVF
DDDQVIVSSG THITRAPAGA YLHSERMTQR GKAEGDYAFL SLFEAGKGRI FAFVDSNCVD
SSHMRGQCFG FARKGVEFAV GGSCAAAHCD DRKRLAESWS DSKPLPSRRT DVDFSRFSTV
LGGHPGNEGS MTCGPNAPLD RHDAKAAYSD LPGRLKVSES EKTSLKTTVP QRPMKDSRTT
LDVKTDYVAT DRADDGSNVR YYVVAFGVVA ALLAPRARRR RARRARLRAS GGAS