Gene OSTLU_41786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41786 
SymbolClpY_13 
ID5005220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp483684 
End bp485312 
Gene Length1629 bp 
Protein Length542 aa 
Translation table 
GC content59% 
IMG OID640420641 
ProductClpYQ (HslUV)-type protease, ATP-dependent subunit ClpY (HslU) 
Protein accessionXP_001421162 
Protein GI145353738 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1220] ATP-dependent protease HslVU (ClpYQ), ATPase subunit 
TIGRFAM ID[TIGR00390] ATP-dependent protease HslVU, ATPase subunit 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value0.640054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.383571 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGGC GCGCGTTCGG CGCGCTCGCG CGCGCGTGGC GCGCCGCGCG CGACGCGGAC 
GTCGACCAAC GATGCGCCGC GACGCTCGCC GCGCGCGCGA CGACGACGAC GACGACGACG
ACGACGACGA CGACGAAAAC GCACCGCGGC GAGCGACCCT ACGGCGCGAC GGGACCGTCG
CGCGTCGCGA AACCGCCGAA ACCGCCCACG AAGCGGTCGA CGATCGACGT GACGATCGAC
GTGACGCCCG CGGACGCGCG CGACGGGCGA GACGCGAGGG AAGAGGACGG ACGCGGTGGG
TTGACGCCCG AACGCGCGAC GGCGCTGCTG GACAAGCACA TCGTCGGACA GACGGAGGCG
AAGCGGGCGT GCGCGGTGGC GCTTCGGAAT CGATGGCGGC GACACAGGAT CGGGAAACCG
ATGCGAGACG AAATCGTGCC GAAAAATATC CTGATGATCG GCCCGACGGG GTGCGGGAAG
ACGGAAATCG CGCGACGATT GGCGAAGATT ACGGATTCGC CCTTCGTAAA GGTGGAGGCG
ACGAAATTTA CAGAGGTTGG ATTTCACGGG CGAGACGTCG ATCAGATCAT TCGAGATTTG
GTGGATAATG GGATCGCGGT GACGAAACAA AAGATGCGCG CGAAGTTTGA AAAGTTTGTG
GAGGAGTTGA TCGAGAATAA AATTTTAGAT TTTGTGTGCG GGGAGGGGGC CAACGACGAG
ACGCGCGAGG CGTTTCGGAC GTTGTATCGA GACGGTACTT TGGACGATCG CACGATCGAG
GTTGAGTTGC CGGATTCGGG TGGGCAAAAC ATGAAGATCG ACCCGAGCGG GGGACCGATT
CCCATTCACG AATTAGTCAT CAAGGTTGAT AAATTGTTCG GAAACCGCAA GAGCACGTCC
AAGCGTAAAA TGACGGTGGC CGAGTGCAAA CCTTTGATCG AAGAGATGGA GTTTGACAAC
TTGTTGTCCG CGGAGACGAT CGCGAAAGAG GCGATCACGG CGGTGGAGAA CGATGGGATC
GTGTTCATAG ATGAGATTGA TAAGATTGTG TCGTCGAGCG ATTACCGTCA CGGCGCCGAC
GCGAGCTCCG AGGGCGTGCA GCGCGACTTG TTGCCGATCA TCGAAGGCTC CGTCGTGAGC
ACGAAGCACG GCAACGTCAA CACCGACCAC ATCTTGTTCA TAGCCTCCGG AGCGTTCCAC
AGCGCGAAAC CGAGCGACAT GCTCGCCGAG TTACAGGGAC GATTGCCCAT TCGCGTCGAG
CTCAAAGGTT TGACCGAGCG CGACTTGTAC CGAATTTTAA CCGAACCGGA GATGAACATG
ATCGCGCAAC AAAAGGCGCT CATGAAGACG GAAGGGATCG ATTTAGAGTT CACGAACGAA
GCGATCGAGC ACATCGCGAG CATCGCGGCC AAGGTGAACA AAACGGTCGA CAACATCGGC
GCCAGACGAC TGCACACCGT GCTCGAACGC ATCGTCGAGG ATTTGTCGTT TGACGCGCCC
GAGAGATATG CGAAATTCGT CGCGGCGGGC GGCAAAGGCG AGCTGCAGGT CAAGATTGGC
GTCAAAGATA TCGACGACGC GATCGGCAAC ATGCTCAAGC AAGAGGACTT GAGTCGATTT
GTACTTTAG
 
Protein sequence
MSRRAFGALA RAWRAARDAD VDQRCAATLA ARATTTTTTT TTTTTKTHRG ERPYGATGPS 
RVAKPPKPPT KRSTIDVTID VTPADARDGR DAREEDGRGG LTPERATALL DKHIVGQTEA
KRACAVALRN RWRRHRIGKP MRDEIVPKNI LMIGPTGCGK TEIARRLAKI TDSPFVKVEA
TKFTEVGFHG RDVDQIIRDL VDNGIAVTKQ KMRAKFEKFV EELIENKILD FVCGEGANDE
TREAFRTLYR DGTLDDRTIE VELPDSGGQN MKIDPSGGPI PIHELVIKVD KLFGNRKSTS
KRKMTVAECK PLIEEMEFDN LLSAETIAKE AITAVENDGI VFIDEIDKIV SSSDYRHGAD
ASSEGVQRDL LPIIEGSVVS TKHGNVNTDH ILFIASGAFH SAKPSDMLAE LQGRLPIRVE
LKGLTERDLY RILTEPEMNM IAQQKALMKT EGIDLEFTNE AIEHIASIAA KVNKTVDNIG
ARRLHTVLER IVEDLSFDAP ERYAKFVAAG GKGELQVKIG VKDIDDAIGN MLKQEDLSRF
VL