Gene OSTLU_17568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17568 
Symbol 
ID5004650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp215234 
End bp217681 
Gene Length2448 bp 
Protein Length815 aa 
Translation table 
GC content57% 
IMG OID640420071 
Productpredicted protein 
Protein accessionXP_001420622 
Protein GI145352587 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5160] Protease, Ulp1 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.180274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00230003 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGACGCGC GGATGCGGCG CGCGAGGCGC GAGCACGAGA AAATCGCGCG AACGTACGCG 
GAGGCGCACG AGAACGTGGG GACGAGGGAA AACGGGACGA CGCGCGACGC CGACGCGCTG
CGAAGGGAAC GCGTCAAGAC GAGTCATCTG ATAGCCAAGG TGAACGAAAC GAAAGTCGGC
GCGAACGCGA CGCTGGTGAG AAGGAAGGAG TATTTAGACC GGATGATCGC GAACGCGGAG
GAACGGGAGG CGTTGATGGA GGACGGCGGA TCGGAGGCGC GAGGCGGAGG CGCGGCGACG
ACGACGACGC GCCGGGACGC GCCGCAGGAA TGGACGACGA TGGGGCAGAT GGCGGAGAGA
AAGTCGCCGG CGAAGTCGCT GAGGGAGCGC GATGCGAGAG ATTTCCAGCC GGAAAAGTAT
AGATTTGACG ACGAAAAGGC GTCGCCGAGC GCGGCGGAGG CTGTCAACGG GTTGAGCAGA
GCGTTGGACG ACATAGACGG AGATTTAGGG TCGATGTCGT CTCTGTCGCC CGATAGATAT
CCAGGACGAA AAGAAACGGC GAGTTGGGAC ACGTTAGCGC GACAACCGAC GTCGCACGGC
AGTAGACAGT TGGACGCCGC GCTCGGGCGC GCGGGGGCTC CCGAGGCGAA GTCGACGCCG
ACGACGATTT CGACGTACTT TAAGAAGAGC GCGGGTGCGA CGCAGCCGAC GACGACGACC
GCCGCTGTGG ACGATTTCGC CGCTGCTGCG ACCGACAAGA GTCGCGAGCG TCTGCCGATG
TCGCCGCCGC CGCCCAAGCG TCGAGCGAAG TCGACGCGCG GCGACCCGGC GTCTCCCATG
GTGGTGGATC TTCTGGAGAC GAGCGACGAG GAGAACCCGT TGGAGAACGA GTCAAGAGGG
GTGCAAAGCT TGGATCCAGT GAGTCTTACG AGCGAGAATG GTGAAATTCG CCGATCGACG
CGCTCGAGTA GGTATCGCTC GCTGCAGAAC ACCGTCGGCA ACTTGGCGAC GGATTACCCA
GACTCGAAGA CGAAGGGGTC GGTGCAAATC ACTCTCGGCG ACTTGGAGCA CCTCAGAGAC
GGCGAGATGC TGAACGATCA GTGCGTGGAC TTTTATTTGA AGTACATTCA AGTGGAAATG
TTGGGCGCGA ACGCGTTTGA GATATTAGAC AAGGTGCACA TTTTTAACAG CTTTTTCTAT
CAAAAGCTCG CGCAAAAGCA CGATAGAGAT CGAAGCAACG TCGACGCCGC GACGGCATCG
CACGCACGCG TGAAGAATTG GACCAAGGGT GTGGATATCT TTACGAAAAG CTTTCTCATG
ATTCCCGTGC ACAGCAATTT GCATTGGTCT TTAGTCATCG TTTGCTATCC AAACGGTACT
GATGAGCGCC AACCGATGAT GCTCCATCTC GACAGCATGA CGCAACATGG CGGGCACAAC
TCAGAAGTCG TGTCGAAGAC GGTCCGGCGT TACCTGAGCA AAGAATGGAA AACGCAAAAG
GGCGACGACA CTGAATCAAA ATTCGACGCT CGATACATGC CAACGTATCG CGTCAATGTT
CCGAGACAAA ACAATGGTTG CGACTGCGGC GTTTTCATTT TGGCCTTTCT CGAAAAGTTT
TTAACCGAAC AACCCGAGAT TTTGAAGAAG AGCGATGTTC AACGCGCGGC GCAAAAGCGC
TCGTTTGGAA TGGACGATGC GGGCAAGTTT CTACGAAAGA ACTGGTTTCC CAACGAGTTC
GTGGACGAAT TGCGCGCCAA GTTGAGTCTC CTCGTCATCC AGCGCATTCA GGCTTCGCTC
GCGGAAAACG ACGCGTCAAA GCTCATTCTC AACGCGGCAA ATGAAGAGCA AATGAAAGAC
TATGAGCATC GGCAGTGGCG AACGAAACAA GCGGTTTCAA AGGCTGAGCG TGACCGTAAG
AAGAGAATGG AAGCCATGGA AGCGGAGCGA AGACGCAAGC TCGAGCCGCA AGACCTCTCT
GACGACGACG ACGCCAAAAC CAAAGCCTCC GACGCCAAGG ATGTAGACTT TGAGATTTCT
CGCGAAGTCA GGGTGAAGAA ACAGCCGACG AAGACTAGGC AGACGCAGAT ATTCTCGGGA
TCCCACTGGA ACAAGGCGAG AGAGAAAGAA CCCGTGCGCG AGGAGACTCG CGACGAGAAG
CCATTCGAAC CCTTCGCGGG CCCGTCGTAC AAAATCGGGC GACACACGCG CGCGCGTGCG
CCTTCTCCGA GCGCTGATTG GACGAGCGGA ACGACCCGTC CCAAGTATGG TAACGCCAAC
CTTTTGAAAC GAGCGGAACG ATCGCGAGCG AAGCAAAACG GGGAAGAGGA ACGGAGCGCC
GCCGACGACG AATCCCCGGG CCAGGCCAAG GCGAAGCTCA ACGACGCGAA TGAAAAACTT
TCCGAGAGAA TCAAGGGGAT GACGTCTCGA GTGTTCTCGA CGAAATAG
 
Protein sequence
MDARMRRARR EHEKIARTYA EAHENVGTRE NGTTRDADAL RRERVKTSHL IAKVNETKVG 
ANATLVRRKE YLDRMIANAE EREALMEDGG SEARGGGAAT TTTRRDAPQE WTTMGQMAER
KSPAKSLRER DARDFQPEKY RFDDEKASPS AAEAVNGLSR ALDDIDGDLG SMSSLSPDRY
PGRKETASWD TLARQPTSHG SRQLDAALGR AGAPEAKSTP TTISTYFKKS AGATQPTTTT
AAVDDFAAAA TDKSRERLPM SPPPPKRRAK STRGDPASPM VVDLLETSDE ENPLENESRG
VQSLDPVSLT SENGEIRRST RSSRYRSLQN TVGNLATDYP DSKTKGSVQI TLGDLEHLRD
GEMLNDQCVD FYLKYIQVEM LGANAFEILD KVHIFNSFFY QKLAQKHDRD RSNVDAATAS
HARVKNWTKG VDIFTKSFLM IPVHSNLHWS LVIVCYPNGT DERQPMMLHL DSMTQHGGHN
SEVVSKTVRR YLSKEWKTQK GDDTESKFDA RYMPTYRVNV PRQNNGCDCG VFILAFLEKF
LTEQPEILKK SDVQRAAQKR SFGMDDAGKF LRKNWFPNEF VDELRAKLSL LVIQRIQASL
AENDASKLIL NAANEEQMKD YEHRQWRTKQ AVSKAERDRK KRMEAMEAER RRKLEPQDLS
DDDDAKTKAS DAKDVDFEIS REVRVKKQPT KTRQTQIFSG SHWNKAREKE PVREETRDEK
PFEPFAGPSY KIGRHTRARA PSPSADWTSG TTRPKYGNAN LLKRAERSRA KQNGEEERSA
ADDESPGQAK AKLNDANEKL SERIKGMTSR VFSTK