Gene OSTLU_10034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_10034 
Symbol 
ID5000597 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp159108 
End bp161735 
Gene Length2628 bp 
Protein Length826 aa 
Translation table 
GC content62% 
IMG OID640416018 
Productpredicted protein 
Protein accessionXP_001416860 
Protein GI145344690 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGGAGGTGC TCGCGCCGGC GGGCGGATGG CCGCAGTTGC GCGCGGCGAT CGAGACCGGC 
GCGGATTGCG TGTACTTTGG GCTCGACGCG CTGAACGCGC GCGCGCGGGC GAGTAATTTT
ACGCTGGATG AGCTGGATGA GGTGATGGCG TACGTGAAGT CGCGCGGGCG GCGAGCGTAC
GTGACGATGA ACGTGTTGGT GTTCGACGAG GAGTTGCGGT TGGCGGAAAC GCTGATTCGA
GGCGTGGCGC GCGCGGGCGT GGACGCGGTG ATCGTGCAGG ACGTGGGGGT GACGCGGTTG
ATTCGGAAGA CGGCGCCGAA TTTGCCCATA CACGGGTCGA CGCAGATGAG CGTGACGAGC
GCGGAGGGCG CGCGCTTCGC GCGAGAGCTC GGGTGCAAGC GCGTCGTCGT CGGTCGAGAG
TTGAGCGTGA CGGACATCGC CGCGGTGAAA GCGAGCTGTC CGGACGTCGA GGTCGAGGCG
TTCGTGCACG GGGCGATGTG CGTGAGCTAC AGTGGACAGT GCTTTTCGAG CGAAGCCTGG
GGCGGTCGGT CGGCGAATCG CGGTCAGTGC GCGCAGGCGT GTCGAATGCC GTACGGATTG
GTCGTCGACG GCGAGCTGCG CGAGATGGGG GATGTAAAAT ATTTGTTGTC ACCGCAAGAT
TTAATGGCGG TCGAGCTCGT GCCCGAGCTC ATCGAATCCG GCGTGGGGTG CTTTAAAATC
GAAGGACGAT TGAAAGGTCC AGAGTACGTC GCCATCGCCA CGCAGGCGTA TCGCCGCGCG
GTGGACTTGG CGTGGGACGC CATGCAGAGC GGACGCGACG TGAAGCGCTC CGAATTGCTC
TCGAAAGAGC AGCGACTGGA ACTCACGCAA GTGTTCGCGC GCGGCCAAGA CGCGGATTTC
GATGGATTGA CGCGCGGTTT CCTGGAGGGA CCGCGGCATC AAAATCTCGT TCGCGGACGA
GCCCCAAGGC ATCGCGGCGT CTTGCTCGGA GAAATCGTGC GCGTGATCAA GCCGGACGGC
AGACGCTCGA ACGGTGAGAT AGTCGTACGC ACGAGCGACG GCGCGGCGAT GAAGCGCGGC
GATGGCGTCG TCATCGATCG AGGCGAGCCG CAAGAGAAGG AGGAGGGCGG TCGCGTCTAC
GAAATTTTTG ATCGCAATCG AGCGCTCGTG GGCGGTAAAG ACGACGACCG CATCACCGCG
GGCGAGTACA CCGTTACGTT CGGTGCGAAC CAAATCGATT TCAACCGGGT CGAGGTTGGA
CAGCTAGTGT GGAGAACATC CGACCCGACG CTCGAGGCCA AGCTCGCGAA GATGGTTCCC
GCGGAGGCCG AGCCGAGTCG CGACGGTCGT CGAGATCCGT GCACCGTCGT CGCGAGTGGT
GCGATCGGTG AACCGCTAGT CATCAAAATC ATCGATGATG CCGGACGATT CGGAGAAGCT
CGCACTGGTT CTGTGTTGGA GAAGAGTCAA AACAAGGCGC TCGACGAAAA TTCGTTGAAG
AAAGCCATCG GCGAACTCGG CGGAACGCCG CTCGTGGTGG CAAGTGTGGA TGTCTCCACC
CTTCGAGGTA TCGCAGACGT CGACGCGCAG GCTACGGACG GACTTTACAT ATCACCGGGC
GAGATCAAAG CTGCGCGGCG GGACGCAGTC GAGTCGCTCC TGCGAGCGCG GCGAGACGGT
GGCCCGAATC GAGCCGAAGG TATGGCGGTG CGCGAAGTCG TCGACGAACT CGTCGGCAAC
GCATGGTGGG GCGATATCAT CGATCAACGC GATAAAGGAG CGAGCAAAGA ACCGCGCGTG
AATCCTACGA GCGGGTTGCC GTCCATCTCT GTGCTCTGTC GCACGCGCGC GCAAGCCGAA
GCTGCGGTGA CTGTCGATGG CGTGGACGAA ATCGCTTTGG ACTTTTTAGA AGTTCACGGA
TTGCAAAAAA CGGTTCGTCT CGTTCAAGCC GCGGGGAAAA CAGCCGTGGT CGCCACGCCG
CGCGTGCTGA AGCCAGAAGA AGAGCAACTG TGGCGATACT ATCTCAATCT AGGCGCCGAC
GCCCTGCTCG TGCGATCGGC GGGTTTATTA CAGACACTGC TCGAACTCGG GCGCTCGGAC
GCCGATGTCG TCATTCCACC GCTTCGTGGT GACTTTTCAT TAAACGCCGC CAACGCAATC
GGTGCGGACG TCTTTCTGAG GTCAGGACTC GAGCGCTTGA CGCCAACGCA TGACTTGAAC
GCCGAGCAGC AACGCGCGCT CGCAGTCGCC CTCGGTCCCG CGGGCGCGTC CAAGCTCGAA
GTCATCGTCC ATCAGCATTT GCCAATCTTC CATACCGAAC ACTGCGTGTT CTGTCGATTC
ATGTCCGACG GAAACTCGTA CAAAGATTGT GGCCATCCAT GCGAGAGCAA GCACTTGCAC
TTGCGAGACG ACCGCGGTGC CGACCATCTT GTCCTCGCAG ACATGGGGTG CCGAAACACC
GTCTTCAACG CTCAGGCGCA AACGGGCGCG GAGTACCTGA AAACATTCAT CGACGCCGGC
ATTTGCCACT TCCGCGTCGA GCTCGTGGAC GAGCCCGCGA GTGTGGTCCC CGAGCTTCTA
TCGCGTTATC GCGACCTAGC GAACGGGCGC ACGTCCTCGA GCGAACTC
 
Protein sequence
PEVLAPAGGW PQLRAAIETG ADCVYFGLDA LNARARASNF TLDELDEVMA YVKSRGRRAY 
VTMNVLVFDE ELRLAETLIR GVARAGVDAV IVQDVGVTRL IRKTAPNLPI HGSTQMSVTS
AEGARFAREL GCKRVVVGRE LSVTDIAAVK ASCPDVEVEA FVHGAMCVSY SGQCFSSEAW
GGRSANRGQC AQACRMPYGL VVDGELREMG DVKYLLSPQD LMAVELVPEL IESGVGCFKI
EGRLKGPEYV AIATQAYRRA VDLAWDAMQS GRDVKRSELL SKEQRLELTQ VFARGQDADF
DGLTRGFLEG PRHQNLVRGR APRHRGVLLG EIVRVIKPDG RRSNGEIVVR TSDGAAMKRG
DGVVIDRGEP QEKEEGGRVY EIFDRNRALY TVTFGANQID FNRVEVGQLV WRTSDPTLEA
KLAKMVPAEA EPSRDGRRDP CTVVASGAIG EPLVIKIIDD AGRFGEARTG SVLEKSQNKA
LDENSLKKAI GELGGTPLVV ANVDAQATDG LYISPGEIKA ARRDAVESLL RARRDGGPNR
AEGASKEPRV NPTSGLPSIS VLCRTRAQAE AAVTVDGVDE IALDFLEVHG LQKTVRLVQA
AGKTAVVATP RVLKPEEEQL WRYYLNLGAD ALLVRSAGLL QTLLELGRSD ADVVIPPLRG
DFSLNAANAI GADVFLRSGL ERLTPTHDLN AEQQRALAVA LGPAGASKLE VIVHQHLPIF
HTEHCVFCRF MSDGNSYKDC GHPCESKHLH LRDDRGADHL VLADMGCRNT VFNAQAQTGA
EYLKTFIDAG ICHFRVELVD EPASVVPELL SRYRDLANGR TSSSEL