Gene OSTLU_16088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16088 
Symbol 
ID5002631 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp283074 
End bp284510 
Gene Length1437 bp 
Protein Length478 aa 
Translation table 
GC content61% 
IMG OID640418052 
Productpredicted protein 
Protein accessionXP_001418661 
Protein GI145348449 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00877055 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTGCGAT ACGAGGCGCA CAACGCGGAT TCGGCGCACA AGTTTCAGTT GAAAGAGACG 
AAGTTTAGCG ATCTCACGGA GGAAGAATTC GCCGCGAGGG TGTTGACGTA TAAACCGAGG
CGACAGTTTG GTGAAACGAT GCTGGGGAAC TCGGAAGATG AGGTTAGCTC GACGTCCGCG
CGCGTGGGCT ACGAGGCGGC GGCGTTGAAG TCGCCCGCGG AGATCGCTGT GGCACGGAAG
CATTCGCAGC GCGCGCGCGA CCGCACTCAA AGGCGCGAGC GAAACCGGCT GCGTGGACAT
GTCGAAGACC ACGGAGAAGT CGACGATAGT GGACCGTTGG GTGATGCGGA CCCATCGATT
CCGGCCGCCT TTTCGTGGCG CACGCCGCCA GATGGATACG GAAACGTCGT CGGTGTGGTG
CACGATCAAG AGGACTTGTG CGCGTCGTGC TGGGCGTTTG TCACCGCGGA TTCCATCGCG
AGCCGAATCG CAATCATAAA CAAGGGCGAC GACGCCCCGG CGTTAAGCGT GAAGCAGCTC
ATGGCATGCG ATGCTGTTGA TCACGGGTGT TCGACCGGGA ACATGTACAC CGCGTACGAA
TGGATCGGGC AATACGGCGG TATCAGCTCC AAGGCGGATT ACAACGCGAA AGTACCAGGT
GACCGAGACG ACGCTCCGGA TGCCAAGTGC GACGCGTCCG TCAAAAAAGT TTACGATACG
CCGGCTATGT GTGATTTAGC GCAAGTTGCC GGCGAAGAAC CGCTTTATCG TGCGATCTTC
GAGCGAGGTC CCGTCGCCGT TGGCATCAAC GCAAACAAAC TGCAGGCATA CGGCAGCGGC
GTCATCATGT TGGATGACTG TAAGCCACTT GGTCGTGGTA TTGAGTCCAT CAATCACGCC
GCACTTGTCG TTGGGTGGGG CACGACGGAC GACGGGGTCA AATACTGGGA AATTAAAAAC
TCTTACGGCC CAGAGTGGGG CGACGAAGGA TTCTTCAGGC TCGAGCGCGG TCGCATCGGC
GAACACAAGT TCGGCACTTG CGGTCTTCTC TTTGAATCCG TCTACCCGGT CGTCACGAAG
GCTGGCGATG CGACGTCGAC TGACGCTCCG TGTGTGAAGG GATCAGTCCA AAAGCAAACG
TACTATAGAA ACGAGACGCT CAATCCGGGC TTGGGCGACG ATGACGTTGA GGACGAGGCG
CGCATCGGCG TCGCGCAGCG CCGCGCGCAC CGAGCCCACG GTCACGCACA CCGCGCGCGA
AAAGCTCGCC TAGGCGACGC CGCTTCGTCG CACCTGACGA CGCACACCGA AAACGTCGTC
GCCGCGGCCG CCGCTCTGGC CTCGATCGCC GTCCTCGTCG CCGCGGTCGC TCATCGCCGC
CGAGCGCGCC GGGACGCCGT CCCCGAATCC GCCGCGCTCC TCGCCGCCGA GCCTTGA
 
Protein sequence
MVRYEAHNAD SAHKFQLKET KFSDLTEEEF AARVLTYKPR RQFGETMLGN SEDEVSSTSA 
RVGYEAAALK SPAEIAVARK HSQRARDRTQ RRERNRLRGH VEDHGEVDDS GPLGDADPSI
PAAFSWRTPP DGYGNVVGVV HDQEDLCASC WAFVTADSIA SRIAIINKGD DAPALSVKQL
MACDAVDHGC STGNMYTAYE WIGQYGGISS KADYNAKVPG DRDDAPDAKC DASVKKVYDT
PAMCDLAQVA GEEPLYRAIF ERGPVAVGIN ANKLQAYGSG VIMLDDCKPL GRGIESINHA
ALVVGWGTTD DGVKYWEIKN SYGPEWGDEG FFRLERGRIG EHKFGTCGLL FESVYPVVTK
AGDATSTDAP CVKGSVQKQT YYRNETLNPG LGDDDVEDEA RIGVAQRRAH RAHGHAHRAR
KARLGDAASS HLTTHTENVV AAAAALASIA VLVAAVAHRR RARRDAVPES AALLAAEP