Gene OSTLU_16772 details


Gene Information

Locus tag: OSTLU_16772
Symbol:
ID: 5003554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 314827
End bp: 315879
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table:
GC content: 57%
IMG OID: 640418975
Product: predicted protein
Protein accession: XP_001419554
Protein GI: 145350309
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG5187] 26S proteasome regulatory complex component, contains PCI domain
TIGRFAM ID:
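
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 314827
end_bp = 315879

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS, with one terminal stop codon that is
# not counted as an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1053 350
```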


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00903198
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCGATCG TCGAGCGACG GAACATGGCG GCGACGTACG CGACGCTGTG CGAGCGGTTC 
GCGTGGCGCG CGGACGCGGC GCTGACGGCG AAGATGGCGG ACGCGAACGC GAAGCGGTTG
GGAGAGATCG AAGCGAAGAT CGAAGACGCG AAGGAAAATC TCGGGGACGT GGAGATACGG
GACGCGATGT GCGAGCGGGC GGAGTTTTAC GCGAGCGTGG GGGAGATGGA GAAGAGCGAA
CGGGCGTACG AGGAGACGGA GGCGAAGACG GCGTCGATCG GGCAAAAGAT GGACTGCGCG
TTCGCGCTGA TGCGCGCGCG GTTTTCGAGG TTGGAGCTGC ACGAGGTGAA GAAATTGATC
GAAAAGATTA AGGATATGCT GGATCAACCG GGCGGGGGGG ATTGGGAACG AAAGAACCGG
TTGAAGGTGT ACGAAGGGTT GCACGCGGTG GCGACGAGGA ATTTTGAAAC GGCGACAAAG
CTGTTTTTGG ACTCGTTGAG CACGTTCACG TCGTACGAGC TGTTGTCGTA CGATGATTTC
GTGTTCTACA CCGTCGTCTG CGCCGTGGTT TCGCTGCCGC GCACCGAACT CAAGGCGAAG
GTGATCGATT CGCCCGAGGT GTTGAGCGTG CTCAACCGAT TGCCCGGTCT CGGCGACTTT
TTAAACGCCC TGCACAAGTG TGATTATCGC ACCTTCATGT CGGCGTTCCC CGTCGTCGCC
GCGCAAGTGG AAAAGTCGGT GTGGATGAAT CCCCATTTCA GGTACTTTTT GCGCGAGGTG
CGCGTCGTGG CGTACGCGCA ATATTTGCAA AGCTACAAGA GCGTGACGGT GAAGAGCATG
GCGGATAGTT TCAACGTGAG CGAAGACTTT ATCGATCGCG AGCTCTCGCA CTTCATCGTT
TCTGGTAGAT TGAACTGCAA GATTGACAAG GTTTCGGGGG TGTTACAAAC CAACCGACCG
GACTTGAAGA ACTCGCTGTA CCAAACTTTG ATCAAGGACG GCGACGCACT GTTGAACAAC
GTATCCAAAC TCTCTCGAGT GATAGATCTT TAG
 
Protein sequence
MAIVERRNMA ATYATLCERF AWRADAALTA KMADANAKRL GEIEAKIEDA KENLGDVEIR 
DAMCERAEFY ASVGEMEKSE RAYEETEAKT ASIGQKMDCA FALMRARFSR LELHEVKKLI
EKIKDMLDQP GGGDWERKNR LKVYEGLHAV ATRNFETATK LFLDSLSTFT SYELLSYDDF
VFYTVVCAVV SLPRTELKAK VIDSPEVLSV LNRLPGLGDF LNALHKCDYR TFMSAFPVVA
AQVEKSVWMN PHFRYFLREV RVVAYAQYLQ SYKSVTVKSM ADSFNVSEDF IDRELSHFIV
SGRLNCKIDK VSGVLQTNRP DLKNSLYQTL IKDGDALLNN VSKLSRVIDL
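
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. The helper names `join_blocks` and `gc_percent` are hypothetical, and whether the page derived its 57% figure in exactly this way is an assumption.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final line of the gene sequence, used here as a short input.
last_line = "GTATCCAAAC TCTCTCGAGT GATAGATCTT TAG"
seq = join_blocks(last_line)

# The joined fragment is 33 bases long and ends with the stop codon TAG.
print(len(seq), seq[-3:])
```

Passing the full gene sequence through `join_blocks` and then `gc_percent` gives a value that can be compared with the GC content field, and the joined length can be compared with the Gene Length field.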