Gene OSTLU_34331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34331 
Symbol 
ID5000620 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp545382 
End bp546704 
Gene Length1323 bp 
Protein Length400 aa 
Translation table 
GC content56% 
IMG OID640416041 
Productpredicted protein 
Protein accessionXP_001416980 
Protein GI145344936 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1222] ATP-dependent 26S proteasome regulatory subunit 
TIGRFAM ID[TIGR01242] 26S proteasome subunit P45 family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.926825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGC CGCGGGCGCC GCCGCCGGAT CCGCGGAAGG AGGCGTTGAG GAAGTACGCC 
GGGTTGTTGC TGCAGCACAA GGTGAGATCG CGCGCGACGG AGAGACCGAA GGATGGTTCG
TGGACGCGAA CGGTGGTGCG AGGCGTCGTA TTCGAGCGAG TGACTGACGG TTTATTGTCG
TGATGACGCG CGATGACGTA GGAATTGGAT GCGAGAGTGC GAGCGTCGAG GTTCGAGTTG
ATTGACATTC GAAAGAGGTT TGACAAGACG GAGGACGATT TGAAGGCGTT GCAGTCGATG
GGGCAGATTA TCGGGGAGGT TTTGAGACAG TTGGACGAAG ACAGATTCAT CGTGAAGGCG
AGCTCGGGGC CGAGATACGT CGTTGGGTGC AGAACGAAGC TCGATAAGAG CAAGCTAGTG
AACGGGACGC GGGTGACGCT CGACATGACG ACGTTAACCA TCATGCGCGC GCTGCCGCGA
GAGGTGGATC CGTTAGTGTT TAACATGCTC AGCGAGTCCA CGGGGCACGT GGACTATAGC
TCTATCGGTG GGCTCGGGGA GCAGATTCGA GCGCTTAGAG AGTCGATCGA GTTGCCGTTG
ATGAATCCCG AACTTTTTGT GCGCGTGGGC ATCGCGCCGC CGAAGGGCGT ATTGCTCTAC
GGACCGCCAG GGACGGGGAA GACGCTCCTC GCCAAGGCGA TCGCGAGTAA CATCGACGCA
AACTTTTTGA AGATTGTTTC TAGCGCTATA GTGGATAAGT ATATCGGCGA GTCCGCGAGA
TTGATCAGAG AGATGTTCGG TTACGCTCGG GACCACGAGC CGTGCATCAT CTTCATGGAC
GAAATCGACG CCATCGGTGG CAAGCGCTTT TCCGAAGGCA CGTCGGCCGA TCGCGAGATT
CAACGTACAC TCATGGAACT TTTGAATCAG CTCGATGGTT TCGACGTTCT TGGCAAGGTC
AAGATGATCA TGGCGACGAA CAGACCCGAT GTGTTGGACC CGGCGTTGAT GCGCCCTGGT
CGTCTCGACA GAAAGATTGA AATCCCGCTT CCGAACGAGC AAGGTCGCGT GGAGGTTTTG
AAAATTCACG CGCAAAAGTT GAACAAAGAG GGTGAAATCG ATTACGAGTC TATCTCCAAG
ATTGCCGAAG AATTCAACGC CGCCGACATG CGCAACGTGT GCACGGAGGC GGGAATGTTC
GCCATTCGCG ACGACCGCGA TTACTGCGTT CAGGACGATT TCATGAAAGC CGTCCGCAAG
CTCGTGGAGG CGAAGAAATT GGAACCCGCC GCGTCCTACG ACAGCTCTTT CAAGAACGAG
TGA
 
Protein sequence
MTTPRAPPPD PRKEALRKYA GLLLQHKELD ARVRASRFEL IDIRKRFDKT EDDLKALQSM 
GQIIGEVLRQ LDEDRFIVKA SSGPRYVVGC RTKLDKSKLV NGTRVTLDMT TLTIMRALPR
EVDPLVFNML SESTGHVDYS SIGGLGEQIR ALRESIELPL MNPELFVRVG IAPPKGVLLY
GPPGTGKTLL AKAIASNIDA NFLKIVSSAI VDKYIGESAR LIREMFGYAR DHEPCIIFMD
EIDAIGGKRF SEGTSADREI QRTLMELLNQ LDGFDVLGKV KMIMATNRPD VLDPALMRPG
RLDRKIEIPL PNEQGRVEVL KIHAQKLNKE GEIDYESISK IAEEFNAADM RNVCTEAGMF
AIRDDRDYCV QDDFMKAVRK LVEAKKLEPA ASYDSSFKNE