Gene OSTLU_39213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_39213 
Symbol 
ID5004678 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp128903 
End bp132033 
Gene Length3131 bp 
Protein Length963 aa 
Translation table 
GC content59% 
IMG OID640420099 
Productpredicted protein 
Protein accessionXP_001420593 
Protein GI145352527 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5116] 26S proteasome regulatory complex component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGA GCCGTCCCCC CGCGCCGTCC GCGGCGACCG CGCTGCTGTC GCTGCTCGAA 
GAGCCGCGCG CGGCGCTCCG CGCGCACGCC CTGCGCGCGC TGCACGACGT CGTGGACCGC
GAGTGGAGCG CGGTGGCGTC GAGCGTCGCC GCGATCGAGG CGCTGTACGA GGACGAGACG
TTCGAGGCGC GCGAAGACGC GGCGCTGTTG GCGAGCAAGG TGCGCGCGAG GGGCGAAAAA
GAAAGCGCGA AGCGACGGCG TGGATGGGAA CGAGACGCGG ATCTGGTTGC GCGCGACGAA
GACCGCGCGA GACCGGACGA CGGTCTCGCG ACGGCGCGCC CGCGATGAAC GCGATGGACG
TAGAAGACTG ACGCGCGAGC GCTCGTTTCG ACGCGTAGGT GTTTTATCAT TTGGGCGAGC
TGAACGACGC GTTGCACTAC GCGCTTCGCG CGGGGGATCG CTTCGACGTG AACGAAGGGA
GCGACTACGC GCAAACGCTG ATCGCGACGG CGATCGATGA ATACGTGGCG AAGAGACAGA
GCTTAGATAT GGATGTGACG CTCGGTGATG CGTCGAAAGA TGGGATCGAT TCCAAGTTGG
TGGACGTCGT CGAGCGAATG TTTGAAAGCT GCCTGAGCGA CGGGGAGCAC TTTCAAGCCA
TCGGCATCGC GCTCGAGAGT AAGCGCTTGG ATAAGCTCGA AGAAGCGATC ACGCGCTCGA
ACACCGTGGC CGAGTGCTTG AGCTATTCCA TGAAGGTGTG CGCGAGCTTG GTGTCGGTGC
GAGAGTTCAG GCAGCACGTC TTGCGGTTGC TTGCGAGAAT GTACTCGTCG CTGCGAGAAC
CAAATTTTTT GAGCATGTGT CAGTGTTTGA TGTTGCTCGA GGACGCCCAC GGCATCGCGG
AGGTGCTGAA TAAACTCGTC GCTGGCGACG AAGCGGAGCA GTTATTGGCG TACCAGATTG
CGTTTGCGTT GTTTGAGAAC GATATTCAAC CTTTCTTGAA TCGCGTGCAC GACGCCGTCG
GGGACGCAGA GGGCGCGGCA ATCGTCTCGG GAGAGCACAA GAATGGCGTC TCAGCAGCGG
AGACCGATGA ATCGCCGTTA GAGAAGTTGC GGAGCATTTT GAGTGGTGAA CGACCGTTCG
CGCTCTACTT GGAATTTTTG TACTCGCACA ATCACGCCGA CTTGCTCTTG TTGAAGCAAG
TGAAAACCGC TGTGGAGTCG CGAAACTCGG TGTGCCATTC TGCCACGGTG CTTGCGAACG
CTTTGACGCA CGCCGGAACG ACGTGTGACA AATTTTTGCG CGAGAACTTG GATTGGTTGA
GCAGAGCGAC GAACTGGGCT CGATTCAGCG CCACGGCGGG CGTCGGCGTC ATTCATCGCG
GGCGCACAAA GGATTCGCGA CAGTTGCTTT CGCAGCTTCT CCCTGCGCCG AGTCCGTACA
CGCTTGGTGG TGCGCTATAT GCCATGGGGT TGATTCATAC TGGTCAACCC GGAGACGCTT
TGCCTTTTCT TCTCGAGCGC GCTCGTGGAA ATAACAACGA GGTCATCCAG CACGGCGCGT
GCCTCGGCTT GGGTCTCGCC GCGGTGGGCA CTGGCAACGC TCAGGTGGAC GGGGAGTTGT
TCAGAATTTT GCGAACTGAT AGTGCAGTTG CGGGTGAGGC TGCGGGTATA GGTTTAGGCT
TGTTGTACGC GGGTTCGTGC ACGCCTCGAG CGAAGGAGAT TCATCAGTAC TGCGGCAAGA
CGAGCCACGG GAAAATCGTT AGAGGTTGTT CGCTTGGCAT GGCGCTCACC GTGTACGGTC
GAGAAGAAGG TGGTGATGAT TTGATCGAGT CGATGATCCA CGATGGGGAT AAAATCATGC
GTTATGGTGG CTGTCTTGCC CTCGCGTCGG CGTATGCTGG CACAGGAAAC AACAACGCCC
TTCGTAAGCT GTTGCACACC GCCGTGTCAG ACGTGTCGGA CGACGTTCGG CGATCCGCCG
TAATGTCGTT GGGTTTTGTC CTGTGCTCCA CTCCCGAGCA GTGCCCTCGA GTCGTGGCTC
TGTTAGCAGA ATCTTACAAC CCGCACGTGC GATATGGCGC GGCTATGGCG GTCGGCATCG
CGTGCGCGGG CACTGGGCTC GCCGACGCAA TTGCACTTCT CGATCCGATG ATGAACGATC
AAGTTGATTT CGTTCAGCAA GGTGCTTTGA TCGCGATGGC GATGGTGCGC ATTCAGCAGA
CTGAAAAGCA GCTGGCTCCG TTTCGAAAGA AGGTAATGGG TCACATTCAA GAGACGCACG
AGACGACGAT GTGCAAGATG GGCGCCATCA TGGCGCTGGG CATTTTAGAC GCCGGCGGCA
GAAACGTCAC CATCGGCTTA CGCTCGCGTT CTGGTCGTCC ACGAATGACT TCTGTCTTGG
GTATGCTCGT GTTCACGCAG TATTGGTACT GGTACCCACT TTCTTACTTC CTGTCGCTCG
TGTTTGTTCC CACCGCTTTC ATCGCCGTCG ATCGCACGCT CGCGATGCCG CACTGCTCTG
TGACGTCGCA CTGCAAGCCT TCGACGTTTG CCTACGCGGC ACCGGTGACG GAGGATGACA
AGAAGAATAG TGGTGAAATC GTCAAGGCGG TTCTCAGCAC GACGGCGAAG GCAAAGGCGA
AGGCGGATAA GAAGAAGGCT GAAGCCGAGG GCGCCGAGGG CATGGACGTC GACGGCGCCG
CTGCTGTCAA AACGGACGAA AAGACCTCAA AGACCACCAC TGAAGACGCG ATGGACGTGG
AGATGACCAA GGATGATGCC GGTGAAGCGA AGAAAGAGGA TGAAGAAGGC GAGAAGAAGG
ATGACAAACC CGAACCGACG TCGGAAGAGT TGACCAATCC GAGCCGCGTC ACGCCGGCGC
AAGAAAAGGC GGTTCGCTTT GATCAAAGCA GCCGTTTCGT CCCGATCGCC GCGCCCGCGG
GCACGTTCAA GTATCCCACT AGAGGTTTTG TCGTCCTTCG CGACACCGAT CCGGACGAAG
AGATCGCTTA CCTGGACGCT CAATTCAAGC CCGTGGCGCC GCCCGTCCCG GCGGACGCCG
CCGACGACGA CGAACCACCG CCGCCGGAAG ATTTCGAACT CGATCCCGAA GACGAAGCGC
GAGGATTCTA G
 
Protein sequence
MAPSRPPAPS AATALLSLLE EPRAALRAHA LRALHDVVDR EWSAVASSVA AIEALYEDET 
FEAREDAALL ASKVFYHLGE LNDALHYALR AGDRFDVNEG SDYAQTLIAT AIDEYVAKRQ
SLDMDVTLGD ASKDGIDSKL VDVVERMFES CLSDGEHFQA IGIALESKRL DKLEEAITRS
NTVAECLSYS MKVCASLVSV REFRQHVLRL LARMYSSLRE PNFLSMCQCL MLLEDAHGIA
EVLNKLVAGD EAEQLLAYQI AFALFENDIQ PFLNRVHDAV GDAEGAAIVS GEHKNGVSAA
ETDESPLEKL RSILSGERPF ALYLEFLYSH NHADLLLLKQ VKTAVESRNS VCHSATVLAN
ALTHAGTTCD KFLRENLDWL SRATNWARFS ATAGVGVIHR GRTKDSRQLL SQLLPAPSPY
TLGGALYAMG LIHTGQPGDA LPFLLERARG NNNEVIQHGA CLGLGLAAVG TGNAQVDGEL
FRILRTDSAV AGEAAGIGLG LLYAGSCTPR AKEIHQYCGK TSHGKIVRGC SLGMALTVYG
REEGGDDLIE SMIHDGDKIM RYGGCLALAS AYAGTGNNNA LRKLLHTAVS DVSDDVRRSA
VMSLGFVLCS TPEQCPRVVA LLAESYNPHV RYGAAMAVGI ACAGTGLADA IALLDPMMND
QVDFVQQGAL IAMAMVRIQQ TEKQLAPFRK KVMGHIQETH ETTMCKMGAI MALGILDAGG
RNVTIGLRSR SGRPRMTSVL GMLVFTQYWY WYPLSYFLSL VFVPTAFIAV DRTLAMPHCS
VTSHCKPSTF AYAAPVTEDD KKNSGEIVKA VLSTTAKAKA KADKKKAEAE GAEGMDVDGA
AAVKTDEKTS KTTTEDAMDK DDKPEPTSEE LTNPSRVTPA QEKAVRFDQS SRFVPIAAPA
GTFKYPTRGF VVLRDTDPDE EIAYLDAQFK PVAPPVPADA ADDDEPPPPE DFELDPEDEA
RGF