Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_39213 |
Symbol | |
ID | 5004678 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 128903 |
End bp | 132033 |
Gene Length | 3131 bp |
Protein Length | 963 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420099 |
Product | predicted protein |
Protein accession | XP_001420593 |
Protein GI | 145352527 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5116] 26S proteasome regulatory complex component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCGA GCCGTCCCCC CGCGCCGTCC GCGGCGACCG CGCTGCTGTC GCTGCTCGAA GAGCCGCGCG CGGCGCTCCG CGCGCACGCC CTGCGCGCGC TGCACGACGT CGTGGACCGC GAGTGGAGCG CGGTGGCGTC GAGCGTCGCC GCGATCGAGG CGCTGTACGA GGACGAGACG TTCGAGGCGC GCGAAGACGC GGCGCTGTTG GCGAGCAAGG TGCGCGCGAG GGGCGAAAAA GAAAGCGCGA AGCGACGGCG TGGATGGGAA CGAGACGCGG ATCTGGTTGC GCGCGACGAA GACCGCGCGA GACCGGACGA CGGTCTCGCG ACGGCGCGCC CGCGATGAAC GCGATGGACG TAGAAGACTG ACGCGCGAGC GCTCGTTTCG ACGCGTAGGT GTTTTATCAT TTGGGCGAGC TGAACGACGC GTTGCACTAC GCGCTTCGCG CGGGGGATCG CTTCGACGTG AACGAAGGGA GCGACTACGC GCAAACGCTG ATCGCGACGG CGATCGATGA ATACGTGGCG AAGAGACAGA GCTTAGATAT GGATGTGACG CTCGGTGATG CGTCGAAAGA TGGGATCGAT TCCAAGTTGG TGGACGTCGT CGAGCGAATG TTTGAAAGCT GCCTGAGCGA CGGGGAGCAC TTTCAAGCCA TCGGCATCGC GCTCGAGAGT AAGCGCTTGG ATAAGCTCGA AGAAGCGATC ACGCGCTCGA ACACCGTGGC CGAGTGCTTG AGCTATTCCA TGAAGGTGTG CGCGAGCTTG GTGTCGGTGC GAGAGTTCAG GCAGCACGTC TTGCGGTTGC TTGCGAGAAT GTACTCGTCG CTGCGAGAAC CAAATTTTTT GAGCATGTGT CAGTGTTTGA TGTTGCTCGA GGACGCCCAC GGCATCGCGG AGGTGCTGAA TAAACTCGTC GCTGGCGACG AAGCGGAGCA GTTATTGGCG TACCAGATTG CGTTTGCGTT GTTTGAGAAC GATATTCAAC CTTTCTTGAA TCGCGTGCAC GACGCCGTCG GGGACGCAGA GGGCGCGGCA ATCGTCTCGG GAGAGCACAA GAATGGCGTC TCAGCAGCGG AGACCGATGA ATCGCCGTTA GAGAAGTTGC GGAGCATTTT GAGTGGTGAA CGACCGTTCG CGCTCTACTT GGAATTTTTG TACTCGCACA ATCACGCCGA CTTGCTCTTG TTGAAGCAAG TGAAAACCGC TGTGGAGTCG CGAAACTCGG TGTGCCATTC TGCCACGGTG CTTGCGAACG CTTTGACGCA CGCCGGAACG ACGTGTGACA AATTTTTGCG CGAGAACTTG GATTGGTTGA GCAGAGCGAC GAACTGGGCT CGATTCAGCG CCACGGCGGG CGTCGGCGTC ATTCATCGCG GGCGCACAAA GGATTCGCGA CAGTTGCTTT CGCAGCTTCT CCCTGCGCCG AGTCCGTACA CGCTTGGTGG TGCGCTATAT GCCATGGGGT TGATTCATAC TGGTCAACCC GGAGACGCTT TGCCTTTTCT TCTCGAGCGC GCTCGTGGAA ATAACAACGA GGTCATCCAG CACGGCGCGT GCCTCGGCTT GGGTCTCGCC GCGGTGGGCA CTGGCAACGC TCAGGTGGAC GGGGAGTTGT TCAGAATTTT GCGAACTGAT AGTGCAGTTG CGGGTGAGGC TGCGGGTATA GGTTTAGGCT TGTTGTACGC GGGTTCGTGC ACGCCTCGAG CGAAGGAGAT TCATCAGTAC TGCGGCAAGA CGAGCCACGG GAAAATCGTT AGAGGTTGTT CGCTTGGCAT GGCGCTCACC GTGTACGGTC GAGAAGAAGG TGGTGATGAT TTGATCGAGT CGATGATCCA CGATGGGGAT AAAATCATGC GTTATGGTGG CTGTCTTGCC CTCGCGTCGG CGTATGCTGG CACAGGAAAC AACAACGCCC TTCGTAAGCT GTTGCACACC GCCGTGTCAG ACGTGTCGGA CGACGTTCGG CGATCCGCCG TAATGTCGTT GGGTTTTGTC CTGTGCTCCA CTCCCGAGCA GTGCCCTCGA GTCGTGGCTC TGTTAGCAGA ATCTTACAAC CCGCACGTGC GATATGGCGC GGCTATGGCG GTCGGCATCG CGTGCGCGGG CACTGGGCTC GCCGACGCAA TTGCACTTCT CGATCCGATG ATGAACGATC AAGTTGATTT CGTTCAGCAA GGTGCTTTGA TCGCGATGGC GATGGTGCGC ATTCAGCAGA CTGAAAAGCA GCTGGCTCCG TTTCGAAAGA AGGTAATGGG TCACATTCAA GAGACGCACG AGACGACGAT GTGCAAGATG GGCGCCATCA TGGCGCTGGG CATTTTAGAC GCCGGCGGCA GAAACGTCAC CATCGGCTTA CGCTCGCGTT CTGGTCGTCC ACGAATGACT TCTGTCTTGG GTATGCTCGT GTTCACGCAG TATTGGTACT GGTACCCACT TTCTTACTTC CTGTCGCTCG TGTTTGTTCC CACCGCTTTC ATCGCCGTCG ATCGCACGCT CGCGATGCCG CACTGCTCTG TGACGTCGCA CTGCAAGCCT TCGACGTTTG CCTACGCGGC ACCGGTGACG GAGGATGACA AGAAGAATAG TGGTGAAATC GTCAAGGCGG TTCTCAGCAC GACGGCGAAG GCAAAGGCGA AGGCGGATAA GAAGAAGGCT GAAGCCGAGG GCGCCGAGGG CATGGACGTC GACGGCGCCG CTGCTGTCAA AACGGACGAA AAGACCTCAA AGACCACCAC TGAAGACGCG ATGGACGTGG AGATGACCAA GGATGATGCC GGTGAAGCGA AGAAAGAGGA TGAAGAAGGC GAGAAGAAGG ATGACAAACC CGAACCGACG TCGGAAGAGT TGACCAATCC GAGCCGCGTC ACGCCGGCGC AAGAAAAGGC GGTTCGCTTT GATCAAAGCA GCCGTTTCGT CCCGATCGCC GCGCCCGCGG GCACGTTCAA GTATCCCACT AGAGGTTTTG TCGTCCTTCG CGACACCGAT CCGGACGAAG AGATCGCTTA CCTGGACGCT CAATTCAAGC CCGTGGCGCC GCCCGTCCCG GCGGACGCCG CCGACGACGA CGAACCACCG CCGCCGGAAG ATTTCGAACT CGATCCCGAA GACGAAGCGC GAGGATTCTA G
|
Protein sequence | MAPSRPPAPS AATALLSLLE EPRAALRAHA LRALHDVVDR EWSAVASSVA AIEALYEDET FEAREDAALL ASKVFYHLGE LNDALHYALR AGDRFDVNEG SDYAQTLIAT AIDEYVAKRQ SLDMDVTLGD ASKDGIDSKL VDVVERMFES CLSDGEHFQA IGIALESKRL DKLEEAITRS NTVAECLSYS MKVCASLVSV REFRQHVLRL LARMYSSLRE PNFLSMCQCL MLLEDAHGIA EVLNKLVAGD EAEQLLAYQI AFALFENDIQ PFLNRVHDAV GDAEGAAIVS GEHKNGVSAA ETDESPLEKL RSILSGERPF ALYLEFLYSH NHADLLLLKQ VKTAVESRNS VCHSATVLAN ALTHAGTTCD KFLRENLDWL SRATNWARFS ATAGVGVIHR GRTKDSRQLL SQLLPAPSPY TLGGALYAMG LIHTGQPGDA LPFLLERARG NNNEVIQHGA CLGLGLAAVG TGNAQVDGEL FRILRTDSAV AGEAAGIGLG LLYAGSCTPR AKEIHQYCGK TSHGKIVRGC SLGMALTVYG REEGGDDLIE SMIHDGDKIM RYGGCLALAS AYAGTGNNNA LRKLLHTAVS DVSDDVRRSA VMSLGFVLCS TPEQCPRVVA LLAESYNPHV RYGAAMAVGI ACAGTGLADA IALLDPMMND QVDFVQQGAL IAMAMVRIQQ TEKQLAPFRK KVMGHIQETH ETTMCKMGAI MALGILDAGG RNVTIGLRSR SGRPRMTSVL GMLVFTQYWY WYPLSYFLSL VFVPTAFIAV DRTLAMPHCS VTSHCKPSTF AYAAPVTEDD KKNSGEIVKA VLSTTAKAKA KADKKKAEAE GAEGMDVDGA AAVKTDEKTS KTTTEDAMDK DDKPEPTSEE LTNPSRVTPA QEKAVRFDQS SRFVPIAAPA GTFKYPTRGF VVLRDTDPDE EIAYLDAQFK PVAPPVPADA ADDDEPPPPE DFELDPEDEA RGF
|
| |