Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_32797 |
Symbol | |
ID | 5002791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 654709 |
End bp | 659177 |
Gene Length | 4469 bp |
Protein Length | 1479 aa |
Translation table | |
GC content | 52% |
IMG OID | 640418212 |
Product | predicted protein |
Protein accession | XP_001418766 |
Protein GI | 145348666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.480119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.167383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCG CGAGGTACCG TCGCGCGCGT CGACGCGTCG ACGACCGGGA CAGCGGCCGC GGCCGTGGTG TCGTCGCGCT GTGCGCGGTC GTCGTCGCGC TCGCGCGGGT GAGAACGAGC GGTGGTGAAG CGATATGGGG GTTAGAGACC GTGTACGATG GTTTGGGGTA TAATCCAGAG TTCCCGACGC AGGTTTGCGG CCCGTCCAAG GATGGGAACT CTTTTCGCGC CGCGGGTTGC AAACCGGCGA TTACAGAGAA GGATTCGTAC GGCACCGCGG GTGATCCGGG ATTTAGTTTC GCGAGGTACG CGTATAAAGG AAAATCATAC GGGCCACCAT ATAATAATGT AGCGGACGTG GCGGCGTGTA CGTACTCGTT GACGTACGCC GCGGCGGAGG CGGCGGGCCT GGAGCACTGC TTTTACGCTC CGTACACGGG TGGCGTGCCG ACGACGACGG TGCAGAAGAC GACGACGAAC TTGGGATACT CCACTCTAGG TTGGCGTCTA CCCTTGGCTG AATGGCGAGA TAAGTTGCCG GATTACGTGG TGGAACGATA CGAATTACCG CCGCTTCGGT TGGGCTTCGA GGAGGGGACG TTTTGGCCGA ATTGGAAGCT ACTCTCGAGC GGTTTGCCAA ACTTTGCCGT GACGAAGACA TGTTTTCTTG AAGCGCACAG CGGAGACTAT CAGCTGTGTA GCGCCTATCC GACGGAGCGT CGCGAAGATA TTCACGGCGT CCTTCAAGTG CAAAGCATCT CATTCGCACT CGGGCGTGGG AGCTTGTTCT TTGAGGCAAA CGGCGGAGAT AAAAACGTCC CAGACATACC TCAAGACCCG TATCCCATAG CTTCGCACGG GAAAGGTGCG CTTGGAGTTG CGTTAACGAG GATTAGCGAT GGGTATCGAG TGCTGACAAA ACCTGTGCGC GCGAGAAGAT ATTGGGAAAC CTTACGTTGG GAGGAGTCTT TACTCGCACA GTATTACGGG GAGGTTTTTA GGTTGGAGAT TTTCGATTAT CGTTCGGGTT CGTTCGGTTG GCTCGCGGTG GATTCTTTCG TAATTCCGCA GGCGCCTGTT ATCATCACGG AGGTGACTCC ATCGATTGGA CCTCGAGTCG GTGGCACGAG AATCACTATC AGCGGGCAAA ACTTTGGAAG TTCCGTTGAC GACAAAACCG TGTTCATCGG CGATAAAGAA TGCATCGACT TACGCATGAG TTATTCTCGT TGTTCGTCCG ATACCACCGT TTGTGCTGGT GCGTTGAGCT GTACTACACC AGCAGGTTCC GGGGTAGGTC TCACAGTAGC GGTGGTGATT GGAGACCCAA CCGTCGTCCG CATCGCAGGG CCGCGAGGTG GGTTCCAAGC GGGTTTTTGC GGCGATGAAT CGACCGAGCA TCCCTTCTCC GAATGTGCAA TAGATGCAAT CGATGCTGTC GCTGGAGCGC GAAAGCGTGG CTTTACTTAC GCTGATCCTC CGACGGTGAC AAGTACGCCA GTCACGAGTG CGATCCAAGA CATTCAGTAC GAGTACACGC TCATTGCTAC GGACCCAGAC GAGAACGACT TATTGACGTA TACGGCAATT ACGTTACCAA CGTTTTTGCA GTTTGATCCA ACGACTCGTC TACTTACTGG CGTGCCATTA CGGTCAGATG TACAATGTAG GAGCAACAAT TGGCACCCAG CTTCTGAGCG CTGCGCGTTG GGAGCAACGT ATCAAGTTGA GTTTGAGATC TCGGACTCAA TATACAGAGT TCAGCACAAC TTTGAAGTGA AAGTGACGAC GAATGAGACC TCACTTCTTG TTGATAAGTC CTTTCACTGG GAAAACGCGT TACAAATCTT TGAAAAATAC GAAGCCGTCG TTGCTTTGGG GTCGGTTACA GAATCGCTCA AAGCCTTGGA AAATAGGAAC GTGACGAGTC CGACAGAAAT CGTTGGCTTC AATTATAGAA ATCCTTTGAA TGATGAGGAT AAAACGATCC GTAATGCCCT CATCGCTATT GCAGAGAAGC CGAATATCGA TGAAGTCGAC GTCGATGCGG CCATTCAGTG GCTGCAAACG AACAACTCTG ACGCCGTAAA CCTTGATATG ATCCAGGCGC TCAAAGAGCA AGTTGCGGTA CAGAAGCGAG GAGACGTATC CAGCGGTGCT TCGGCACCAG CGTTGCAAGG CGTCAGCTGG AAAGGCTGGT TAAATTATTT TAAAAAACTA GGAACGTCGC TCGGCGCTGG ATCTGCCGTT TACACAAATG TTGCTGGGCT CCCCGTTCCA GCCGGCGCAA CGAAAATGCT TACGCGGGAG AAGATAATTG TCATCAAACT ATCGACTCCT TGCAACTCGA CCGCGTTGAA CACGGACAGA TATATTCCAT CATGTGACGC GGTTGTGGGT ATGTCATCAC AGCACGAGGT CTGGCTCGGA TTCAGTGACT ACGGAACAAC GTTGGAGGAT GTCATCGTGT TCCCCGAGGA TTTTGAGTTT GCGGATCTTG TTACGTCGGC CGTCTTTACA CAAGACGTTC CAGGTTTTCT CGGTCAATTA ACTAGACGAA TGGTGGCGCA ATCAGCAAAG CTACAGGAAA TCGCAACATT GGGCGAAACA CACCTTGTAT GGTTCGATAC ATCCACGTCT ATGCTAACCG TGAACATGAC CATCAAGTCT TCACCGGTGA AAGCAGTCAT GCAACTTGCA ACTTCGTATC CTTATCCCGA ACCTGGAAAG TATGGATTAC CCGAGATTAG CGTTGTGAGC TGCGATAACG TGGCGTACGA CTATGAAAAC GCGCTCATTT CATTCAGCAA TACCATGACG AGAACTGGTG GATCTTTGAC CAGTCGTCTC GCAGCGTTGG CGAACGATGT CTCAACATTT GCGTCACGGT GCAAAAATTT ATGCAACGGT CAGGGTCTGT GCCAAGATTC GCAACTGCCA CCCGTTTGTC AGTGTTATCA GGACTACTTC GGCGATGATT GCTCGCAAGT AACGTGTCCG AATGATTGCA GTAATAATGG CGCCTGCGAT TCGAGGCAAG TATGTACGCA CTTCCCGGAA AGCGGCGAAA CAGACTGCTC CGGTGGAACT GGAAAGTGTG CGTGTAATTA CCCTTACTTT GGCAAGGATT GCTCGTTAAA AACATGTGCG AAGAATTTCA TCGTTTTTGA AGGTAATTCG ACGCACAAGC TCGACGTCAC GGCCGGAACG CAACCTATCA AGGACAAATA CACCGCGCTC GGTCTCGATG TGTATGACGT GAAGATTAGC GAAACCAATG GTGACCAACC TGCAGCTTGG GCAATTCAAG GAGACGAGCT TAGCTTGGCG CATCTTGTTG GATCGGCTTA CGTGGAATTT ACCAGCTCTG AAGACGGAGA AACGGCGAGA TCACTGGAGC CATTTTACGT TGACTCTATC CCCAGTCGTG GCGTCGCAAT CGAGGCACAA TATTTTGCGC AGGAGTACCG CACTTTCGTA AAGCGTGAGC AACAGCTACT AGATTTGTTC GGCAAAAGCT CATCAGAGTG CAACGCCGCC GGTGCTTGTG CCTTTGAAAC CGGTACGTGC TTCTGTGCGA CGCAGTATTA TGGGGCCGGT TGTGAATTTC AGTACTGCGC AAATGATTGT GCAGGTCACG GTACGTGCAA TAAGCTTACC GGTGTGTGTA CGTGCGAAAC ACACTACGTG ACAGATGAAA CATATGGATG CGCTTTGAAG GACTATAGTT TGATTTCAAC CACGTGCACT GACGAAGCAC TTGATAGAGA GGTTGATGAC GCTGATCTGC GCGTGCGTCC ATTACACGCA TCATGTCTCT TTGGTACAAG ATTGGGTAGC CCTGTGACTG GCACGACGAC GACTTATATC ACTGACTTAC AAGGTAATGT GTGTTTGGAT TGCTCTGGTT ATACTCTAGA GTCGAACTCC AGGATTTATT TTTACGAAGA CGAACCATGC ACTTCGTCTG CTGGGAACCG CGAAGACTTT TGCGACAACA CGAAAGAGTC CTACGACACC ATCGTTCGCG GCATTGGCAT GATACCCGCT GGAGACAGTG GGTCGACTCT TTCATTCAAC CTCGCCGCCA TGCGTACATC GGATATTCAG TTTTCTATTT TCAAGGCAAA GGCGGGGATC GTAAGAGAGC ATTTAGCTGA CATCGATGCG GGCGGTTGTG GTGCATGTAC GGACGATAAT CCCCAGTGCG GCGCCATATT CACTATCGCC GTCGATGGTG TTACCGTATG GACTTCGTTG GTGAACAACA AGTACTCCGA GATCAACATC GATATCAGCG CTGCCAGTAC CATGACACTC TCCACAGCAA AGTACACCCC GCCGTATTGG CGTTCCAACG TCTTGCCTGG TCTCGCAGGA CAAGACGCAC CTGCCGTTTG GTGCGATGGC GCCGCTTGGG CGGACGCAGA GTTCTATTAG CTTTTAGCCG AGTATTCACA CGAATGCCA
|
Protein sequence | MRAARYRRAR RRVDDRDSGR GRGVVALCAV VVALARVRTS GGEAIWGLET VYDGLGYNPE FPTQVCGPSK DGNSFRAAGC KPAITEKDSY GTAGDPGFSF ARYAYKGKSY GPPYNNVADV AACTYSLTYA AAEAAGLEHC FYAPYTGGVP TTTVQKTTTN LGYSTLGWRL PLAEWRDKLP DYVVERYELP PLRLGFEEGT FWPNWKLLSS GLPNFAVTKT CFLEAHSGDY QLCSAYPTER REDIHGVLQV QSISFALGRG SLFFEANGGD KNVPDIPQDP YPIASHGKGA LGVALTRISD GYRVLTKPVR ARRYWETLRW EESLLAQYYG EVFRLEIFDY RSGSFGWLAV DSFVIPQAPV IITEVTPSIG PRVGGTRITI SGQNFGSSVD DKTVFIGDKE CIDLRMSYSR CSSDTTVCAG ALSCTTPAGS GVGLTVAVVI GDPTVVRIAG PRGGFQAGFC GDESTEHPFS ECAIDAIDAV AGARKRGFTY ADPPTVTSTP VTSAIQDIQY EYTLIATDPD ENDLLTYTAI TLPTFLQFDP TTRLLTGVPL RSDVQCRSNN WHPASERCAL GATYQVEFEI SDSIYRVQHN FEVKVTTNET SLLVDKSFHW ENALQIFEKY EAVVALGSVT ESLKALENRN VTSPTEIVGF NYRNPLNDED KTIRNALIAI AEKPNIDEVD VDAAIQWLQT NNSDAVNLDM IQALKEQVAV QKRGDVSSGA SAPALQGVSW KGWLNYFKKL GTSLGAGSAV YTNVAGLPVP AGATKMLTRE KIIVIKLSTP CNSTALNTDR YIPSCDAVVG MSSQHEVWLG FSDYGTTLED VIVFPEDFEF ADLVTSAVFT QDVPGFLGQL TRRMVAQSAK LQEIATLGET HLVWFDTSTS MLTVNMTIKS SPVKAVMQLA TSYPYPEPGK YGLPEISVVS CDNVAYDYEN ALISFSNTMT RTGGSLTSRL AALANDVSTF ASRCKNLCNG QGLCQDSQLP PVCQCYQDYF GDDCSQVTCP NDCSNNGACD SRQVCTHFPE SGETDCSGGT GKCACNYPYF GKDCSLKTCA KNFIVFEGNS THKLDVTAGT QPIKDKYTAL GLDVYDVKIS ETNGDQPAAW AIQGDELSLA HLVGSAYVEF TSSEDGETAR SLEPFYVDSI PSRGVAIEAQ YFAQEYRTFV KREQQLLDLF GKSSSECNAA GACAFETGTC FCATQYYGAG CEFQYCANDC AGHGTCNKLT GVCTCETHYV TDETYGCALK DYSLISTTCT DEALDREVDD ADLRVRPLHA SCLFGTRLGS PVTGTTTTYI TDLQGNVCLD CSGYTLESNS RIYFYEDEPC TSSAGNREDF CDNTKESYDT IVRGIGMIPA GDSGSTLSFN LAAMRTSDIQ FSIFKAKAGI VREHLADIDA GGCGACTDDN PQCGAIFTIA VDGVTVWTSL VNNKYSEINI DISAASTMTL STAKYTPPYW RSNVLPGLAG QDAPAVWCDG AAWADAEFY
|
| |