Gene Information |
Locus tag | OSTLU_31001 |
Symbol | |
ID | 5001156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 130685 |
End bp | 132856 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416577 |
Product | predicted protein |
Protein accession | XP_001417154 |
Protein GI | 145345302 |
COG category | [S] Function unknown |
COG ID | [COG5644] Uncharacterized conserved protein |
TIGRFAM ID | |
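The coordinate and length fields above are related by simple arithmetic, which the short sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes one terminal stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values copied from the Gene Information table.
length = gene_length_bp(130685, 132856)
print(length)                     # 2172
print(protein_length_aa(length))  # 723
```

Both results agree with the Gene Length (2172 bp) and Protein Length (723 aa) fields.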
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.565722 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGCG ATGACGAGGC GGAGGAGGCG GTGCGGGCGA AGGCGAAGTC GACGTCGAGG ACGAGGGCGA AACCGACGCG CGAGGTCGTC GAAGAGGAGG ACGTCGAGGC GTTGGTTTCG AGCGAAGACG ACGACGACGA AAAACGCGAG CGCGTGCTGG ATGAACTCGT CGGGGCGCGT AAACGAGTGA CGCTGGATCC TCGGTATCGA CCGGTGATCA CGGAGGTCGG GAGGGAGAAT CCGATGGCGG TGCCGCTTCG TCTTCGCGAG GGCGAGGAGA AGCTGACGCT GAAGGACGTC ATGGCGTCGC TCGGCGAAGA CGCGCTCGAT CGCGATACGG TGAAGCGGTT GAATAAGATT TCGAAGACTA AGGCGGTGGA TGCGCCTTTG GCGCGACCGA TTAAAGAGCG TATCGATCGC AAGGCGGCGT ACGCGGAGAC GAGCAAGGAC ATCGGCAAGT ATCAAGCCGT CGTCAAGGAA AATCGCGAAA AGCGAACGCT CAAGTTTGAG CCAGCGCGCA AGGAGATGCA ACGCAAGGAC ACGCTCGGTG CGCTCGCCGC AGATTTTACG CCGCGATCGG AGGTTGAGTT GGAAATCGCT CGCGTTTTGA AAGAGTCTGG GCACGCGAGC GCCAAAGACG TCGTGCGCGG CGAGTTGCTC GAGATGAACC ACTTGGACGT CGAGGACATA CAGGCGCGCC AAGCCAAGCT CGCGAAGATG CGTGCGGTTT TGTTTTATCA CGAGCAAAAG GCGAAAAGAC TGAAAAACAT CAAGTCTAAG GCGTTCCATA GGCACAATCG CAAGGGAGAA CTCAAGGTGA TCGGAGACGA GGATGACAGC GATTACGAAG GCGAGGGCGA CACCCCGGAG GAACGCAGAG AGTACCTTCG CGCGCAAGAG CGCATGCTGT TACGTCACAA AAACACGTCG AGGTGGGCGA AACGGGCGAT TAAAAAGGGT ATCGCCCATC TCGCCGGGAC GCGAGAAAAG TTGCAGGAGC AGCTGCGCAT CGGCCAACAA TTGAAGGAAA AGATTGAAGG CACTCGAACG ACGACGACGG AACTCGACGA AGAGAGCACG GACGCCGAAG ACTCTGAAGA CGAAGATCCC AACGACCCCG AAACCGAACG CAAGCGACGG CTCAGGGCGA AAGCCGCAGC GTTAAAGGCT TTGGAGGACG GCGGCGACGA CGATGTCGAG GGAGCGAACG ACAGCTTGTT CAAACTGCCC TTCATGGCGC GTGCGATGGA GAAACGCAAG TCGCAGACGA AAGCAGAGGC GCAAGAACTG CTTGATGAGC TCGATAGAAT GGAACAGAAC GGAGAGGAGC TCAGCGATAG CGACGATGAT CATGAAGAGT GGGACGCGAT GACAACGAAT GCGAAGCGCG CCGCCGACGC CCGAACAGCG TCTGCTTCAA AGAAGGCTCG CACGGGCGAC AACGACGTCG ACGCGCCCGC GAAAAATTCC AGCGGGAAAC AGTCGACGCA GCCGAAAAAG TCAACTGCAA AACCTATCAC GGAGATATCA GATGCGGCGA GGAAGATGCG TGCGAAAGCT GCCGCACGAG TGGCGCCGGT GAAAGTCAAG TCTGGCGAAT CCGACGAAGA CGCCGACGAC GGACCGACGG TGATGCTCGA AGAGGAACGC GACGCGGGGA CGACGAACGA AGACTTGATG CGGCGAGCGT TCGCGAACGA CGACATCGAA GCAGAATTCG AAAAGGAAAA GCTCGCCGAT GTCAGCGCAG AACTTCCTGA AACGGACATG CCGAAGAATC TTCCAGGATG GGGCGCGTGG 
GCGAGCGACA AGCGCGTTCC GAAATGGATG AAGGACGCGG AGAAAAAGGC GAAAACTGAG CGCGCGAAGG CTTTGAAGAG TCGTCGCGAC GCCAAACTCA AGCACGTCGT AATCAGCGAA AAGTACGATA AAAAGGCTGC GCAATTCAAC GTGGAATCGC TTCCGCACGG GTACGCGAGC AAGGCGGCGT ACGAGGGCAC GATGCGTCAA CCTTTGGGAT TGGACGTCAA CACGCACGGC ATGTTCCAAA AGTTGAACGC GCCGAAAGTA TTGAAGCCAA CGGGCTCTAT AATCAAACCC ATGAAGTTGC CGAAACACAA GGCGAAGGAA GCGTCGACGT CCAAGTCTTC GCGCAAAAAG AAGGCGCGCT GA
|
Protein sequence | MDSDDEAEEA VRAKAKSTSR TRAKPTREVV EEEDVEALVS SEDDDDEKRE RVLDELVGAR KRVTLDPRYR PVITEVGREN PMAVPLRLRE GEEKLTLKDV MASLGEDALD RDTVKRLNKI SKTKAVDAPL ARPIKERIDR KAAYAETSKD IGKYQAVVKE NREKRTLKFE PARKEMQRKD TLGALAADFT PRSEVELEIA RVLKESGHAS AKDVVRGELL EMNHLDVEDI QARQAKLAKM RAVLFYHEQK AKRLKNIKSK AFHRHNRKGE LKVIGDEDDS DYEGEGDTPE ERREYLRAQE RMLLRHKNTS RWAKRAIKKG IAHLAGTREK LQEQLRIGQQ LKEKIEGTRT TTTELDEEST DAEDSEDEDP NDPETERKRR LRAKAAALKA LEDGGDDDVE GANDSLFKLP FMARAMEKRK SQTKAEAQEL LDELDRMEQN GEELSDSDDD HEEWDAMTTN AKRAADARTA SASKKARTGD NDVDAPAKNS SGKQSTQPKK STAKPITEIS DAARKMRAKA AARVAPVKVK SGESDEDADD GPTVMLEEER DAGTTNEDLM RRAFANDDIE AEFEKEKLAD VSAELPETDM PKNLPGWGAW ASDKRVPKWM KDAEKKAKTE RAKALKSRRD AKLKHVVISE KYDKKAAQFN VESLPHGYAS KAAYEGTMRQ PLGLDVNTHG MFQKLNAPKV LKPTGSIIKP MKLPKHKAKE ASTSKSSRKK KAR
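The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, compute GC content, and translate a fragment. The Translation table field is empty in this record, so the standard genetic code (NCBI table 1) is used here as an assumption. The helper names are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), an assumption because the
# record leaves the Translation table field empty.
# Codons are ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between the ten-base blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
fragment = clean_sequence("ATGGATAGCG ATGACGAGGC GGAGGAGGCG")
print(translate(fragment))  # MDSDDEAEEA
print(f"GC fraction of fragment: {gc_fraction(fragment):.3f}")
```

The translated fragment matches the first block of the protein sequence. The GC fraction printed here describes only the 30-base fragment. Reproducing the 58% figure in the table would require passing the full gene sequence to `gc_fraction`.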