Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38852 |
Symbol | |
ID | 5002087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 732657 |
End bp | 734438 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | |
GC content | 60% |
IMG OID | 640417508 |
Product | predicted protein |
Protein accession | XP_001418096 |
Protein GI | 145347269 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.569718 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCG AAGACGACGT CGAGGAAGAC GGCGGCGCGG CGACGGACGG CGCGCAGGCG CCGTACGCGG CGTTTTGCAC GGCGGCGATA CTGGCGGCGG TGCACCGAGC GGCGATTTGG CGCGACTCGA AGGATTTCGT GGACACGCGG TCGAGATCAC CGCCGGCGAA GGTGTTCGAG GCGCTGGCGC AGTCGCGAGC GGCGACGTGT AGCGTGGCGG CGCGGGAATT CTTGAACGAA CACTTCGAGA GCGGACCGAG GGAGAGGTCT AAGATGCCGG AACTGGCGGA CTGGCGAAGC GAGCCGGCGG TGGCGAGAGG GGCGAGGTGC GAAAAGTCGA GGGAGTTTGC GACGCACGTG CACGAGCTGT GGCGCGTGCT CGCGCGGTTG GACGCGGATG ACTACGCTGA GGAAGAAGTC GGCGCGGAGG GCGAGGCGCG ACGGACGACG AGCTCGAGGA TTCGATTGCC GTACCCGGCG GTGGTGCCGG GGGAACGTTT TAGGGAAACC TACTACTGGG ATACGTATTG GATCGTTTTG GGGTTGTTAA CGAGCGAGAT GCCGGCCACG GCGCTGGGAG TGACGAATAA TTTGTTGTAC ATGGTCACCA CGTACGGATT CGTGCCCAAC GGCGCGCGCG TGTACTATTT GAATCGATCT CAGCCGCCGT TGTTGTCGTC GTGCGTCGCC GAGGTGTTTC AAGCGACGCG AGACGTCGAG TGGTTGCGAC AGGCGTTGCC GTTGCTGGTG CAAGAATACG CCTATCTCAC TCGAAGTGAA CGCACGGTGA CGATTCGTGA CACGGAAACC GGAGAGACGC ACGAGCTGTC GCGATATTTT GCCAACACCA CGCGTCCGAG GCCAGAGAGC TATCGCGAAG ACGTCGAGGT GGCGCGCCGA GCGACCAGAA AGGTGGAGGA CGCCGTGGCC AAGCTCGAAG CTAAGCGTAA GATATATCGA CATCTTGCTA GCGCGGCGGA GAGCGGCTTC GACTTCAGCT CGAGATGGTT CCTAGACGGC GATAACTTGG AGACGATTCG CACGTGCAAC ATCATTCCAT CCGACTTGAA CGGATTTATG CTACGAGTGG AGACGCAAAT CGCTTTGCTC GCCCGCGAGG CATTAGTGTC GTTGGAAAAC GAAGACGAGC TCTTCGCCGA GCGCGTGTAC TTGAACCATT TGCTCGAGAA GTTCTCCCGT GCGAGCGAGG TGCGGCGGCG CGCGATTGAC GCCGTGCTTT GGGACGACGA CGTCAAGCGG TGGCGAGACA TGGCGTTCGA ACCGCTCATG GGCGAAGACA CCCGAGGAAT CGTGCGCGAT CGCGATGATC TCACGGCGGC ATCTGAGAGC CCGTTTACGA GCGATTTTAC TCCGCTTTGG TGCGGCGCTT GCGATCCCGA CAGCGATCAA GCGTACGAAG TCGTCGAGTC GTTGAAAAAG TCCAAGTTGG TCACCGACAA AGGCATCGCG ACCTCACTCG TCGAGAGCGG TCAGCAATGG GATTGGCCTA ACGCTTGGGC GCCGGAGACT CACATGATTG TCGAAGCGAT ACAAATTTTC GCCCCTCGCG AGGAAGAGTA CGCGAAGACG CTCGCGCACT CGTGGCTCCG CACCGCGCAT CAAGCGTGGA AGTCAACGGG CTACATGCAC GAAAAGTACG ACGTGCGCTC GACCGAGGAC GGCGTGGGTA AAGGCGGCGA ATACATCCCT CAACGTGGTT TCGGCTGGAC CAACGGCGTG ACACTTCGAT TACTCGAACA ATACGGATTC CCTCAAGATT GA
|
Protein sequence | MTSEDDVEED GGAATDGAQA PYAAFCTAAI LAAVHRAAIW RDSKDFVDTR SRSPPAKVFE ALAQSRAATC SVAAREFLNE HFESGPRERS KMPELADWRS EPAVARGARC EKSREFATHV HELWRVLARL DADDYAEEEV GAEGEARRTT SSRIRLPYPA VVPGERFRET YYWDTYWIVL GLLTSEMPAT ALGVTNNLLY MVTTYGFVPN GARVYYLNRS QPPLLSSCVA EVFQATRDVE WLRQALPLLV QEYAYLTRSE RTVTIRDTET GETHELSRYF ANTTRPRPES YREDVEVARR ATRKVEDAVA KLEAKRKIYR HLASAAESGF DFSSRWFLDG DNLETIRTCN IIPSDLNGFM LRVETQIALL AREALVSLEN EDELFAERVY LNHLLEKFSR ASEVRRRAID AVLWDDDVKR WRDMAFEPLM GEDTRGIVRD RDDLTAASES PFTSDFTPLW CGACDPDSDQ AYEVVESLKK SKLVTDKGIA TSLVESGQQW DWPNAWAPET HMIVEAIQIF APREEEYAKT LAHSWLRTAH QAWKSTGYMH EKYDVRSTED GVGKGGEYIP QRGFGWTNGV TLRLLEQYGF PQD
|
| |