Gene Information |
Locus tag | OSTLU_92070 |
Symbol | |
ID | 4999493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 172445 |
End bp | 174757 |
Gene Length | 2313 bp |
Protein Length | 770 aa |
Translation table | |
GC content | 62% |
IMG OID | 640414914 |
Product | predicted protein |
Protein accession | XP_001415751 |
Protein GI | 145341300 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
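The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. This assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 172445
END_BP = 174757


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2313 770
```

Under these assumptions the result matches the listed Gene Length (2313 bp) and Protein Length (770 aa).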
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGGC ATCGCGGGAC GAGAGGCGGA TGGAACGCGA CGACGACGGA GGGTGGAGAT GGACGCGAAC GCGAACGCGC GAGCGAGCGG GACGCGAACG CGGCGACGCG AGGCGAAGGC GAAGGCGAGG CGACGGGGAG AGCGTCGATC GCGGCGACGC TCGCGGGAGA GACCGCGTCG TCGATGACGT CGAGTTATAG CGCGTGGGAG GGCGAGGAGA CGAAGACGCA CCCGAGCTCG ACGTTCGACG CGGGGAGATC GACGGGGGCG GCGTTGCACG GGTGGTTACA TTTAGAGGAG TGGTTCTTCG CGCAAGGCGC GTCGCACGAC GTGAGCGCCG ATCGCAGGGA CGAGAACGGG GTGTGTTTTC CGCCGATGTT TCCCGACGCC GCGTCGTTGG GGTTCACGTG GTCGTCGGAG GGAGATTTAG TCAACAAGCT CGTCAAGCAC TTTGGCGCGA AGTCGGCGGT GACGAGTTTC TCCGCGCATC GGGCGCAGTA CATCACGGAC GAAGACTTAT TAGAAATAGC GTCGCAAGGG ATCGAAATGG TTCGATTGCC ACTGTCTTGG AGCATTTTCG CGAGAGATCC GGCCACGGTG CCTCGCGGCG GCGAGCGCAT CCTCGTCGAT CCCGTGTACC CGGATCGCTT GTTCGTCAAC ATGGCGGGCG CCGATTTAGA CGCGGTGATC GAACGAATTC GCAACGCCGG GCTGAAGGTG TTGATTGATT TGCACAGCAT GCCCGGCGGT GCGGCGGCTG GGACGTACAA CGGCGTTTTC CCGCATCCGC CGATGATGTT CGCTCGCGAC GATCTCAGTC GCACGGGGTT GCTCATCGTT CGAAACATGC TCAACTGGTT CAAATCGTTG CCGGAAGACT CTCGTCTCGC CGTACACGGA ATCACGCTAC TGAACGAGCC CGGGCACAAC CTTCCGGCGC AAAGGTCGAA GGTTCTGCGT TGGCTCGCCA AGGCTGTGCA AACGTACAAG GATGAAATCG TCAACGACGG CGTGCCCGAC GGCGAGCGCG TCCCGTACCT GTACGTCAAC TTGATCGAGA CGCTCGATCT GAACGTCGCG GACATGGCGG CTTGGATGCG TTCGCAATTC ACGGTGAAAG AGCTCGAGTC ATGGGCGGTG TTAGACGTGC ATCATTACTT TGCGTGGTCG TACACAGGAT GCATGGGCGG CACCAACGCG GGGTGCGCGT TTAGTTGCGA CGACAAACCA TCCGTCGTCG CGCAAAAAGT CGGCGAGCGC GCGGGCGAGT GGGCCGGTAC TTTCCGAGCG GCGGCGCAGA CGTACGGCGT ACAAAATCTC GCCGTATCGG AGTGGTCGCT GGCGACGTTT TCGGACTCAT CTCGCTCGTG CTCGAACAGG GAAGTCTTGG ACATTATGTT TGAGCATCAA GAGCAAGCGT ATCGGGGCGC CGGCATCCAG AGTTTCTTTT GGGGATGGAA GATGCCCCAC GGTGGTTCGC ACATCAAAGC GTGGTCGTTG AGCGATTACT TGGCCGGGCG AAGCGCGAGC GAACGTAAGC AGCACGCGCT GGTCCCGCAC GGGTACTCGA TCGAAGACAC GCTCGAGTTA CCGCAAGGCG CTTCCAAGGA GGATGCGGCG CGTCTCATTC GCGCGTACAG CCAGTTCCCG GACAAGCCCG CCGAGGATAA AGATAGCCCA GACACGATGA AGATTCAAAC CAAGGAAATG TTGAAACTTT TGAGTGCAGT GGCGGGGCAG TCGACGGCGG CTGTGGGGGA GAGCGATCAC GCCCGCCATC GACGCAGTCG CGGCGGGCCG 
TCCGCCGCCG ACGCCACGGA GGATTATTTG CGCGCGAGTT CCAAGCTCAA GTCAATAATC GACACTGACG CGTACGACGA CGACGACAAC GACGATGACA AGGAAATCAT CGTTGTTCGC CACTCGCATG TATCTGACGT AGAAAAGACG CCGGTTGGTG ATCCGATCCG GCTGGACTTG CCGTCTCCGC CGCCGTCTCC GAACGTCACC GCGCCGGCGG ACGTCGCGCT CGCCGCGGCG CAGCCGTCGC CATCGCCCGA GCCAGTCGAC GATGGTTCGA TCGACTCTTT GCTCGATCCG GCGAAAGATG TTCCTCCGCC ACCGCCGGCG CCACCGCCGA TGCCACCGCC GCCGCCACCG ATGCCGCCGC CGAAGAGCGT GCTGCGCGCG CAACAACGAG CCGCGGAGAC GCTCTCCGAT TTGTCGCTCA CCTCCGGGGA AGGGTTCTCG ATTCCCGACG ACGGCTTCGA CGAGTCATCG AGCGTGAGTG CGTCCACAAT CGATGGCTTC TAG
|
Protein sequence | MARHRGTRGG WNATTTEGGD GRERERASER DANAATRGEG EGEATGRASI AATLAGETAS SMTSSYSAWE GEETKTHPSS TFDAGRSTGA ALHGWLHLEE WFFAQGASHD VSADRRDENG VCFPPMFPDA ASLGFTWSSE GDLVNKLVKH FGAKSAVTSF SAHRAQYITD EDLLEIASQG IEMVRLPLSW SIFARDPATV PRGGERILVD PVYPDRLFVN MAGADLDAVI ERIRNAGLKV LIDLHSMPGG AAAGTYNGVF PHPPMMFARD DLSRTGLLIV RNMLNWFKSL PEDSRLAVHG ITLLNEPGHN LPAQRSKVLR WLAKAVQTYK DEIVNDGVPD GERVPYLYVN LIETLDLNVA DMAAWMRSQF TVKELESWAV LDVHHYFAWS YTGCMGGTNA GCAFSCDDKP SVVAQKVGER AGEWAGTFRA AAQTYGVQNL AVSEWSLATF SDSSRSCSNR EVLDIMFEHQ EQAYRGAGIQ SFFWGWKMPH GGSHIKAWSL SDYLAGRSAS ERKQHALVPH GYSIEDTLEL PQGASKEDAA RLIRAYSQFP DKPAEDKDSP DTMKIQTKEM LKLLSAVAGQ STAAVGESDH ARHRRSRGGP SAADATEDYL RASSKLKSII DTDAYDDDDN DDDKEIIVVR HSHVSDVEKT PVGDPIRLDL PSPPPSPNVT APADVALAAA QPSPSPEPVD DGSIDSLLDP AKDVPPPPPA PPPMPPPPPP MPPPKSVLRA QQRAAETLSD LSLTSGEGFS IPDDGFDESS SVSASTIDGF
|
| |
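A minimal sketch of how the Sequence fields could be sanity-checked against the Gene Information fields. It assumes the gene sequence is given in the coding orientation (it begins with ATG and ends with TAG as listed) and that the standard stop codons apply; the record leaves the translation table blank, so this is an assumption. The helper names are hypothetical. The sequence is shown in space-separated blocks of 10, so whitespace is stripped before any calculation.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def is_complete_cds(seq: str) -> bool:
    """True if the sequence starts with ATG, ends with a stop codon,
    and has a length that is a multiple of 3."""
    s = clean(seq)
    return len(s) % 3 == 0 and s.startswith("ATG") and s[-3:] in STOP_CODONS


# Example on the first block of the gene sequence only.
print(gc_content("ATGGCGCGGC"))
```

Applying `gc_content` to the full gene sequence and rounding to a whole percent would allow a comparison with the listed GC content, and `is_complete_cds` could be used to confirm that the sequence forms a complete open reading frame.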