Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_19757 |
Symbol | |
ID | 5004803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | - |
Start bp | 59072 |
End bp | 61246 |
Gene Length | 2175 bp |
Protein Length | 715 aa |
Translation table | |
GC content | 59% |
IMG OID | 640420224 |
Product | predicted protein |
Protein accession | XP_001420741 |
Protein GI | 145352836 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02100] glycogen debranching enzyme GlgX |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.13703 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0212411 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGCG CGCGCGCGTC GAGACCGGAC GGCGCGCGCG CGAGGGCGAC GGCGACGGCG AGAGGGACGA GGACGGATGG ATTGGAGATT TCGAGCGGCG AACCGGCGCC GCTGGGACCG ACGGCGACGA CGTCGGGAGG GATAAACTTC GCGACGTACA GCGAGAGCGC GAGCGAGGTG AGCCTGTGCG TGTACGATGA GAGCGATGAT TGGAGCGAGG CGACGCCGCG GTGGGAGGTG CCGATGACGA GGACGGGGAA CGTGTGGCAC GCGCGCGTCG AGCGCGGGGC GCCGAGACGA GGGGCGAGGT ACGGGTATCG ATGCAAGGGC GCGGGGGGGT GGGAGACGGG CGCGCGATGG TACGAGGATC GGGTGATGAT GGATCCGTAC GCGCCGCTCG TGGAGGCGCG AAGGAAGGTT TTCGGGGAAG GGCCGAAACA CGCGACGCAC GGCGACGTCA ACGATCCCGA CATGTTGAGC GGGTACGATT TCGAATCCGA GCCGTTCGAT TGGGCGGGCG TGGAATCGCC GAAGATCGAA GAGAAAGATA TGATCGTTTA TGAAATGACG GTGCGCGCGT TCACCGCGGA TGCGAGCTCG GGGTTGGATG AGAAGACGAG AGGTTCGTAC GCGGGCGTCG CCGCCAGGGT GGATCACTTG AAAGCGCTCG GCGTCAACGT CGTCGAGCTC TTGCCGGTGT TCGAATACGA CGAGATGGAG TTTCAACGCA TTCCGAACCC GCGCGATCAC ATGATAAACA CGTGGGGTTA CAGTACGATG AGTTTCTTCG CGCCGATGTC GCGGTTCGGC ACAAAGGGCG CGTCCGCGGC GCGAGCGTCT CGAGAGTTCA AGGAGATGGT GAAAGCGTTA CACGCGGCCG GGATCCAAGT CATTCTCGAC GTCGTGTACA ATCACACGGG AGAGATGAAC GACGAGTTGC CGAACCTGTG TTCTATGCGT GGAATCGACA ACAAGACGTA TTACATGACG GACACGTCAA AGTACGTGCA AATGTTAAAC TTTACGGGGT GCGGCAACAC GCTGAATGCG AACCATCCGT ACGTTTCCAA GTTCATCGTC GATTCGCTGA AACACTGGGT GCGAGAATAT CACGTCGACG GCTTCCGTTT CGATCTCGCC AGCGCGCTGT GCCGGGACGA GAAGGGACAT CCCATGAACT CGCCTCCGGT GATTCGAGCC ATCGCGAAGG ATCCCGAACT TTCGCACGTC AAGCTCATCG CCGAGCCTTG GGATTGCGGC GGACTGTACC AAGTCGGTAG TTTCCCCAAC TGGGACCGCT GGTCTGAATG GAACGGCGCC TATCGCGACG TGTTGCGGCG CTTCATAAAG GGCGACGAGG GCGTCAAGAG CGACTTTGCG AGACGCATCA GCGGCTCTGC CGACATGTAC CACACGAACA AACGCAAGCC ATACCACTCG GTCAACTTTA TCACCGCGCA CGATGGGTTC ACGCTTCACG ATCTGGTCAG CTACAACGGT AAGCACAACA TGGCAAACGG GGAATCAAAC AACGACGGCT CGAACGACAA CTTGTCTTGG AACTGCGGAC ACGAGGGCGA AACTGGCGAC AAGGCGGTGC GAGGACTTCG CTGGCGTCAG ATGAAAAACT TTCAAGTGGC GCTGATGATC TCTCAAGGGA CGCCCATGAT GGTGATGGGC GATGAGTACG GTCACACACG GTACGGGAAT AATAACACGT ACGGCCACGA CGACAAGTTG AACAACTTTC AGTGGAACGA ACTCGAAAAG CAAAAAGCGC ACTACTTTCG ATTTTCGTCC GAGATGATCA AGTTCCGACT TGCGAACCCT TTGCTCGGTC GCGAAGACTT CTTGAACGAC GATGACGTCA CGTGGCACGA AGATCGATGG GACGATCCGT CGTCAAAGTT TTTGGCGTTT ACGTTGCACG ACCGCGGGCA AGGTTTCGGC GATACATACA TCGCCTTCAA CGCGCACGAA TTCTACGTCG ACGCCGCGCT ACCTGCGCCA CCTCACGGTA AGCGTTGGGC GAGAGTCGTG GACACAAACT TGCCATCGCC CGAAGACTTC ATCGCGGAGG GCAAATTTGG CGTCGAGTCG CGATATAACG TCGCGCCGCG CGCGAGCGTT ATCCTCGTCG CCAAGTAGAG CTACTGTACC TTATATATAG ATGCC
|
Protein sequence | MGGARASRPD GARARATATA RGTRTDGLEI SSGEPAPLGP TATTSGGINF ATYSESASEV SLCVYDESDD WSEATPRWEV PMTRTGNVWH ARVERGAPRR GARYGYRCKG AGGWETGARW YEDRVMMDPY APLVEARRKV FGEGPKHATH GDVNDPDMLS GYDFESEPFD WAGVESPKIE EKDMIVYEMT VRAFTADASS GLDEKTRGSY AGVAARVDHL KALGVNVVEL LPVFEYDEME FQRIPNPRDH MINTWGYSTM SFFAPMSRFG TKGASAARAS REFKEMVKAL HAAGIQVILD VVYNHTGEMN DELPNLCSMR GIDNKTYYMT DTSKYVQMLN FTGCGNTLNA NHPYVSKFIV DSLKHWVREY HVDGFRFDLA SALCRDEKGH PMNSPPVIRA IAKDPELSHV KLIAEPWDCG GLYQVGSFPN WDRWSEWNGA YRDVLRRFIK GDEGVKSDFA RRISGSADMY HTNKRKPYHS VNFITAHDGF TLHDLVSYNG KHNMANGESN NDGSNDNLSW NCGHEGETGD KAVRGLRWRQ MKNFQVALMI SQGTPMMVMG DEYGHTRYGN NNTYGHDDKL NNFQWNELEK QKAHYFRFSS EMIKFRLANP LLGREDFLND DDVTWHEDRW DDPSSKFLAF TLHDRGQGFG DTYIAFNAHE FYVDAALPAP PHGKRWARVV DTNLPSPEDF IAEGKFGVES RYNVAPRASV ILVAK
|
| |