Gene Information |
Locus tag | OSTLU_52106 |
Symbol | |
ID | 5006979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 202897 |
End bp | 204870 |
Gene Length | 1974 bp |
Protein Length | 626 aa |
Translation table | |
GC content | 59% |
IMG OID | 640422400 |
Product | predicted protein |
Protein accession | XP_001422838 |
Protein GI | 145357260 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4284] UDP-glucose pyrophosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.222243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACG CGCGCGCGAG CGCGACGGCC GTCGTCGACG CGCACGTCGC GCGCGGCGCG CTGACGACGG ACGACGCGCG GACGCTGCGC GAAACGATCG CGCTGGGACA AGCGCATCTG ATCGCGGATT GGCCGGCGCC GGGCGTGGAC GACGAGAGGA AGCGCGCGTT CGTCGAGGAA GTGCGACGGG CGGATCGGGG GTACCCGGGG GGGGTGGCGA AGTACGTGTC GAACGCGCGC GAGCTGCTGA GGGCGTCGAA GGAGGGGAAG AATCCGTTCG AGGGATGGAC GCCGAGCGTG CCCACGGGGA AGACGGTGGA GTACGGATCG GCGGCGCACG AGATTCTGGA GAAGATTGGG ATGCGGGAGA CGGCGGAGAC GTGCTTCGTG CTCGTCGCGG GGGGGTTGGG AGAGCGATTG GGGTACTCGG GGATCAAGGT CGCGCTGCCG GTGGAGCGGG CGACGAACGC GTGCTATTTG GAGTTGTACG TGAAGAATAT CTTGGCGATG GAGAAACGCG CGGAGGGTGC GGAGGGTGCG ACGAACGCGG GTGGGTGCGG GTGCTTCGGC GGTGGCGGCG CGAAGGCGAA ATCGTCCACG AAGATTCCGT TGGCGATCAT GACGTCGGAG GACACGCACG CGCTGACGCT CGATTTGCTC GAACGCAACG ATTACTTTGG CGCGTCTCGC GATCAAATCA CGCTCATGAA GCAAGAAAAG GTGCCGTGCT TGATGGATAA CGATGCACGT TTGGCGGTGA AAGACGACGA TCCTTACAAG CTCGCGCTCA AGCCGCACGG CCACGGTGAC GTGCACTCTC TCCTGCACAC GAGCGGGTTG TTATCAAAGT GGATGAGCCA AGGCAAGAAG TGGGTCGTCT TCTTTCAAGA CACGAACTCG CTCGTGTTCC GCGTCATCCC TGGTGCGCTC GGGGTGTCGA AGACGATGAA TCTTGAGTTC AATTCTTTGT GCGTTCCGCG AAAAGCCAAG GAAGCGGTCG GTGCGATTTC GTTGCTAACT CACGAGGATG GACGCAAGAT GACCATCAAC GTCGAATACA ACCAGCTCGA TCCGCTTTTG CGAGCCACTA CGAATCCCGA AGGCGACGTC AACGATGCCA CGGGCTTCTC CCCATTCCCG GGTAACATCA ATCAGCTCAT CGTGAGTCTT CCAGAGTACG CAAAACAACT CAAGAAGACT GGCGGCGCGA TCGAAGAATT CGTCAATCCC AAGTACAAGG ATGAGACAAA GACGGCTTTC AAATCGCCGA CGCGATTGGA GTGCATGATG CAAGACTATC CCAAGAGCCT CGGATCGAAG GCTAAGGTTG GTTTCACCGT CTTTGCGAAC TGGATTGGCT ACAGCCCGGT GAAGAACTCT CCGGCGGACG GTTTGGCCAA GTTCAAATCT AACGGCCCGA CGCACACGGC GACGAGCGGC GAGTTTGAGT TTTACGAATC GTGCGCAAAC TTGTTGCGTT TGGCCGGTGC CGACGTCCCC GCCGCCGCCG TCGACGCTGA ATTCAACGGT ATGAAGCTTC CCATGGGTCC ACGTGTTGTG CTCGGTCCGG ATGTCGCCAC ATCTTTTGAT GAACTCAAGT CGAAAGTCGG CGCCGTCAAG TTGGGCGCGA AGAGCGCGCT CGTCGTCGAA GGCTCTGGCG TCAATTTGAA AAACGTCGAA GTGGATGGTG CGCTCGTCAT CAAGGCGTGC GAGGGCGCGG AAGTCATCGT CGATGGTTTG AAAGTGACGA ACAAGGGTTG GCAGTGGAAG CCGACCGGCA AAGGTGCGCC CGAAGTCGAC 
GCGCTCGCGG GATTCGTCGT GAAGAAAAAC GAAACGGCCG AGTACGTCTT CGACAAGCCT GGCAAGTACA CGCTCCCGTA AGCGTTTCTC TTCTCCCTCG TCCTAGTAAA AACCACCAGC GACGCAAAAA CACGCCATTT GAATTGAAAT AACAACACGA ATGAATACTT TGTG
|
Protein sequence | MDDARASATA VVDAHVARGA LTTDDARTLR ETIALGQAHL IADWPAPGVD DERKRAFVEE VRRADRGYPG GVAKYVSNAR ELLRASKEGK NPFEGWTPSV PTGKTVEYGS AAHEILEKIG MRETAETCFV LVAGGLGERL GYSGIKVALP VERATNACYL ELYVKNILAM EKRAEGAEGA TNAGGCGCFG GGGAKAKSST KIPLAIMTSE DTHALTLDLL ERNDYFGASR DQITLMKQEK VPCLMDNDAR LAVKDDDPYK LALKPHGHGD VHSLLHTSGL LSKWMSQGKK WVVFFQDTNS LVFRVIPGAL GVSKTMNLEF NSLCVPRKAK EAVGAISLLT HEDGRKMTIN VEYNQLDPLL RATTNPEGDV NDATGFSPFP GNINQLIVSL PEYAKQLKKT GGAIEEFVNP KYKDETKTAF KSPTRLECMM QDYPKSLGSK AKVGFTVFAN WIGYSPVKNS PADGLAKFKS NGPTHTATSG EFEFYESCAN LLRLAGADVP AAAVDAEFNG MKLPMGPRVV LGPDVATSFD ELKSKVGAVK LGAKSALVVE GSGVNLKNVE VDGALVIKAC EGAEVIVDGL KVTNKGWQWK PTGKGAPEVD ALAGFVVKKN ETAEYVFDKP GKYTLP
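The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and the helper names (`gene_length`, `cds_length`, `gc_percent`) are hypothetical, not part of any database API. Under that assumption the gene length works out to 1974 bp, while a 626 aa protein plus one stop codon needs 1881 bp. The 93 bp difference appears consistent with the listed gene sequence continuing past the TAA stop codon.

```python
# Values copied from the record above.
START_BP = 202897
END_BP = 204870
PROTEIN_LENGTH_AA = 626


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def cds_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * protein_length_aa + 3


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
coding = cds_length(PROTEIN_LENGTH_AA)
print(length)           # 1974
print(coding)           # 1881
print(length - coding)  # 93

# GC content of the first two 10-base blocks of the gene sequence.
print(gc_percent("ATGGACGACG CGCGCGCGAG"))  # 75.0
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field; that result is not asserted here.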