Gene Information |
Locus tag | OSTLU_88889 |
Symbol | |
ID | 5005067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009367 |
Strand | + |
Start bp | 47012 |
End bp | 49231 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | |
GC content | 58% |
IMG OID | 640420488 |
Product | predicted protein |
Protein accession | XP_001420890 |
Protein GI | 145353156 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4284] UDP-glucose pyrophosphorylase |
TIGRFAM ID | |
|
|
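The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # Assumption: the terminal stop codon is part of the CDS but not the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length(47012, 49231)
print(length)                  # 2220
print(protein_length(length))  # 739
```

Under these assumptions the values agree with the Gene Length (2220 bp) and Protein Length (739 aa) fields reported in the record.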
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.000760017 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.165003 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGATTG CGGGGGAGGG CGCGTCGAGC GAGACGGAGA CGTACGAGAA AAGAGTGGCG TTGATGGACG AACTAGAGAT CGTTCGCGCG TGGGTCGAAC GCGGAGGCGC GGTGGTGGCG ATGAGCGAGA CGATGTCGCC AAGAGAGCGG TATTTGGTGC GCGCGTGCGT GGCGTGCGGG CAGGCGCACC TGTTTGACTT TCAAGACGAC GACGACGCGG GAGAAGACGC GCTCGAGGGT CGAATCGCGC GTTTGGTCGC GGTGTTGGGC AAAGTAGAAA CGTTTTATGA CATGCTCGGC GGTGTCGTGG GGTATCAGTG CACGGCGCTG GAGCTTTGCT TGGAATACGC CACGGGGGAG CCGGCGATGT TGCACAGCGG GGCGGATTGC TACGGCGTGG ATTGCTACGG CGTTCCGGGA GACGTGGATT TTCACGTCCC CCCGGGCGTT GATCTGAGAG CGAGCGATGG TGCGTTCGCA GCCACGGCTG CGCGTTGGGG ATTGGAAGAG CTTCCGAACA TGGCGGAGAT TTATCCGCTC GGCGGCGCCG GCGACAGGCT CGGGTTGGTG GACGCCGAAA CCGGCGAGAG CTTGCCCGCA GCGCTCTTGC CGTACAACGG GCGGCCTCTC ATCGAAGGCC TCGTCCGCGA TTTGACCGCT CGAGAGTGGT TGTACTACAA ATTGACTGGT GAGCACCACA AAACACCGGT GGCGGTCATG ACGAGCGCCG CAAAGGGTAA CCACAGACGG ATCACGGCTT TGCTCAAAGA GAACAACTGG TTCGGTAGAG GTGAGGAAAA CTATCGACTG TTCGAGCAAC CACTCGTTCC AGTGATCAGT ATGGATGGCG GCCGTTGGGT GAGAGAAGGC TTCTCGCAAA TGGCGCTCAA GCCCGGGGGA CACGGCGCCA TTTGGAAGCT CATGCACGAC GACGGCGTGT TTGATTGGTT GGAGTCTCGC GACCGAACAG GCGGGATCGT GCGCCAAATA ACGAATCCGA TGGCGGGAAC GGATACCACA CTTCTATCCC TTTCAGGCGT CGGCATTAAG GGCGACAAAG CGCTCGGATT CGCGAGTTGC GAGAGACACG TGGGCGCCTC AGAAGGCGTC AACGTGCTCA TCGAAAAGAA GAATGCTCTG ACGGATGAAT TTGTGTACGG CGTGTCCAAC ATCGAGTACA CCGATCTCGA TCGACTGGGC GTGAGTGATA AAGCGAACGG TGACGGCGGT ACGGAAAGTG CGTATCCCGC AAACACGAAT GTTTTGTACG TTGGATTGAA GCACATACGG GACGCGCTCG TCGGTTCGTC TCGCGCGGCG TTTCCGGGCA TGTTGATAAA CCTCACGAAA CCGGTGCTGG CGAACGGCAC TAAAGGTGGT CGCCTCGAGT GCTCGATGCA AAACATCGCG GACGCGCTCA TGCGCCGATC GAGCCATCGA CTTGGTCCGG AAGATTTCGA TAATTTGCCA ACGTTTGTCC TGTACACGCT TCGCCGACGC ATCACCAGTA GCGCGAAGAA GAAGCGAGCA CCGGAATCGA TGAATCTCGC GCAAACCCCC GATGGCTCAT TTTTGGATTT ACTTCGCAAC GCGAGCGATC TGCTGAAGCG TTGTAACGTC GCCCACCCAC CGCCGGACGA TCAACCCTTG GAGGAGTACT TGAGCGACGG ACCAGAGTTC ATCTACAGCG CCCTTCCTAG CATCGGCCCG CTTTGGGATG TCGTAGAGCA AAAAATTCAA GGCGGCGAAA TCAAGAAAGG CTCAGAAGTG CGCTTAGAGA TCGCGGAGAT TGAATGGCGT 
GACGTTTCTG TGCAAGGATC ACTCTTCGTC GAGTCGTCCT CGCCGTTTGG TACTACTTCA GCCGAGAGCG TGTGCTTTGA CGAATCCGCG TGCGGCCGTT GTCGATTGAA CAACGTCGTC GTCTCAAACG CTGGAATCGA TTGGAGCGAA GCGTCAAACG TATACTGGAG TAACTTCATC ACGCGCCGCG AGCGTTGCTC CATCGTCGTC GAAGGCAACG GTGAATTCGA CGCCAAAGAC GTCGCTCTCG AAGGCGACGT GCGTTACGTC GTTCCGACCG GTAAGCGACT GATGTTGCGA CCGGACGGCG CGGGTGGCGT TCAAGAAACA TGGAGCGATA TTTCGATGCC ATCATGGCGC TGGAAGTATA CGTTCGGTGA CGACGACCGC GTCAACGTCG TCATGGAAGA ACTGAGTTGA
|
Protein sequence | LEIAGEGASS ETETYEKRVA LMDELEIVRA WVERGGAVVA MSETMSPRER YLVRACVACG QAHLFDFQDD DDAGEDALEG RIARLVAVLG KVETFYDMLG GVVGYQCTAL ELCLEYATGE PAMLHSGADC YGVDCYGVPG DVDFHVPPGV DLRASDGAFA ATAARWGLEE LPNMAEIYPL GGAGDRLGLV DAETGESLPA ALLPYNGRPL IEGLVRDLTA REWLYYKLTG EHHKTPVAVM TSAAKGNHRR ITALLKENNW FGRGEENYRL FEQPLVPVIS MDGGRWVREG FSQMALKPGG HGAIWKLMHD DGVFDWLESR DRTGGIVRQI TNPMAGTDTT LLSLSGVGIK GDKALGFASC ERHVGASEGV NVLIEKKNAL TDEFVYGVSN IEYTDLDRLG VSDKANGDGG TESAYPANTN VLYVGLKHIR DALVGSSRAA FPGMLINLTK PVLANGTKGG RLECSMQNIA DALMRRSSHR LGPEDFDNLP TFVLYTLRRR ITSSAKKKRA PESMNLAQTP DGSFLDLLRN ASDLLKRCNV AHPPPDDQPL EEYLSDGPEF IYSALPSIGP LWDVVEQKIQ GGEIKKGSEV RLEIAEIEWR DVSVQGSLFV ESSSPFGTTS AESVCFDESA CGRCRLNNVV VSNAGIDWSE ASNVYWSNFI TRRERCSIVV EGNGEFDAKD VALEGDVRYV VPTGKRLMLR PDGAGGVQET WSDISMPSWR WKYTFGDDDR VNVVMEELS
|
| |
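The sequence fields are shown in space-separated blocks of ten characters, so whitespace has to be removed before any computation. The sketch below shows one way to strip the blocks, compute a GC fraction, and translate codons. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helpers `clean`, `gc_fraction` and `translate` are hypothetical names, and the example uses only the first three blocks of the gene sequence rather than the full 2220 bp.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in TCAG order.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record.
first_blocks = "TTGGAGATTG CGGGGGAGGG CGCGTCGAGC"
print(translate(first_blocks))  # LEIAGEGASS
print(f"{gc_fraction(first_blocks):.0%}")
```

The translated fragment matches the first block of the protein sequence in the record, including the leading L encoded by the TTG codon. The GC value printed here applies only to the 30-base fragment; the 58% in the record refers to the full gene.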