Gene Information |
Locus tag | OSTLU_25793 |
Symbol | |
ID | 5006372 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009372 |
Strand | + |
Start bp | 126984 |
End bp | 130561 |
Gene Length | 3578 bp |
Protein Length | 389 aa |
Translation table | |
GC content | 49% |
IMG OID | 640421793 |
Product | predicted protein |
Protein accession | XP_001422307 |
Protein GI | 145356163 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0381] UDP-N-acetylglucosamine 2-epimerase |
TIGRFAM ID | [TIGR00236] UDP-N-acetylglucosamine 2-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000177629 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGATA TCGAGTCAGA ACGTTTCGAC AGAACGGCCT ATTCTGCTGA AATCAAAATC GCGATTGTCT TCGGTACGCG ACCAGAGGCT GTGAAAATGG CTCCGGTGAT TCAAGCGGTG GCCAGATCGT CGACTCTGAG TGCGATACTG ATCTCCACGG GTCAACACAA ACAGATGCTT GAGCAAGTTC TGCGCCAGTT CAGCTTGCAA GATAAAATTC AACACGAGCT AGCGCTGATG AAGCCAAATC AACAGCTCGC AGAGCTCACA TCGAGCGCGG TGCGCGCGGT AGACGGCGTG CTACGTTCGT CAAAACCAGA CGCGGTGCTC GTGCAGGGCG ATACAACGAC AGCGTTTATC ACGTCTTTGG CGGCGTTTTA CTTGAAAATA CCAGTTGGAC ACATCGAAGC GGGTCTTCGT ACGCGTGATA TCTACTCACC CTTTCCCGAA GAAGTCAATA GACAGTGTAT CTCAGTTATG GCGACGTATC ACTTCGCACC CACCGAACAC GCGGCAAAAA ATCTGTACGA CGAGGGTAGA CGCACGAATG TGTTTACAAC CGGTAACACT GTTGTCGATG CCCTATATGC AATTTTAAAG ACTGAACCGA GTGATCGAGT GATTGAGCTA TCCAAAGTCG TGAAGACTGT ATCCACTCTG CGCGACGTGA GGCTGTTACT TCTCACCGCT CATCGACGTG AAAACCTTGG CGAACCCATA CTCAACATCT TCACAAGCAT AGAGAAGCTT CTGCAAGAAT ACCCTGACGT CGTCGTGATT TATCCCATTC ACTTGAACCC TATGGTGAAG CACTTAGCAG AACAGCACTT CGGCGTTGAT ACATTCCACA GCCTTGTACA GTCGGACCAC GCGCCGCCAA CGACTACGCA CTTACGGCGA TTGCTGATAG TACCACCTCT AGACCATGCA GATTTATTAT TCATGATGAA AGAGAGTTTT TTCGTCATGA CAGATTCGGG AGGTATCCAA GAGGAAGCGG TCACTCTAGG AAAACCGGTG TTAGTGCTCA GAGACACAAC AGAGAGACCA GAAGGAGTGC TCGCGGGCGC CGCAAAACTA GTTGGTCACG GTGCAGAAAG TATATACACC GAAGCGGCGT CTTTGCTGAA AGATCCCGAT TCTTATAGAT CCATGTCAGG CTCGAAGAAA ACTTATGGCG ATGGGAATGC TGCTGGAAAC ATCGTGCGCA TCCTCGAGAC AATGTTGAGA ACGCCGCCGC CAAAGACACT TCCAGTATCG TCTTCGTTTC ATAATGAACA CTCCACTGCG CCGTACGAGA ATGTGGTCGT CGTGACGGTT TGGAAACGTA AAACAATCAC ACAAGTGATG AAAATGATAA GCAAACAGAC GACGCTGATG ACGAAAAGAA CTGCCATCGT TGTCGTTCAA AACGGGGAGC ACGTGGATAT TAGCGACGCG CTTCATCGAT GGAATAGCTC AGAGGCTTGG TCGGGTGTTC CACCACACGT ATATCACGTG CATTCTAAAG TTGAAACGGG TTATTACGGT CGCTTTCTCG CGCCGATGTT CGTACAAACG ACTCCACACG CAACGTTTAT TGTTTTGGAT GATGACGTTA TGTTCGGATC GAGGTACTTT GAGAACATGC TGCGTGTCGT CGATGAAGGC TTTTTGGCGA CGAGAAATGG TAGATTTTTG AACGAAAAGT TTTTAGAATT CGATTGGCGA GGTCACTGGA AAGAAGGTCC GGTTGATACG TTTGACGAAG ATGATGAATA CGATTTTGGA GGTCATTTGT GGGCTGGGAG GATGGTCTGG 
TTACAGGCCG CGTTTCGCGC GCCCCCTCCT TTACTTTACA ATGCGGAAGA CTTTTGGATA AGTGCTGTAT TGCAAACAAC CTTGGGCATC GGTACAAAGC GCGCGAGATG CCCTCGACCG GAAAATGGAG GCGATCTAGA ACTTTGTGCT TGCAGCATGA AACAAGCCGG AAAGCACGAA GCCCCAAATC TTGGAAGTGA CAAAATCAAC GAGAGAAGAT CAAAACGTCA CGATGCGATG AAAACTATCG CCAAGCACTA CGACTACGTG ACGTTGTCTT CCAGGCGACG GTCGGCGGTT GAAGATGTTA AAAATCGCCA CGAAGAAATA CGTATGGAAC GTTTTCACGC CGACGAGGAA ACGTTGGAAA TGTTTCAGCA GTGTCTTTAC TGGTATTAGT AGTGACATCC TCTTCTTGGT AGCGCTCTTG CGCGCGGCAA TCTCGCGACA TATTCAAGAG ATGTTTAGCC TTAGTAATCT AGTACGTAGT GAATAAATAC TATCAGTTTT AGAGCGAACG CTATAAGTTT TCACGGCGAT GTCGTTCAAG GGTGAGTACG AGGTCACTCC GGCTTCATAC ACGGTCGCTA TCGATTCAAA GCTCACACGA CGCATACGAT CGCCATTTCC CATCCCCGGT GCGCATTACC GCGAACGCAT CGCGCACAAA TAGTGGTGAG TATGGCGAAA CCGAACCCCG GGCGGTTGAA AAGTTCGTTC CCGGCTGTTG CGTCACTTGT TAAGAGGAGT ACTTTGGTGA CTGTTTGCGC CTTCTGCGCC GCCTGTATCG CGTGCTGGTA CATTCGTACA GCTTTGAGTG AATCTCCTGT ACGTGACAAT AACGCAGCTC GTTCCAGCAG ATTTAACTGC GGCGTCCGGC AGGCTCAGCT TCTATCGGTC GATGGAAATC CGTGGCTTTT TCATTCGGGC GATGCGATCA CTTCTACTGT TAAGGCGAAG GGCACTTTCG AGCCAGCGAT AAGCAATTTG ATTCGGCGTC AACTCAGTAT TGCAGATGGA ATGATGGTCG ATATAGGTGC AAATGTCGGT GTTATGACGA GTGTTGCGCT TTCTCTGGGC CGCGAAGTGG TCGCCGTGGA GGCACTCCCG GATAACGCGA ATCTGTTACG CTGTAGCGCA GATCACAACG GCTGGACCAC GCTGTTGAAA CTACATAATA CGCCACTTGC TGATCCTGCG TCAAAATCCG AACGGTTTTG TGTGTGCCGG CCTATTGGAA ACCCGAGCGA CGGTATTCTA GTACCGTTTA CTCAGTTTGC AGATCATCCT TTCTGTGGAG GTGGCCAAAG AAAAACAAAA GGAGCGACGG GGCACTGTGC CGAAGAAATG AAGTCCATAA CCCTTGATGA AATACTAGGC AGTGGTTCGA ATCCTATCGC CTTCATGAAA ATGGATATTG AAGGAAATGA GTGTCGCGTC TTGAAAGGCG CGGCGGCGAC CTTACATGGT GGCAACCCAT GTACAATTTT AACTGAGTAT AATCCTGGTC TTCAGCAATA TACCAATTGT ACACTCATCG ACATGATCGA TCACATGAAG CTATTAGGTT ACTTACCGCA CACTTTTCAG CGACAAGGAT GCTCTTTGAA GGAGCTTACA TCCGTTCAGG CAGGAAAAAT GAGCGCGCCA GGATCGGTGC ATAATATTTG CTGGAAACCT CTTTTAGTTC CCAAACATTG CGTGAAAAAA GCGAGCTGAA ACATCGTTCC AACGCATAGA ATAAGTACGC TCTAAGGTGA TATTCGTAAA TCTGTTGC
|
Protein sequence | MKDIESERFD RTAYSAEIKI AIVFGTRPEA VKMAPVIQAV ARSSTLSAIL ISTGQHKQML EQVLRQFSLQ DKIQHELALM KPNQQLAELT SSAVRAVDGV LRSSKPDAVL VQGDTTTAFI TSLAAFYLKI PVGHIEAGLR TRDIYSPFPE EVNRQCISVM ATYHFAPTEH AAKNLYDEGR RTNVFTTGNT VTEPSDRVIE LSKVVKTVST LRDVRLLLLT AHRRENLGEP ILNIFTSIEK LLQEYPDVVV IYPIHLNPMS DHAPPTTTHL RRLLIVPPLD HADLLFMMKE SFFVMTDSGG IQEEAVTLGK PVLVLRDTTE RPEGVLAGAA KLVGHGAESI YTEAASLLKD PDSYRSMSGS KKTYGDGNAA GNIVAKEKQK ERRGTVPKK
|
| |
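The coordinate and sequence fields above can be cross-checked with a little arithmetic. The record's own figures are consistent with inclusive coordinates: 130561 - 126984 + 1 = 3578 bp. The following is a minimal sketch, not part of the original record; the helper names `span_length`, `clean_sequence`, and `gc_percent` are hypothetical, and it assumes that sequences are printed in whitespace-separated blocks as shown here.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Inclusive length of a feature, assuming start and end are both counted."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Values copied from the record above.
gene_length = span_length(126984, 130561)

# First two blocks of the protein sequence, as an illustration of block joining.
protein_prefix = clean_sequence("MKDIESERFD RTAYSAEIKI")

print(gene_length)          # inclusive span in bp
print(len(protein_prefix))  # residues in the two joined blocks
```

To check the reported GC content, paste the full "Gene sequence" field into `clean_sequence` and pass the result to `gc_percent`. Because the gene is marked as spliced, the 3578 bp span is much longer than the coding portion implied by the 389 aa protein (389 x 3 = 1167 nt, or 1170 nt if a stop codon is counted).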