Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31870 |
Symbol | |
ID | 5001792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 672390 |
End bp | 674523 |
Gene Length | 2134 bp |
Protein Length | 686 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417213 |
Product | predicted protein |
Protein accession | XP_001417840 |
Protein GI | 145346737 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains |
TIGRFAM ID | [TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.691414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.373209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGCGACGCGA CGCGCGCGCG CGCGCGACGA CGACGCGAGC TCGATCGCGG ATCGCGTCGC GATGTGCGGG ATCTTCGCGT ACAGTAATTG GAACTGCCCG AAGAGCCAGA AGGAGATCGT CGAGAAGCTG CTCACGGGAC TGAAGCGATT GGAGTACCGC GGATACGACA GCGCGGGACT GGCGCTCGAG GACGGGGAGG ACGTGTCGCG GACGACGGCG AAGGTGTTTC GCGAGACGGG GAAGATCGCG AACCTGGAGG GGTTGCTGGA GGCGAGTGAG AAGGATCTGC ACGGGGATTT GGTGTTCGAG TCGCACTGCG GCATCGCGCA CACGAGATGG GCGACGCACG GACCGCCGGC GCCGAAGAAT TCGCACCCGC ACACGAGCGA TGAGGAGAAT GATTTTTTGG TGGTGCATAA CGGGATCATA ACGAATCATC AGGCGCTCAG GGAGACGTTG CAGCGGAAGG GGTACATGTT TGAGAGCGAT ACGGATACCG AGGTCATTCC AAAGTTGACA AAGTATTTAT TCGATAAATT TCACGATAAG TGCTCGTTCA GACAGCTGGT GATGGAGGTG TTGAGACAGT TGCACGGGGC GTACGCGCTG GCGTTTAAAT CGAGGCATTA CCCGGGGGAG TTGGTGGCGG CGAAGCGCGG GTCGCCGTTG CTCTTGGGCA TCGCCGAGGG ACCGCATCCG GGAGAGCAGC ACGCGTTGGT GACGAGCGAA GGCTTCGTGC CGACGTCTAA GCGCGCGAAG CGGACGTCGA TGGAATTTTA CTTCGCTTCC GACGCATCGG CCATGGTAGA GCACACTAAG CGCGTGTTGC ACCTGGAGGA CGACGACGTG GCGCACATTC ACAACGGAGG GTACGGCATC TATCGCATGG AGAAGATTCA CACCGAGGGC GAAGATTCGC CGAGTTTGGC GTATGCTCCG ACGGTGAAAT CCGCTGAAGT CGAGCGTACG ATTGAGACGC TGACTATGGA GGTTGAGCAA ATCATGAAGG GAAACTTTGA TCACTTTATG AAGAAGGAAA TTCACGAACA ACCGGACGCG ATTCAGCAGA CGATGCGCGG TCGCGTCGTC TTCGACGCCG ACGGAAACGT GCAACGCGTG TTCCTCGGTG GCATGGTTGA TTACTTGTCC ACCATTCGAC GGTCACGTAG AATAATCTTG TGTGGATGCG GGACGAGTTA TAACAGCGCC ATCGCTGTTC GTCAGCTCAT GGAAGAACTG ACCGAGTTGC CGGTGACGCT CGAGCTCGCC TCGGACGTCC TAGATCGTCA GTGCCCGTTC TTCCGCGATG ATTCCATTAT TTTCATCTCG CAATCCGGTG AAACCGCGGA TACTTTGCGC GCTCTCGAGT ACGCGAAGTC CAAGGGGGCG TTGTGCATCG GGATCGTCAA CGTAGTCGGT TCGGCGATTT CCCGCGCCAC CGATTGCGGT CTCCACATCA ACGCCGGCGC CGAAATCGGC GTTGCCTCCA CCAAGGCTTA CACGTGCCAA ATCACCTCCA TGGTGCTCCT CGCCTTGGCT CTCAGCGAAG ATTCTCGCTC TCGCGCTGAT CGCCGCATGG ACATCATGCG CGGCGTCGTC ACATTGCCAG ACACCATGCG CCGTGCGCTC GAGCTCGATC AGAAAATGCT CGCGCTCGCC CGCACTCTCG TGGACGAGAA CTCTTTGCTG TTATTCGGTC GTGGTTACAA CTACGCCACC GCCCTCGAAG GCGCCCTGAA GGTGAAAGAA GTCGCCCTTC TTCACTCTGA AGGCATCTTG GCGGGTGAGA TGAAACACGG TCCATTGGCG TTGGTCGACG AGACCCTTCC TTTGGTCGTC ATCGCCACGC GCGATTCCTC CTACCTCAAG CAAAAGTCCG TCATCGAGCA GCTTCGCGCT CGCGACGCGC GCTGCATCTT GATCGTCAGC GAAGACGATG ATTCTTTGGA CAAATTCGCC TCGAACGAAG ACATGATCAT CAAGGTTCCC GAGGTGTGCG ACTGCTTGCA ACCTTTGATC AACATCGTCC CCTTGCAGTT GCTCTCGTAT CACCTCACCG TCTTGCGCGG GCACAACGTC GATCAACCGC GCAACCTCGC GAAATCGGTG ACGGTAGAAT AGACTGCACC AGGC
|
Protein sequence | MCGIFAYSNW NCPKSQKEIV EKLLTGLKRL EYRGYDSAGL ALEDGEDVSR TTAKVFRETG KIANLEGLLE ASEKDLHGDL VFESHCGIAH TRWATHGPPA PKNSHPHTSD EENDFLVVHN GIITNHQALR ETLQRKGYMF ESDTDTEVIP KLTKYLFDKF HDKCSFRQLV MEVLRQLHGA YALAFKSRHY PGELVAAKRG SPLLLGIAEG PHPGEQHALV TSEGFVPTSK RAKRTSMEFY FASDASAMVE HTKRVLHLED DDVAHIHNGG YGIYRMEKIH TEGEDSPSLA YAPTVKSAEV ERTIETLTME VEQIMKGNFD HFMKKEIHEQ PDAIQQTMRG RVVFDADGNV QRVFLGGMVD YLSTIRRSRR IILCGCGTSY NSAIAVRQLM EELTELPVTL ELASDVLDRQ CPFFRDDSII FISQSGETAD TLRALEYAKS KGALCIGIVN VVGSAISRAT DCGLHINAGA EIGVASTKAY TCQITSMVLL ALALSEDSRS RADRRMDIMR GVVTLPDTMR RALELDQKML ALARTLVDEN SLLLFGRGYN YATALEGALK VKEVALLHSE GILAGEMKHG PLALVDETLP LVVIATRDSS YLKQKSVIEQ LRARDARCIL IVSEDDDSLD KFASNEDMII KVPEVCDCLQ PLINIVPLQL LSYHLTVLRG HNVDQPRNLA KSVTVE
|
| |