Gene Information |
Locus tag | OSTLU_16284 |
Symbol | |
ID | 5002704 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 722211 |
End bp | 724474 |
Gene Length | 2264 bp |
Protein Length | 679 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418125 |
Product | predicted protein |
Protein accession | XP_001418785 |
Protein GI | 145348705 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
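
The coordinate and length fields above can be cross-checked with simple arithmetic. What follows is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the spliced CDS ends in a single stop codon so that any remaining bases are non-coding (intronic). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def non_coding_bases(gene_len: int, protein_len: int) -> int:
    """Bases left over once codons for the protein plus one stop codon
    are subtracted from the gene length (assumes a single stop codon)."""
    coding = (protein_len + 1) * 3
    return gene_len - coding


if __name__ == "__main__":
    # Values copied from the record for OSTLU_16284.
    length = gene_length(722211, 724474)
    print(length)                         # 2264, matching "Gene Length"
    print(non_coding_bases(length, 679))  # 224 bases not accounted for by codons
```

Under these assumptions the 224 leftover bases are consistent with the record marking the gene as spliced.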
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCA AGGTCGTCAC GCGCGCCGCC GTCGCCGCGC CGCCGGGCGT CTCCGCGGAC ACGGTGAACG ACGCGATCAA CACCGTGCGA TTTTTGGCGA TCGACGCGAT CAACAAGTCC AACTCCGGTC ACCCCGGCTT GCCGATGGGC TGCGCGCCGA TGGGCTACGT GATTTTCCGC GAAGCGATGA CGCACAACCC GAAGAACACG AAGTGGTTCA ACCGCGATCG ATTCGTGCTG TCCGCGGGAC ACGGGTGCAT GCTGCAGTAC TCTTTGATGC ATTTGACCGG TTACCCGAGC GTTTCGGTGC GTTCGTCGCG ACGCGCGCGC GCGGGCGACG CGATTCTCGA ACGCGCGCGC CGATCGTGTG GATAAAGCCT CAAGTTGGGG TTTTGGATAA GAGCCCGCGC GGACGCGCGA AGGATGGACG CGAACGCGCG AGAGCGTCGC TCTGGCGGAC GCGCGAGCGA TGATGGAGAT AAAACTCGAT GACTGACGAT GAATGCGCGA TGCACGCTCG ACGCGCGTAG ATTGAAGACA TCAAGCAGTT CCGTCAGTGG GACTCCAAGA CCCCGGGTCA CCCGGAGAAC TTCATCACCG ACGGCATCGA AGTCACCACG GGCCCGCTCG GTATGGGTAT TTGCAACGCC GTCGGTCTCG CGATGGTCGA GAAGCACCTC GCCGGTCGAT TCAACAAGCC GGATTGCGAA ATCGTCGATC ACTACACGTA CTGCGTCATG GGCGACGGCT GCAACATGGA GGGCATGTCC GGCGAAGGCG CGTCCCTCGC CGGTCACTGG GGCCTCGGCA AGTTGATTGT CTTCTATGAC GACAACCACA TCTCCATCGA TGGTCACACC GACATCTCCT TCACGGAAGA CGTCGTCGCG CGCTTCAACG CGTACGGCTG GCACACGCAA CACGTCGAGA ACGGTAACAC CGACGTCGAC TCGATTCGCG CCGCCGTCAA CGCCGCCAAG GCTGACCCGC GCCCGTCGCT CATCAAGGTG ACCACCTTGA TTGGCTACGG TTCCCCGAAC AAGTCCAACA CCCACGACGT GCACGGCGCG CCGCTCGGTA AGGATGAAAC CGCCGCGACT CGCGAAAACT TGAAGTGGAA GTACGGCGAA TTCGAAGTTC CGGAAGCCGT CAAGGCGTAC ATGGATTGCT CCGAAAAGGG CACCGCCGCC GAAGCCGAAT GGAACGCCAA GTGGGCCACC TACAAGAGCA AGTACGCCGA AGATGCGGCC GAGCTCGAGT CCATCATGTC CGGCAAGTTG CCGTCTGGCT GGGAAAAGTC TCTCCCGACG TTCACGCCGG AGGACAAGGG TGTCGCCACG CGCATCCACT CCCAAACCAT GCTCAACGCC CTCGGTGGCG CGATCCCGGG TTTCATGGGT GGTTCCGCCG ATCTTGCGCC GTCCAACATG ACGTTGATGA AGCAATTCGG TGACTTCCAA AAGGACACCC CGGCCGAGCG CAATGTTCGC TTCGGCGTCC GCGAGCACGG TATGGGTGCC ATCGCCAACG GCATGAAGCT CCACTCCCCG GGTATCATTC CGTACTGCGC GACCTTCTTC ATCTTCACCG ACTACATGCG TTGCGCCATG CGCATCGCCG CGCTCTCCCA AGCCGGTACC ATCTTCGTCA TGACGCACGA TTCCATCGGC GTCGGCGAGG ACGGCCCGAC TCACCAGCCG ATTGAGCACG TCGCGTCCTT CCGCGCCATG CCGGGTATGG ACATGGCTCG CCCGGCCGAC GGCAACGAGA CCGCCGCCAT GTACAAGATG GCTGTCGAAA 
ACTCGATGAA CGGTGCCCCG ACTACGTTGG CGCTCTCCCG CCAAGTCGTG CCGAACCTTG CGGGTACCTC CATGGAAGGC GCCGCCAAGG GTGCGTACGT CGTCCAAGGC GCCGCCGCTG GCGAAGCGTG CGACGTGATC CTCATCGGTA CCGGTACCGA ACTCGAACTC GCGTGCCAAG CTGGTGCCGA GCTCGAATCT AAGGGTAAGA AGGTTCGCGT TGTCTCCATG CCGTGCTGGG AAGCCTTCGA GCGCCAACCG GCCGCCTACC AAGAGTCCGT CTTGCCGGCT GCCATGCGCG CGAAGACCGT CTCCATCGAG GCGGGCACCA CGTTCGGCTG GGCTAAGTAC GCCGGCGCTT CCATCGGACA TGACGATTTC GGCGCTTCGG CACCGGCCCC GATTCTGTAC AAGCAGTTCG GAATCACTGC CGATGCCATG GCCGCGAAGG CGATGTCGTT GTAA
|
Protein sequence | MSAKVVTRAA VAAPPGVSAD TVNDAINTVR FLAIDAINKS NSGHPGLPMG CAPMGYVIFR EAMTHNPKNT KWFNRDRFVL SAGHGCMLQY SLMHLTGYPS VSIEDIKQFR QWDSKTPGHP ENFITDGIEV TTGPLGMGIC NAVGLAMVEK HLAGRFNKPD CEIVDHYTYC VMGDGCNMEG MSGEGASLAG HWGLGKLIVF YDDNHISIDG HTDISFTEDV VARFNAYGWH TQHVENGNTD VDSIRAAVNA AKADPRPSLI KVTTLIGYGS PNKSNTHDVH GAPLGKDETA ATRENLKWKY GEFEVPEAVK AYMDCSEKGT AAEAEWNAKW ATYKSKYAED AAELESIMSG KLPSGWEKSL PTFTPEDKGV ATRIHSQTML NALGGAIPGF MGGSADLAPS NMTLMKQFGD FQKDTPAERN VRFGVREHGM GAIANGMKLH SPGIIPYCAT FFIFTDYMRC AMRIAALSQA GTIFVMTHDS IGVGEDGPTH QPIEHVASFR AMPGMDMARP ADGNETAAMY KMAVENSMNG APTTLALSRQ VVPNLAGTSM EGAAKGAYVV QGAAAGEACD VILIGTGTEL ELACQAGAEL ESKGKKVRVV SMPCWEAFER QPAAYQESVL PAAMRAKTVS IEAGTTFGWA KYAGASIGHD DFGASAPAPI LYKQFGITAD AMAAKAMSL
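
Both sequences are displayed in space-separated blocks of ten characters. A minimal sketch for collapsing such blocks into a continuous string and computing a GC fraction is shown here; the helper names are hypothetical, and the inputs are only the first two blocks of each sequence in this record, so the GC value printed applies to that excerpt and not to the whole gene.

```python
def collapse_blocks(blocked: str) -> str:
    """Remove all whitespace from a sequence shown in blocks of ten."""
    return "".join(blocked.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First two blocks of the protein and gene sequences from the record.
    protein_excerpt = collapse_blocks("MSAKVVTRAA VAAPPGVSAD")
    dna_excerpt = collapse_blocks("ATGAGCGCCA AGGTCGTCAC")
    print(len(protein_excerpt))      # 20 residues
    print(gc_fraction(dna_excerpt))  # GC fraction of the 20-base excerpt
```

Applied to the full gene sequence, the same function could be compared against the "GC content" field.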