Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36053 |
Symbol | |
ID | 5000198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 776334 |
End bp | 777512 |
Gene Length | 1179 bp |
Protein Length | 393 aa |
Translation table | |
GC content | 62% |
IMG OID | 640415619 |
Product | predicted protein |
Protein accession | XP_001416243 |
Protein GI | 145342547 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0524] Sugar kinases, ribokinase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.630596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCACCG TCGACGCCAA ACCCGCCGTT CTCGCGCGTT GGCGCCGCCC ATCGCCTCGA ACGTCGTCGC GCGCCGCGTC ATCGCGCGAT CCGAGCGACG CGCGACCTGC AATTTTGGGT TTCGGCAGCG CCGGCGTGGA CTACATAGCC AAACTCGACG GCGCTTTCCC GACGCCTGAC GCCAAAACGC GCGCGTCCGA GCTCGAAATC GTGGGCGGAG GGAACTGCGC GAACGCGCTC GTCGCCGCTT CGCGCCTTGG CGCGCGCACG GCGCTGGTCT CGAAAGTGGG AACCGACGGG GTGGGGACGC AGATTTTGAC AGAGCTGGGC GAACGCGAAG GCGTCGACGT GTCTCATGTC GTACGACGCG GGAACAGGTC GCCTTTCACG TACATCATGG TCACATCGTC GTCAAACGGA GATGGAGAGT CCACGCGAAC GTGCGTGCAC ACGCCTGGGG AGACGTTGGA GGTGGAAGAG TTGGGCGACG TCGCCGCGCT GCTGGAAGCG GTTCATCCGG ATGTCGTCTT CTTCGATGGA CGACTCACAG AGAGCGCTAT CGCGCTCGCG CGCGTCGCGG AAACGCGTGG AATTAGGGTG CTCGTCGAGT GTGAACGATT GAGAGATGGA CTAGACGAAC TTGTGCGGCT TGCGGACGTC GTGGTGACGT CGAAGAATTA CCCGCTTGAT AGATTTACGG AGACGAAGAC GCTAGGAGAC GCGATGACAG AAATGTTTGC GTGTTTGCCG AAGGCAAAAG TGATGGTGAC GACGCTCGGC GCGCGAGGGG CGGTAGCTTT GGTACGAGAT GGTGTCGAAA CTCCGGAAGT GGGGGAAGGG ACGGCGTTGG ATGACGTCGT GTCGAGATTG GAGAACGCGG CCCTCCGCGG CGATGACGAG ACGCCCGGAC CGAGCGTGGA GACGGAATCC TTGGTAATCC GAGACGCAAG CGGCGAACGA CGTTTCAAGG CAAAAGTTGT GTTCACGCCG GCGAAACGTT TGACGGACAA CCAAGTGGTC GACACCACCG GGGCAGGCGA CGCGTTCATA GGCACGCTCG CAATGTCGGC GTGCTCGGAG GATTTCAACG TCGCCAGCGC GATGCGCCTT GGCGCATACG TTGCGGCGAC GAAATGCGGT GGCATTGGAG CGCGAAGCGC ATTGCCGCAT CGCAAAGAT
|
Protein sequence | MRTVDAKPAV LARWRRPSPR TSSRAASSRD PSDARPAILG FGSAGVDYIA KLDGAFPTPD AKTRASELEI VGGGNCANAL VAASRLGART ALVSKVGTDG VGTQILTELG EREGVDVSHV VRRGNRSPFT YIMVTSSSNG DGESTRTCVH TPGETLEVEE LGDVAALLEA VHPDVVFFDG RLTESAIALA RVAETRGIRV LVECERLRDG LDELVRLADV VVTSKNYPLD RFTETKTLGD AMTEMFACLP KAKVMVTTLG ARGAVALVRD GVETPEVGEG TALDDVVSRL ENAALRGDDE TPGPSVETES LVIRDASGER RFKAKVVFTP AKRLTDNQVV DTTGAGDAFI GTLAMSACSE DFNVASAMRL GAYVAATKCG GIGARSALPH RKD
|
| |