Gene Information |
Locus tag | OSTLU_93762 |
Symbol | |
ID | 5005827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 48812 |
End bp | 50116 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | |
GC content | 60% |
IMG OID | 640421248 |
Product | predicted protein |
Protein accession | XP_001421559 |
Protein GI | 145354581 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0221416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGC GGCGTAAACG GTTCCAAGTA ATTGTCAGCG CGTGTTACGG CTCGCGCGGG GACGTGATGC CGTGTTTGAC GATCGCCGCC GCGTTGGCGC GGCGGCTCGA AGGCCAAGCA ACGGTGGTCG CCATGGCGAA TCCAGTGTTT GAGAATGCCG TCGCGGCGAA CGTGCGGTTC ATCGGTGTGG GGAGTCGAGA TGAATACGAG TGCGTGCAAC GCGACCACGC TTCTAGGCGA GACGCGCGCG TGGTGGTGCG GTATTGGTTG GATCATTTGG AGACGCACGC GGAGCGTATT CTTGAGGCGC GCGCAGACGC TGGAAGGACG ATTGTGTTGG CTCACACGCT CGATTTGGCG GTGCGGTGCG TGGACGAGCG CGCTAACGAC GAATGTTTAA CGTGTTACTC CCTGGTGCTC TCGCCCGCGC TGTTGAGAAC GCCAAATCAA ACGATTCCGC CGTTCGCGGG TGAGCGCCTT CGGTGTTTCG GGCGATCGCG TTGGGCGATG CGCGCAGCGG ATTGCCTAGT CGATATCGCG TTCGCACCGC AGCTCAACGC GTTTCGCGCT TCGCTCGGGT CGACAAAACC AGTCAAAAGA GTGTTTGACG AGTGGTTTTT ATGTAAAGCC GGCGTGTTCG CGATGTATCC CGAATATTTC GAGCGAGCCG CAGACGCCGG CTCGAGAAAA GTATTCCAAA TAGACTTTCC CCAAGAAGGC GGCGACGTGA ACGTGACTGG CTTGAATGCG ATTCGCGTCG CTCGAGAGTT CATCGATCGC GATGACGCGC CGACCGTCGT TTTCGTATCA GCCAGTGGAA ATCCACCCTT TGCGTCGCAA TTTTTCGCCA CCGCCGTGAA GGCGATGCGA CGGCTGGCGG GCGCGAAAGC CATCTTGCTG ACGCGTCATC GCGATCGGAT CGGCGAGTTA CCCGAAAACG CGATCCACAT CGACTTTCTC CCTTTGCACC TCTGTCGCGA CGCTGGCTTG AAAATCGCTG CGCTCGTTCA TCACGGGACC ATCGGGTGCT CGGCGACGGC GTTGCGAAGC GCATGGCCGC AAGTCGTCGT TCCAGCGGCG TTCGACCAGC CGTACAACGC CGCGCTCCTC GAAGCGATGG GTGCGGCGCA AATTATCCAC ATGACACGTC TCACGCGCAC GCGGCTTGTG AAAGCGCTTA AAATTGTGTT CAAGGAAGAG GCCCGCGCGC CGGAAACCGC GCCGCGACGT CGAGCGAACG ACGCGTCGTC ACCTCAAGCG CGCGTCGCGG AAATCATCGC TAAGGAATTA TTGCAATACG GCTAA
|
Protein sequence | MRARRKRFQV IVSACYGSRG DVMPCLTIAA ALARRLEGQA TVVAMANPVF ENAVAANVRF IGVGSRDEYE CVQRDHASRR DARVVVRYWL DHLETHAERI LEARADAGRT IVLAHTLDLA VRCVDERAND ECLTCYSLVL SPALLRTPNQ TIPPFAGERL RCFGRSRWAM RAADCLVDIA FAPQLNAFRA SLGSTKPVKR VFDEWFLCKA GVFAMYPEYF ERAADAGSRK VFQIDFPQEG GDVNVTGLNA IRVAREFIDR DDAPTVVFVS ASGNPPFASQ FFATAVKAMR RLAGAKAILL TRHRDRIGEL PENAIHIDFL PLHLCRDAGL KIAALVHHGT IGCSATALRS AWPQVVVPAA FDQPYNAALL EAMGAAQIIH MTRLTRTRLV KALKIVFKEE ARAPETAPRR RANDASSPQA RVAEIIAKEL LQYG
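The coordinates and lengths reported in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of any database API. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon (the gene sequence above ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(48812, 50116)
print(length)                  # 1305
print(protein_length(length))  # 434
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the GC content the record reports.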