Gene Information |
Locus tag | OSTLU_32105 |
Symbol | |
ID | 5002563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 281764 |
End bp | 283043 |
Gene Length | 1280 bp |
Protein Length | 370 aa |
Translation table | |
GC content | 68% |
IMG OID | 640417984 |
Product | predicted protein |
Protein accession | XP_001418201 |
Protein GI | 145347499 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0547] Anthranilate phosphoribosyltransferase |
TIGRFAM ID | [TIGR01245] anthranilate phosphoribosyltransferase |
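
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return (protein_length_aa + 1) * 3


# Values copied from the record above.
start_bp, end_bp, protein_length_aa = 281764, 283043, 370

total = gene_length(start_bp, end_bp)        # 1280, matches "Gene Length"
coding = coding_length(protein_length_aa)    # 1113
print(total, coding, total - coding)         # prints: 1280 1113 167
```

Under these assumptions the listed gene span is 167 bp longer than the reading frame implied by the 370 aa protein. Reading that difference as flanking, non-coding sequence is an interpretation, not something the record states.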
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00110753 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0624968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | CCGGGACGCG ACGCGGCCGA CGCGCGAAGA CTGCGCGCGA TCCGAGCGCG ATGCGCGCGT TCAACGCGCC CGCGGCGACG GCGACGCGCG GTGCGGTGAC GACGGCGACG CGACGACGAC GCGCGACGAC GATGGAGGGA CCGAGCGCGA CGATCGCGGC GACGACGATG CGCGCGAGGC GATGGGAACG CGCGCGAGGG TGCGGGAACG GGCGCGCGGG ACGCGCGGTC GTCGCGCGGG CGACGGCGAG GCGCGCGGCG GTGGATCTGC GGAAGGTGAT CGAGGATTTG TGCGAGGGGG CGGATTTATC CGAGGAGGAC GCGCACGCGG CGATGGAGGC GCTGCTGGAC GCGGATCCGA CGCAGATCGC GGCGTTTTTG GTGCTGCTCC GGGCGAAGGG GGAGACGGCG AGCGAGATGG CGGGGTTGGC GCGAGCGATG CAGTCGAGAG CGGTGACGGT GGACGCGGGG GACGACGTGC TGGACATCGT CGGCACGGGG GGGGACGACG CGGGCACGGT GAATATTTCC ACTGGATCGT GCGTGTTGGC CGCCGCGGCG GGAGCGAAGG TGGCGAAACA CGGTTCGCGA TCGGTGTCTT CGCTGTGCGG GTCGGGCGAC GTGCTGGAGG CGCTCGGCGT GGACATCGAG CTGGGACCGG AGAGCATGAA GCGCTGCGTC GAGGAAGTCG GCGTCGGATT TATGTTCGCG CCGAGATATC ATCCGGCGAT GGCGAAGGTC TCGCCCGTGC GCAAGGCGCT CAAGGTGCGA ACGGCGTTTA ACATGTTGGG CCCGATGTTG AACCCGGCGC ACAGCAAGTA CGCGCTCGTC GGCGTGTACA GCACGGGGGT GCAGCAACTC ATGGCGGACT CGTTGATGAA GCTTGGGATG AAGAAGGCGT TGATCGTGCA CTCCATGGGA TTGGACGAAC TCACGCCGGC GGGACCCGCG GACGTCGTCG AGGTGACGCC GAGCGGCACG CGCGCGTACA CGTTCGAGCC GAAGGATGTC GACATTAAGC CGTGCACGCT CGAGGATTTG CGCGGCGGCG ACCCGACGAC AAACGCGAGA ATTTTGCGAG CCGCGTTGGA GGGTGAGAAG GGCCCGGTCG CCGAGACCCT GATTTTGAAT GCCGGCGTCG CTATGGCGGC CGCGCAGCAA GCGAAGGACG TCGTGGAAGG CATCGCCATG GCGAGAGAGG CGCACGAGAG CGGCAAGGCG GGCAAAACGC TCGACTCCTG GATCAAGCTC ACCCAAGAAT TGAGAAAGAC CGAGGCGTAG
Protein sequence | MRARRWERAR GCGNGRAGRA VVARATARRA AVDLRKVIED LCEGADLSEE DAHAAMEALL DADPTQIAAF LVLLRAKGET ASEMAGLARA MQSRAVTVDA GDDVLDIVGT GGDDAGTVNI STGSCVLAAA AGAKVAKHGS RSVSSLCGSG DVLEALGVDI ELGPESMKRC VEEVGVGFMF APRYHPAMAK VSPVRKALKV RTAFNMLGPM LNPAHSKYAL VGVYSTGVQQ LMADSLMKLG MKKALIVHSM GLDELTPAGP ADVVEVTPSG TRAYTFEPKD VDIKPCTLED LRGGDPTTNA RILRAALEGE KGPVAETLIL NAGVAMAAAQ QAKDVVEGIA MAREAHESGK AGKTLDSWIK LTQELRKTEA