Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_87709 |
Symbol | |
ID | 5002947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 181065 |
End bp | 182645 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418368 |
Product | predicted protein |
Protein accession | XP_001418632 |
Protein GI | 145348391 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGAG AGACGCAACG CGCGGCGTTT GATCGCGCGC TGTCGCTCGG GAACAACGTC GTGCCGCTGT ATCGAAGGAT TTTTGACGAT CAGCTGACGC CCATCCTGGC GTATCGACTG CTCGTCAAGG AGGACGAGCG CGAGGCGCCG AGCTTTTTGC TCGAGTCCGT CGTCGGTGGG ACGCAGATTG GACGCTATTC GTTCCTCGGA CGACGACCGG TGATGGAGGT GACGGCGAAG GATTACGAGG TGACGGTGAC GCGACACGAG GGCGGGGGGG CGGCGAGCGA GACGACGACG GAGCGGGATC CGATGGAGGT GATGAAGCGA ATCGGGGAGA GCTGGCGAGC GTGTAAGACG CCCGGGTTAC CGGATTGCTT CGCGGGTGGA TGGGTCGGGT TCACGGGGTA CGACACGGTG AGATATCAGT ATCAAAGCAA GCTCGGTTTC GAAGGCGCGC CGAAGGACGA TAGGTCGCTG CCGGATATTC ATCTCGGATT GTATAAGGAT GTTGTGGTGT TCGATAACGC CACAAAACAG CTCTATGCGG TACACTGGGT GATGGTTGAC GAATACTCGA GCGCCGACGA GGCGTACACA ACGGGTATGG ACGCTTTGGA CGCGATGATA GACGACTTGC AGCCGTCCAA GTCCCCGCCG ATGAAGCAAG GATACGTCAA CTTGGAGCTA AATCAGCGAC CGAGCGAACC AAAAGATAGC ACGATGACCA AGGATGAGTT TTTGGGGGCG GTAGCGGCGA CGAAGGAGCA TATCAAAGCG GGTGACATCT TCCAACTCGT TCTCAGCCAT AGATTTCAAC GGAAGACGTC CGTGGACCCG TTCGAAGTGT ACCGAGCTTT GCGAGTCGTC AACCCGTCGC CGTACATGAT TTATTACCAG GGTCGAGACT GCATCCTAGT CGCATCGAGC CCTGAAATCT TGTGCCGCGT CGACAAGGCG CGCACGGTGG TGAACCGTCC GCTAGCGGGA ACGCGCATGC GAGGGAAGAC GCCTGAAGAA GACGAGGCGC TCGAGGTGGA TCTCCTAGCG GATGAAAAGG AGCGCGCCGA GCACGTTATG CTCGTCGATC TTGGGCGAAA TGACGTCGGT CGCGTATCCA AGGCTGGTAC GGTCAAAGTC GAGAAGCTCA TGGAAATTGA ACGCTATTCT CACGTGATGC ACATTTCCTC TACTGTGACG GGTAACTTAG TAGACGACCT CGGCCCGTGG GACGTGCTCC GTGCAGCGTT GCCCGCGGGC ACAGTCAGCG GCGCGCCGAA GGTCCGCGCG ATGCAGATTA TCGACAACTT AGAGGTGACG CGTCGCGGTC CGTACGGTGG TGGCATCGGC TACGTAGGCT TCACCGGCGA GATGGACATG GCGCTCGCGT TGCGCACCAT GGTCGTTCCG ACTCGCCAGG CCAAAGAAAC TAACGGTTCG CGCGAGTGGA TGATTCACCT TCAAGCTGGC GCTGGGATCG TCGCGGATTC CAATCCAGAG AGTGAGTATC AAGAGACTGT CAACAAGGCG GCGGCGCTCG GTAGAGCCAT CGACCTCGCG GAGAGCGCGT TTACGGATTG A
|
Protein sequence | VARETQRAAF DRALSLGNNV VPLYRRIFDD QLTPILAYRL LVKEDEREAP SFLLESVVGG TQIGRYSFLG RRPVMEVTAK DYEVTVTRHE GGGAASETTT ERDPMEVMKR IGESWRACKT PGLPDCFAGG WVGFTGYDTV RYQYQSKLGF EGAPKDDRSL PDIHLGLYKD VVVFDNATKQ LYAVHWVMVD EYSSADEAYT TGMDALDAMI DDLQPSKSPP MKQGYVNLEL NQRPSEPKDS TMTKDEFLGA VAATKEHIKA GDIFQLVLSH RFQRKTSVDP FEVYRALRVV NPSPYMIYYQ GRDCILVASS PEILCRVDKA RTVVNRPLAG TRMRGKTPEE DEALEVDLLA DEKERAEHVM LVDLGRNDVG RVSKAGTVKV EKLMEIERYS HVMHISSTVT GNLVDDLGPW DVLRAALPAG TVSGAPKVRA MQIIDNLEVT RRGPYGGGIG YVGFTGEMDM ALALRTMVVP TRQAKETNGS REWMIHLQAG AGIVADSNPE SEYQETVNKA AALGRAIDLA ESAFTD
|
| |