Gene Information |
Locus tag | OSTLU_42458 |
Symbol | |
ID | 5003172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 460002 |
End bp | 461228 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418593 |
Product | HAAAP family transporter: tyrosine |
Protein accession | XP_001419386 |
Protein GI | 145349943 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
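
The start, end, gene length, and protein length fields above are mutually consistent, which a short calculation can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(460002, 461228)
print(length)                  # 1227
print(protein_length(length))  # 408
```

Both values match the "Gene Length" and "Protein Length" rows of the record.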
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0166063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.266354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCTGGG GGGACGGGGC GACGCGCTTC GCGCGGGGCT CCACCGACGA GGCGGAGTCG TGTCCGTTGG AATTGCACGC AGAACGCGCG AGCGACTTCG AAGCCATCAT GCTAGTCGTC GGAACGACCG TCGGGGGTGG ATTCTTGGCG ATGCCGTACT TTGCCGCGCC CGCGGGGTTC GTGCCGGCGG TTTTGATATC GTGCGGCGCG TGGGCGGTGC TCGCGGCGAG CGGACTGTTG GTGTCCGAGA CGCTCATGCA CACGTGGGCG CGGTCGAACG GTCGAGCGGT AAGTTTGTTG AGCGTAACAT CGGATTATCT CGGAAAGAGT TGGGGTAGCA TCGTCGCGGT GTCGTTCTTT GTGATGATGA ACTGCACCCT GGTGAGTCAG CTCGCAAAGT GCGGGTCGTT GGCGAGTTTT TTTAGCGGTG GCGCGGTGTC GCACGTTTTC GGTGCGGCCG TGACCGCGGC GATTGTTGGA TACGTGTCGT TTTCAAACAA AGCGGCCAAG GTGAACGCGT ATGCGACCGT TGGTATTTTT ATGTCGTTTG CGGCGATTTG CGTCTTTGGA GTGACGAATT TGACGCCTAG CAAGCTCACT TTCATGAACT TCGCCGCGGC AGTGCCGGCA CTCCCCGGTT TGGTACAACT ATACACGTAT GGAGAGTGCT TGCCGACGTT AGTTGACATG CTCAGAGGTG ACAGAGAACG TATCAGGCGC GTGATTTTGC TCGGCACCTC GGTCCCCCTC GCAATGTACA CTTGTTGGCT CGTCGTTTCG CTCGCTCAGA GTGGCGCTTG GGCTGGAACC GCGGACTTAG CGCAAGCAAT GTTGGAAAGT GGCGGTATTT TGGGTGGAGC GACCGCTTCT GTCGCGATCG CGGCGTCGAT CTCAACGCTC ATCGGGGGCT ACTTGGCGTT GAGTCGGTTT TGCGCCGACG CCTTGAAGAA AAAGACGGTG AGTCATTCAA AGTCCGTGAT TGCGCTTACC CTGCTTCCCT CGCTACTTTT CGCCTGTAAA GGCCCCGACG TGTACTTTTC CGCGCTCAAA TTCTCTGGAG CTGTGGTCGT CATCATTTTG TGGGGTATTT TGCCTCCGCT GTTGGCAAAA TCTTTATGGA AGCGAGAAGG AGCTTTCACC GAAATCCGAA AGTTCTTCGT TTACGTCTGG ACTACTCTCG CGAGCGTCGC TCTGTGCTTT GGCGTTCGGT CTCTTGTCGT CGCGTGA
Protein sequence | MAWGDGATRF ARGSTDEAES CPLELHAERA SDFEAIMLVV GTTVGGGFLA MPYFAAPAGF VPAVLISCGA WAVLAASGLL VSETLMHTWA RSNGRAVSLL SVTSDYLGKS WGSIVAVSFF VMMNCTLVSQ LAKCGSLASF FSGGAVSHVF GAAVTAAIVG YVSFSNKAAK VNAYATVGIF MSFAAICVFG VTNLTPSKLT FMNFAAAVPA LPGLVQLYTY GECLPTLVDM LRGDRERIRR VILLGTSVPL AMYTCWLVVS LAQSGAWAGT ADLAQAMLES GGILGGATAS VAIAASISTL IGGYLALSRF CADALKKKTV SHSKSVIALT LLPSLLFACK GPDVYFSALK FSGAVVVIIL WGILPPLLAK SLWKREGAFT EIRKFFVYVW TTLASVALCF GVRSLVVA
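
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned, the GC fraction computed, and the nucleotide sequence translated. It assumes the standard genetic code (the "Translation table" field in the record is empty, so this is an assumption) and treats the listed sequence as already being in coding orientation, since it begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard genetic code (assumed), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGGCCTGGG GGGACGGGGC GACGCGCTTC"
print(translate(fragment))  # MAWGDGATRF
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the "GC content" and "Protein sequence" fields to be checked the same way.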