Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_47852 |
Symbol | |
ID | 5006237 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009371 |
Strand | + |
Start bp | 16387 |
End bp | 18116 |
Gene Length | 1730 bp |
Protein Length | 534 aa |
Translation table | |
GC content | 61% |
IMG OID | 640421658 |
Product | HAAAP family transporter: tyrosine |
Protein accession | XP_001422074 |
Protein GI | 145355665 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.00117106 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | TCGACGAGCG AGGGCGAGGA CCGACGATGA CGACGGTACC GACGCGCGCG ACGACGCGCG CGACGACGGT ATCGACGCGC GCGCGAGCGA CGCGCGCGAT GGCGACGACG ACGACGACAG CGTCAACGGC TCGGCACGCG GCGACGGGAG GGCGACGCGC GCGCGACGGA CGCGCGCGAT TAGAGATGAC ACGCGGGGGC GGTGGAGTGA ACGTCAACGA CGCGTCGACG GCGGGCAGGG CGCGCCGACG CGCGCGAGGG GGAAGACGGA CGACGACGAC GACGACGAAC GCGATTAGCG ACGCGACGGA GCCAGAGCTC GCGGCGACGC GCGCGACGAT TCCGATATCG TCGTTCGATG AAGAAGAGCG CGCCGTGGAT GAAGAGAGGG ACGAAGAAAC GCACACCGGA TCGCTCGCGG GCGTCGTGGC GCTCATCGTG GGAAGTACCG TGGGGGCGGG GGTGCTGGCG CTTCCGGCGG CGACGGCGGA AGCCGGCATC TTGCCGGCGA GCGGGGCGTT GATAGGGGTG TGGATCTTAC TCGTGTGCGA CGCCTTGCTG TTGGCCGAGG TGAATGTGGG GATTATGCGA GAGCGAGACG AAGATCGGTT GACGCACGGA CGAGGGCACT CGCCGGTGGT GATTTCTTTG AGCGACATGG CGGAGCGGAC GCTCGGCACC GAAGGCAAAG TTTTCGCGTC GGCGCTGTAC TCATTCATGT CGCTCACCGT GCTCGTAGCG TACATAAGCA AGGGCGCGGA GATTTTAGAT GGCGCGCTCG AAATCGGCCC ATCGCTCGCC GCAGTGATGT TCACCGCCGG TTTAGGGGGT ACGATTTGCT TGGGGGGATC TAAGGTGGCC GACAAGCTGA ACCAAATGTT GACGTACGGT TCCCTCATCG CGTTCGCCGC GTTCGTGGGA AGCGGAGCGT TTTACGCGGA TTGGAGCCAT GCGAATTGGA TGGGCTCGAC CGATGCCGTC CCGTCGACGA TTCCAATCAT TTTCCTCACG CTCGTGTATC ACGACTTGAT ACCCGTCATG TGCGCGTTTT TGCAAGGCGA CATGAAGCAA ATTCGGCGAG CTATTTTAAT CGGTTCATCG ATCCCGCTCG CCATGTTTTT GCTGTGGAAT ACCGTCGCTT TGGCTATGGC TGGAGGTGAT ATCACGGCGG ATCCTCTGAG CATCATATCC GAAGATCTCG GTGGCTCCGC AAGCGTGCTT TTGAGCCTCT TCGGCGTCAG CGCCATCGGC ACGTCCTTCA TCGGCGTCTC TCTCGGTATG TCCGAGTATT TGATGCCATT CATGGAGAAC GCGTTAAGCG GTCTCGACGA ACCGGAAGAA GAGGCTAGAT TGACGCGCGC TTTGTACGAG GCGATGAACA CGTACGAAGA CGAGAAACCA TCTGGCTTGC CGAGCGTCTC GCGCGTGCTC ACGTTCGCCG CCTTCCTCTC CGTCCCCCTC TTCGTCGCGG AACAATGTCC AAACATTTTT CTTCCCGTCA CCAACTTTGT CGGTGCGTAC GGAATGACGA CACTTTACGG TGTTATGCCG CCAATCATGG CGTATACGAT GCGACAAGAG AGGAGAGAGT CTCGGCGAGC CGACCCATTC GCGCCGCTCG CGTTGCGCTT CAAGACAAAC ACACTGCTTC CCGGCGGTCG TCTCACGCTC AGCGCGCTCA GTCTATCCGC CGTCGCCATC GCACTTTCTA AAGTGTACGA GGATATCTCC
|
Protein sequence | MTTVPTRATT RATTVSTRAR ATRAMATTTT TASTARHAAT GGRRARDGRA RLEMTRGGGG VNVNDASTAG RARRRARGGR RTTTTTTNAI SDATEPELAA TRATIPISSF DEEERAVDEE RDEETHTGSL AGVVALIVGS TVGAGVLALP AATAEAGILP ASGALIGVWI LLVCDALLLA EVNVGIMRER DEDRLTHGRG HSPVVISLSD MAERTLGTEG KVFASALYSF MSLTVLVAYI SKGAEILDGA LEIGPSLAAV MFTAGLGGTI CLGGSKVADK LNQMLTYGSL IAFAAFVGSG AFYADWSHAN WMGSTDAVPS TIPIIFLTLV YHDLIPVMCA FLQGDMKQIR RAILIGSSIP LAMFLLWNTV ALAMAGGDIT ADPLSIISED LGGSASVLLS LFGVSAIGTS FIGVSLGMSE YLMPFMENAV SRVLTFAAFL SVPLFVAEQC PNIFLPVTNF VGAYGMTTLY GVMPPIMAYT MRQERRESRR ADPFAPLALR FKTNTLLPGG RLTLSALSLS AVAIALSKVY EDIS
|
| |