Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_10137 |
Symbol | |
ID | 5002636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 299098 |
End bp | 300339 |
Gene Length | 1242 bp |
Protein Length | 414 aa |
Translation table | |
GC content | 66% |
IMG OID | 640418057 |
Product | HAAAP family transporter: tyrosine/tryptophan |
Protein accession | XP_001418666 |
Protein GI | 145348459 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.000465151 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0223874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGCTGTGGA GCAACGTCGA GCGCGGCGAC GATGGGTTGC GACGCCGCGC CGGATCGCTC GGCGCGTCCG TGGCGCTCGT CGCGGGGACG ACGCTCGGCG CGGGCATGCT CGCGCTACCG CTCGTCTTGC GCGACGCGGG CTTCGTGCCG AGCACGGTCG TCATCGTCGC CTGCTGGGTC TTCTTCGCCG CGACCGGGCT GTGCGTGCTG GAAGTAAACC TCGGGACGAT GTGCGAGCTC GGACGGGGCG GCGGGGTGTC GGTGAATGCG ATGTGTCGAA GGACGCTCGG CGACGCCGGC GTAAACGCGG CGACGGCGTC GTTCGCGTTC ATCCACTACG CGTTACTCGT GGCGTACGTG CAAAAGGTCG GGGAGCTCGC GGTTGAGATT TGGCCGCAGG TGCCAGGAGG GGCGAACGCG GCGTCGGTGG CGTACGCGAC GGCGATGTCG ACATTTTTGT ACCTCGCCTC GCCGGCGAAA ATCGAACGGT TCAACTCGGC GCTCTTCGCG GGCGTCGTCG GGACGTTTGT ACCGCTGTTG CTGCTCGCCG CGAGGAGCGA AACGACTTCA GTGGATAATC TGCTCGCCGT GTCGGATTGG AGCGCGGCGC CGGCGACGAT TCCAATCGTC GCCGTGGCGT TCGTGTATCA CCAAGTCGTT CCAGTCGTGG CGACTTCGTT GGAGGGCGAC AGGAAGCGCG CCACGACGGC CGTGCTCGCG GGCACGGCGA TACCGGCGTT GATGTTCATC TTGTGGGACG CCGCCGTGCT CGGAAGCGTC GACGCGGGGG CGATCGATGT CGACCCTATC GGCGCTCTGC AAGCGTCTTC GCCGTTGACG GCGGCGCTCG TTCGTGGGTT CGAATTCTTC GCCGTATCGA CGTCTTTCTT GGGATTCGGA TGGGGTCTCG CGGATTTCCT CGCGGACGGC ATGAAGACGA CTGATATTCA CGATCCGCGG CCGTGGGCCT TGGCGCTCGT GCCTCCCGTG ATTTTCGCGC TCGCGTGCCC GGGCGTCTTC CTAGCCGCGC TCGACAGCGC GGGCGCTTTC GGGGTTTTAG TCGTGTTCGG TATGATTCCA CCCGCGATGG TGTATAGACA CAGGCAAATG CGAGACGAGT GTGCGCTCGA GAACGACCCA GTGGGGTGCT TGCCGGTATT AGATCCGGTG CTTCCGGGCG GTGCGCTCAC GCTAGCGCTC ATGTTCGCGT TCGCAGCGAG CGAGGTCGGT TCGGAGACCA TC
|
Protein sequence | RLWSNVERGD DGLRRRAGSL GASVALVAGT TLGAGMLALP LVLRDAGFVP STVVIVACWV FFAATGLCVL EVNLGTMCEL GRGGGVSVNA MCRRTLGDAG VNAATASFAF IHYALLVAYV QKVGELAVEI WPQVPGGANA ASVAYATAMS TFLYLASPAK IERFNSALFA GVVGTFVPLL LLAARSETTS VDNLLAVSDW SAAPATIPIV AVAFVYHQVV PVVATSLEGD RKRATTAVLA GTAIPALMFI LWDAAVLGSV DAGAIDVDPI GALQASSPLT AALVRGFEFF AVSTSFLGFG WGLADFLADG MKTTDIHDPR PWALALVPPV IFALACPGVF LAALDSAGAF GVLVVFGMIP PAMVYRHRQM RDECALENDP VGCLPVLDPV LPGGALTLAL MFAFAASEVG SETI
|
| |