Gene OSTLU_47852 details

Gene Information

Locus tag: OSTLU_47852
Symbol:
ID: 5006237
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 16387
End bp: 18116
Gene Length: 1730 bp
Protein Length: 534 aa
Translation table:
GC content: 61%
IMG OID: 640421658
Product: HAAAP family transporter: tyrosine
Protein accession: XP_001422074
Protein GI: 145355665
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID:
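
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state explicitly, and all variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 16387
end_bp = 18116
gene_length_bp = 1730
protein_length_aa = 534

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1
print(span_bp == gene_length_bp)  # True

# Each amino acid is encoded by one codon (3 bp), so at least this many
# bases of the gene must be coding sequence.
min_coding_bp = protein_length_aa * 3
non_coding_bp = span_bp - min_coding_bp
print(min_coding_bp, non_coding_bp)  # 1602 128
```

The remainder of 128 bp is consistent with the gene being marked as spliced: it leaves room for intronic or other non-coding sequence within the reported span.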


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00117106
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
TCGACGAGCG AGGGCGAGGA CCGACGATGA CGACGGTACC GACGCGCGCG ACGACGCGCG 
CGACGACGGT ATCGACGCGC GCGCGAGCGA CGCGCGCGAT GGCGACGACG ACGACGACAG
CGTCAACGGC TCGGCACGCG GCGACGGGAG GGCGACGCGC GCGCGACGGA CGCGCGCGAT
TAGAGATGAC ACGCGGGGGC GGTGGAGTGA ACGTCAACGA CGCGTCGACG GCGGGCAGGG
CGCGCCGACG CGCGCGAGGG GGAAGACGGA CGACGACGAC GACGACGAAC GCGATTAGCG
ACGCGACGGA GCCAGAGCTC GCGGCGACGC GCGCGACGAT TCCGATATCG TCGTTCGATG
AAGAAGAGCG CGCCGTGGAT GAAGAGAGGG ACGAAGAAAC GCACACCGGA TCGCTCGCGG
GCGTCGTGGC GCTCATCGTG GGAAGTACCG TGGGGGCGGG GGTGCTGGCG CTTCCGGCGG
CGACGGCGGA AGCCGGCATC TTGCCGGCGA GCGGGGCGTT GATAGGGGTG TGGATCTTAC
TCGTGTGCGA CGCCTTGCTG TTGGCCGAGG TGAATGTGGG GATTATGCGA GAGCGAGACG
AAGATCGGTT GACGCACGGA CGAGGGCACT CGCCGGTGGT GATTTCTTTG AGCGACATGG
CGGAGCGGAC GCTCGGCACC GAAGGCAAAG TTTTCGCGTC GGCGCTGTAC TCATTCATGT
CGCTCACCGT GCTCGTAGCG TACATAAGCA AGGGCGCGGA GATTTTAGAT GGCGCGCTCG
AAATCGGCCC ATCGCTCGCC GCAGTGATGT TCACCGCCGG TTTAGGGGGT ACGATTTGCT
TGGGGGGATC TAAGGTGGCC GACAAGCTGA ACCAAATGTT GACGTACGGT TCCCTCATCG
CGTTCGCCGC GTTCGTGGGA AGCGGAGCGT TTTACGCGGA TTGGAGCCAT GCGAATTGGA
TGGGCTCGAC CGATGCCGTC CCGTCGACGA TTCCAATCAT TTTCCTCACG CTCGTGTATC
ACGACTTGAT ACCCGTCATG TGCGCGTTTT TGCAAGGCGA CATGAAGCAA ATTCGGCGAG
CTATTTTAAT CGGTTCATCG ATCCCGCTCG CCATGTTTTT GCTGTGGAAT ACCGTCGCTT
TGGCTATGGC TGGAGGTGAT ATCACGGCGG ATCCTCTGAG CATCATATCC GAAGATCTCG
GTGGCTCCGC AAGCGTGCTT TTGAGCCTCT TCGGCGTCAG CGCCATCGGC ACGTCCTTCA
TCGGCGTCTC TCTCGGTATG TCCGAGTATT TGATGCCATT CATGGAGAAC GCGTTAAGCG
GTCTCGACGA ACCGGAAGAA GAGGCTAGAT TGACGCGCGC TTTGTACGAG GCGATGAACA
CGTACGAAGA CGAGAAACCA TCTGGCTTGC CGAGCGTCTC GCGCGTGCTC ACGTTCGCCG
CCTTCCTCTC CGTCCCCCTC TTCGTCGCGG AACAATGTCC AAACATTTTT CTTCCCGTCA
CCAACTTTGT CGGTGCGTAC GGAATGACGA CACTTTACGG TGTTATGCCG CCAATCATGG
CGTATACGAT GCGACAAGAG AGGAGAGAGT CTCGGCGAGC CGACCCATTC GCGCCGCTCG
CGTTGCGCTT CAAGACAAAC ACACTGCTTC CCGGCGGTCG TCTCACGCTC AGCGCGCTCA
GTCTATCCGC CGTCGCCATC GCACTTTCTA AAGTGTACGA GGATATCTCC
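
The gene sequence above is displayed in blocks of 10 bases, 60 bases per full line. The following is a minimal Python sketch of how the spacing could be stripped and the GC fraction recomputed for comparison with the reported GC content. The function names are hypothetical, and the database may compute its GC figure over a different span than the displayed sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "TCGACGAGCG AGGGCGAGGA CCGACGATGA CGACGGTACC GACGCGCGCG ACGACGCGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on any whitespace, including newlines.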
 
Protein sequence
MTTVPTRATT RATTVSTRAR ATRAMATTTT TASTARHAAT GGRRARDGRA RLEMTRGGGG 
VNVNDASTAG RARRRARGGR RTTTTTTNAI SDATEPELAA TRATIPISSF DEEERAVDEE
RDEETHTGSL AGVVALIVGS TVGAGVLALP AATAEAGILP ASGALIGVWI LLVCDALLLA
EVNVGIMRER DEDRLTHGRG HSPVVISLSD MAERTLGTEG KVFASALYSF MSLTVLVAYI
SKGAEILDGA LEIGPSLAAV MFTAGLGGTI CLGGSKVADK LNQMLTYGSL IAFAAFVGSG
AFYADWSHAN WMGSTDAVPS TIPIIFLTLV YHDLIPVMCA FLQGDMKQIR RAILIGSSIP
LAMFLLWNTV ALAMAGGDIT ADPLSIISED LGGSASVLLS LFGVSAIGTS FIGVSLGMSE
YLMPFMENAV SRVLTFAAFL SVPLFVAEQC PNIFLPVTNF VGAYGMTTLY GVMPPIMAYT
MRQERRESRR ADPFAPLALR FKTNTLLPGG RLTLSALSLS AVAIALSKVY EDIS
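
The start of the protein sequence can be related back to the gene sequence by translating from the first ATG codon. The following is a minimal Python sketch under two assumptions: the standard genetic code (NCBI translation table 1) is used, since the Translation table field of this record is empty, and only the first displayed line of the gene sequence is translated. Because the gene is marked as spliced, translating the full displayed sequence in one frame would not be expected to reproduce the whole protein. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First displayed line of the gene sequence, with block spacing removed.
first_line = "TCGACGAGCG AGGGCGAGGA CCGACGATGA CGACGGTACC GACGCGCGCG ACGACGCGCG"
seq = "".join(first_line.split())

start = seq.find("ATG")  # 0-based index of the first ATG
peptide = translate(seq[start:])
print(start, peptide)  # 26 MTTVPTRATTR
```

The translated residues match the beginning of the protein sequence shown above (MTTVPTRATT R...).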