Gene OSTLU_42458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42458 
Symbol 
ID5003172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp460002 
End bp461228 
Gene Length1227 bp 
Protein Length408 aa 
Translation table 
GC content57% 
IMG OID640418593 
ProductHAAAP family transporter: tyrosine 
Protein accessionXP_001419386 
Protein GI145349943 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0166063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.266354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTGGG GGGACGGGGC GACGCGCTTC GCGCGGGGCT CCACCGACGA GGCGGAGTCG 
TGTCCGTTGG AATTGCACGC AGAACGCGCG AGCGACTTCG AAGCCATCAT GCTAGTCGTC
GGAACGACCG TCGGGGGTGG ATTCTTGGCG ATGCCGTACT TTGCCGCGCC CGCGGGGTTC
GTGCCGGCGG TTTTGATATC GTGCGGCGCG TGGGCGGTGC TCGCGGCGAG CGGACTGTTG
GTGTCCGAGA CGCTCATGCA CACGTGGGCG CGGTCGAACG GTCGAGCGGT AAGTTTGTTG
AGCGTAACAT CGGATTATCT CGGAAAGAGT TGGGGTAGCA TCGTCGCGGT GTCGTTCTTT
GTGATGATGA ACTGCACCCT GGTGAGTCAG CTCGCAAAGT GCGGGTCGTT GGCGAGTTTT
TTTAGCGGTG GCGCGGTGTC GCACGTTTTC GGTGCGGCCG TGACCGCGGC GATTGTTGGA
TACGTGTCGT TTTCAAACAA AGCGGCCAAG GTGAACGCGT ATGCGACCGT TGGTATTTTT
ATGTCGTTTG CGGCGATTTG CGTCTTTGGA GTGACGAATT TGACGCCTAG CAAGCTCACT
TTCATGAACT TCGCCGCGGC AGTGCCGGCA CTCCCCGGTT TGGTACAACT ATACACGTAT
GGAGAGTGCT TGCCGACGTT AGTTGACATG CTCAGAGGTG ACAGAGAACG TATCAGGCGC
GTGATTTTGC TCGGCACCTC GGTCCCCCTC GCAATGTACA CTTGTTGGCT CGTCGTTTCG
CTCGCTCAGA GTGGCGCTTG GGCTGGAACC GCGGACTTAG CGCAAGCAAT GTTGGAAAGT
GGCGGTATTT TGGGTGGAGC GACCGCTTCT GTCGCGATCG CGGCGTCGAT CTCAACGCTC
ATCGGGGGCT ACTTGGCGTT GAGTCGGTTT TGCGCCGACG CCTTGAAGAA AAAGACGGTG
AGTCATTCAA AGTCCGTGAT TGCGCTTACC CTGCTTCCCT CGCTACTTTT CGCCTGTAAA
GGCCCCGACG TGTACTTTTC CGCGCTCAAA TTCTCTGGAG CTGTGGTCGT CATCATTTTG
TGGGGTATTT TGCCTCCGCT GTTGGCAAAA TCTTTATGGA AGCGAGAAGG AGCTTTCACC
GAAATCCGAA AGTTCTTCGT TTACGTCTGG ACTACTCTCG CGAGCGTCGC TCTGTGCTTT
GGCGTTCGGT CTCTTGTCGT CGCGTGA
 
Protein sequence
MAWGDGATRF ARGSTDEAES CPLELHAERA SDFEAIMLVV GTTVGGGFLA MPYFAAPAGF 
VPAVLISCGA WAVLAASGLL VSETLMHTWA RSNGRAVSLL SVTSDYLGKS WGSIVAVSFF
VMMNCTLVSQ LAKCGSLASF FSGGAVSHVF GAAVTAAIVG YVSFSNKAAK VNAYATVGIF
MSFAAICVFG VTNLTPSKLT FMNFAAAVPA LPGLVQLYTY GECLPTLVDM LRGDRERIRR
VILLGTSVPL AMYTCWLVVS LAQSGAWAGT ADLAQAMLES GGILGGATAS VAIAASISTL
IGGYLALSRF CADALKKKTV SHSKSVIALT LLPSLLFACK GPDVYFSALK FSGAVVVIIL
WGILPPLLAK SLWKREGAFT EIRKFFVYVW TTLASVALCF GVRSLVVA