Gene OSTLU_28839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28839 
Symbol 
ID4999465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp386869 
End bp389103 
Gene Length2235 bp 
Protein Length744 aa 
Translation table 
GC content55% 
IMG OID640414886 
ProductAAAP family transporter: amino acid 
Protein accessionXP_001415816 
Protein GI145341438 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.046591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGGCA GCGGCGCGCT CGCGGTGCCG TCGTGCTTCG CGGCGTGCGG ATTGGCGCTG 
ACGACGGCGA CGAGCGCGGG GGCGTGCCTG GCGTGCGCGC TGTCGCTCGA GCAAATGTTG
TACTGCGCGA ACGCGCACGG CGCGTGGTCG TACGAAGATT TGACGAGCGC GACGCTGGGA
CGACGAGGGA GGATCGTCGC GAAGTATAGC ATCGCCGCGC TGTTGGTCGG GGTGGTGGTG
GCGTACGTAA ACATCGTGGG GGATAATTTC TTCACGGCGG CGTCGAGCGG CGTGCTGCCG
GCTGGGGTGG AGGTGAATCG CGAACGCGTC ATGGCGGGGG TGACGTTTGG GGTGTTTTTG
CCGTTGGCGA CGTTCGTGAA GAGTCAAAAA AGACTCGAAA AAGTGGTGGG GTTCGGGTTG
GCGACGACGG GGTTGTTTGG ATTGTGTTTG GTGACGCTCG CGGCGCGACG GTTGGCGCTC
GGCGACGGGA AATCGAGCGC GTCGAAGACG TGGGACGCGA GCGGGCTGGG TCTGGTGCTA
CCGATCATGG TATTTAATTT TGCGGCGCAC GTGATGGTGT TTCCAGCGTT GAAGAGTTCG
GGCACGGCGG TGATGTCGAC CTCGCGCGTC GTCGGCATCG CGAACGAGAC GATGGTGCAT
TTGTTGATCT TTTACATCGT CACCGGCGCG TGCGGATACG TGGCGTTCGG GTCGAGCGTG
AACGGTAACG TTTTGCGCAA CTTTGGTGCG GAAAGTGGAT GGTTTGGGAT TTACACGCAG
TTGGTACGGT TCTTGTACGG GTGCGCTATA TGCACCGGCG TGCCGCTTCT CTTCATTTCC
TTGCGAGAAA TATCGCCCTC GCTCTTGCTG TCGCTCTCGC GCGTGGCGGG AAGACACACC
GAATTAGTGT TTGATGTTCT CGTGCTTTAC AGCTGCTTGC GCGTGTCCAT TTCAGTGCCA
AACATTCAAC ACGTGTTCGG TCTCATCGGT AGCACGACAT GCTCGATACT TACATTCATT
CTTCCTGGGA TGATGTTTCT GCGCACGTGT CCGTCGTCGA GCTCGAAGAA AAGTATGAAT
GATTTCAAAC TACTGCAGCA GCTCGGGAGC GGTGTGCGTC TCAGCGCGCA CGCGCTAATT
CTATTTGGCG TATTTATTGG TATCGCATGT ACACGAAGCA CACTGCAGTC CTTGAAAGAA
GAAGCTGAGG TTGTCGTCAT CATACAACGA CTCGTCGCCG CTCAAACATT CGTACGATCG
AGGGTGCAAG TTTACGACAG GATTCTCACC GCGGCGGCGA AATTCAGAAG AATCGATGCC
GTAGAACGAG TTGTTTCACA ACTTTCCGAA GAGTCGAAAT TAAGCGAGAG CGTGGTGAAA
GATGTGACAT CCACGCTCGC GCTCGCTTCA AGTCAGGAAA CGCGTTCAGC GGCGATGCGC
GAATTCGACT CGCTCAATCC GTTTCGAAAA GATGTCGACG AAGAGGACGT GCGAGAGGCA
AAGGAGAAAT TCGCCAAAGC GAGAGAATCG TACAGAAATG TAAGCGATAC CCTACGCGAG
GTTCGCGAGA CGTTGAGAGA TATAGAAGAG GAAGGGGAAG CGAGCCAGGC TGATGAGAGC
TCCGACGACG TTCACGAGCG CGCAGATGAG GCGCTTGAGA AAGTACGAGC AACGGACGAT
GCGCTCGAAG AAACATCGGA CGTCCTAGAA CTGACCGAAG ACGTCAACAC CGATCTCGAT
GGATTACAGG TTGCGTCTGA ACGGTTAGAA ATGACGGACA CGATCATCGA GCAAACGTTG
GAGGAGGTAC GAGGGGCGAA AAAGTCTGGA GCGGAGGCCG TGTGGCAAGC GGCGACTAAT
GTGGTCGAAG AGACGGAGAA TAAAGATGAA CTCATCAAGA ATCTGAAAGA ACCCATAAAT
ATCGCACCAC GTGCGTGGAA GAAGGGCAAG GGCAAAGACG TCGATAGTCA GGAAGATATC
GTTGAAAAGA TAGTCGCCGC CTCGAACAAG GTGACAGACG AAGAGGCGGA GCGCGCTGTC
GAGACGGTGC TCCAAAACAA CACGCAAGAC GGTGCTTCGA GCCGCGCCGT GCAACGAGCC
AGTGAAATCT TAAAAGAAAT CACCGCAGAA AGTCCGTCGG ATTCTTCGAA GAACCCGACT
AAAGCCGACG AGAACGTCGT ATTTAAAAAG TTTGTCAACG CGACGACGGA AGGGCGCCAA
GTAGAGCAGC AGTAA
 
Protein sequence
MLGSGALAVP SCFAACGLAL TTATSAGACL ACALSLEQML YCANAHGAWS YEDLTSATLG 
RRGRIVAKYS IAALLVGVVV AYVNIVGDNF FTAASSGVLP AGVEVNRERV MAGVTFGVFL
PLATFVKSQK RLEKVVGFGL ATTGLFGLCL VTLAARRLAL GDGKSSASKT WDASGLGLVL
PIMVFNFAAH VMVFPALKSS GTAVMSTSRV VGIANETMVH LLIFYIVTGA CGYVAFGSSV
NGNVLRNFGA ESGWFGIYTQ LVRFLYGCAI CTGVPLLFIS LREISPSLLL SLSRVAGRHT
ELVFDVLVLY SCLRVSISVP NIQHVFGLIG STTCSILTFI LPGMMFLRTC PSSSSKKSMN
DFKLLQQLGS GVRLSAHALI LFGVFIGIAC TRSTLQSLKE EAEVVVIIQR LVAAQTFVRS
RVQVYDRILT AAAKFRRIDA VERVVSQLSE ESKLSESVVK DVTSTLALAS SQETRSAAMR
EFDSLNPFRK DVDEEDVREA KEKFAKARES YRNVSDTLRE VRETLRDIEE EGEASQADES
SDDVHERADE ALEKVRATDD ALEETSDVLE LTEDVNTDLD GLQVASERLE MTDTIIEQTL
EEVRGAKKSG AEAVWQAATN VVEETENKDE LIKNLKEPIN IAPRAWKKGK GKDVDSQEDI
VEKIVAASNK VTDEEAERAV ETVLQNNTQD GASSRAVQRA SEILKEITAE SPSDSSKNPT
KADENVVFKK FVNATTEGRQ VEQQ