Gene OSTLU_32754 details


Gene Information

Locus tag: OSTLU_32754
Symbol:
ID: 5002775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 580350
End bp: 581423
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table:
GC content: 57%
IMG OID: 640418196
Product: DMT family transporter: phosphate/phosphoenolpyruvate
Protein accession: XP_001418975
Protein GI: 145349094
COG category:
COG ID:
TIGRFAM ID:
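
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the last codon is a stop codon and is not counted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length(580350, 581423)
print(length, protein_length(length))
```

With the values listed in the record this gives 1074 bp and 357 aa, matching the Gene Length and Protein Length fields.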


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.053836
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGCGC TTAAAGCGCT CGCGCGCGAG ATCGCGGGGC GTCCGAGCTC GCGAGCGCCG 
AGCGCGGAAC TGACGCGAGC GCTGTACACG CTCAAAGCGA CGCTGTACCT GCTCGCGTGG
GGGACGTGCA GCGGGCTGAT CATCTTGGTG AACGACGCGG TGTTGAACAG ATACGACTTC
CCGTACCCGA TCGCCGTCAG CGCGACGGGA CCGCTGCTGT CGTGGATGAT CGCGGCGATT
TTAGTGTTGA CGAATTCGGT GAAATTGGAG CGCACGCTTT CGCTGAAGGA GTGGCTCGTG
ACGGTGTTTC CGATCGGGTT CTTCACCGCG GTGACGTTCG CGGCGGGGAA TCAGTTATAT
TTGTACCTCA GCGTGAGCTT CATACAGATG ATGAAGTCCC TGTCGCCGTG CGTGGTGTTT
TTAATGTTGG TCGTCGTGGG TCTAGACACG GCGACGAAGG AGAAGGTGAT AGCGGTGGGT
ACGATGACTG TCGGCATGGC GGTGGCGTGC GCGACGGAGG AAACGTTTAC GGTGTTGGGG
TTGTCGCTCA TGATCATAGG CGAAGGTGCA GAGGCGATGC GCATGGTGTT GTTTCAACAT
TTCATGGGCA ATCGTGGGTT TGGGTTACTG GAGGGTTTGT TTTACACGTG TCCCGCGAAT
TTCTTCTTTC TCTCCGTCGG CGTGGCGATT TTCGAGCAGC GAGAGATTAC GTTGAGAGGA
GACTTAGCCA TCGTTCGCGC CAACCCTTGG CCGTTCGTCG CGGTGTCAGT GTTAGGGTTT
CTCGTAATGG TGACCACTTT GGGGGTGATC AAGACGTGCG GGTCGCTGAC GTTTAAAGCC
GCCGGTCAAG TTCGCAACGT CGCAATCATC ATGTTTAGCG TCGTCTTCAT GGGCGAGAAG
ACGACGCCCG TGCAGCTCGT CGGATACGCG ATGAACGTCT TGGGATTCGC GTATTATCAA
AAGTATAAAA CAGACGAGGA TGTGAGCAAA ATCACGGCTT CGAGCGACGG CGAAGTCGAG
CGCGAAAAGC TCTTGGACTC GCCGCGTTCG AGCAACGGCT CGGCGGATTT GTGA
 
Protein sequence
MSALKALARE IAGRPSSRAP SAELTRALYT LKATLYLLAW GTCSGLIILV NDAVLNRYDF 
PYPIAVSATG PLLSWMIAAI LVLTNSVKLE RTLSLKEWLV TVFPIGFFTA VTFAAGNQLY
LYLSVSFIQM MKSLSPCVVF LMLVVVGLDT ATKEKVIAVG TMTVGMAVAC ATEETFTVLG
LSLMIIGEGA EAMRMVLFQH FMGNRGFGLL EGLFYTCPAN FFFLSVGVAI FEQREITLRG
DLAIVRANPW PFVAVSVLGF LVMVTTLGVI KTCGSLTFKA AGQVRNVAII MFSVVFMGEK
TTPVQLVGYA MNVLGFAYYQ KYKTDEDVSK ITASSDGEVE REKLLDSPRS SNGSADL
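
The gene and protein sequences are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty, and all function names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1). This is an assumption:
# the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line fragment of the gene sequence as printed in this record.
print(translate("ATGTCGGCGC TTAAAGCGCT CGCGCGCGAG"))
```

To check the whole record, the full gene sequence block can be passed to `gc_content` and `translate`, and the translation compared with `clean_sequence` applied to the protein sequence block.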