Gene P9303_10171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_10171 
Symbol 
ID4778314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp926715 
End bp927977 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content45% 
IMG OID640086526 
Productputative proline/betaine transporter, MFS family protein 
Protein accessionYP_001017031 
Protein GI124022724 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.643051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGTAA AGGCCCAAGA GACATCAAAC ATACGTGTGA TACTCGCTGG TCTCGTTGGC 
AACGTGATCG AATGGTATGA CTTCGCTTTG TATGGATACT TTGCCAGCGT TATTGGAAAA
CAGTTCTTTC CCTCTAGTAA TCCTTCAGTC TCTCTAATTG CTGCTTTCGG AGCGTTTGCT
GTCGGCTTTC TAGTTCGCCC TTTCGGAGGA CTTTTGTTCG GACGTATTGC TGATTTGCTG
GGACGAAAAC AGGCGCTTAT CCTTACCTTG CTGGCGATGG CTATCCCAAC AGTGCTGATG
GCCTGTATGC CCAACTACAG CAGGATTGGC ATAGCTGCTC CGATCATAAT CGTTTTGTTG
CGTATTATCC AAGGATTATC AGTTGGCGGC GAGTACACAA CATCAATTGT TTATCTCGTT
GAGAATGCCC CTGATAAACG ACGAGCCTTC TTTGCTATTT GGGGTCTATG GGGAGCAGTA
TTGGGAATCC TCTTGGCTTC TGCCATAGCC AGTTTGCTTG CCAATATTCT TGACCCTCAA
CAGCTAGACA TCTGGGGTTG GAGAGTGCCT TTTGCGCTCG GTTCACTTGT CGCATTAATA
GGACTTTTAA TACGACGTGG TCTTGTAACT GATGTATGTA CTGAAGAGGC AATAGACCCA
GTACAGCAGG TTTTCGGCAA ATACCGTATG CAGGTATTAC GCTTGTTCTT GCTTAATATT
GGTGGCGGTG TTGGCTTCTA TGCAGCTTTT GTGTATGTTG TGAGTTACGT CAAGGAAATA
GATATGGTGC CCGAACGAAT AGCTCTGAAT ATAAATACAG TTTCTATGGC AATACTTTTA
ATACTTTATC CATTAACCGC TTGGCTTTCA GACCGCATTG GACGTAAGCC CTTGTTGATC
GCTGGTGGTG GCATGTTGAT GTTTGGCTCG ATTCCACTTT TTCACTTGAT TCACACCACT
GATCCATTAC GAATTTTCTT TGGGCAGCTC GGATTTGTGA TTGCACTCGC AACTCTTTCA
GGAGGATTAA ATGTCGCGAA TGTGGAGCTT ATGCCTAAGG CGGTTCGCTG TACCGGCCTG
GCCTTTGCCT ACAACACTTC TATGGGGATT TTTGGTGGTA CAACACCATT AATTGCGACC
TGGTTAATTC AGGGAAGTGG AAATCCAATT AGCCCTGCTT ACTGGTTAGC GGGCAGTGCT
TCGATCACTT TATTAACAAG TATCTTTTGG GTTAGAGAAA CGAGACTTTC AAGCCTGTCT
TGA
 
Protein sequence
MIVKAQETSN IRVILAGLVG NVIEWYDFAL YGYFASVIGK QFFPSSNPSV SLIAAFGAFA 
VGFLVRPFGG LLFGRIADLL GRKQALILTL LAMAIPTVLM ACMPNYSRIG IAAPIIIVLL
RIIQGLSVGG EYTTSIVYLV ENAPDKRRAF FAIWGLWGAV LGILLASAIA SLLANILDPQ
QLDIWGWRVP FALGSLVALI GLLIRRGLVT DVCTEEAIDP VQQVFGKYRM QVLRLFLLNI
GGGVGFYAAF VYVVSYVKEI DMVPERIALN INTVSMAILL ILYPLTAWLS DRIGRKPLLI
AGGGMLMFGS IPLFHLIHTT DPLRIFFGQL GFVIALATLS GGLNVANVEL MPKAVRCTGL
AFAYNTSMGI FGGTTPLIAT WLIQGSGNPI SPAYWLAGSA SITLLTSIFW VRETRLSSLS