Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_10171 |
Symbol | |
ID | 4778314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 926715 |
End bp | 927977 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640086526 |
Product | putative proline/betaine transporter, MFS family protein |
Protein accession | YP_001017031 |
Protein GI | 124022724 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.643051 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGTAA AGGCCCAAGA GACATCAAAC ATACGTGTGA TACTCGCTGG TCTCGTTGGC AACGTGATCG AATGGTATGA CTTCGCTTTG TATGGATACT TTGCCAGCGT TATTGGAAAA CAGTTCTTTC CCTCTAGTAA TCCTTCAGTC TCTCTAATTG CTGCTTTCGG AGCGTTTGCT GTCGGCTTTC TAGTTCGCCC TTTCGGAGGA CTTTTGTTCG GACGTATTGC TGATTTGCTG GGACGAAAAC AGGCGCTTAT CCTTACCTTG CTGGCGATGG CTATCCCAAC AGTGCTGATG GCCTGTATGC CCAACTACAG CAGGATTGGC ATAGCTGCTC CGATCATAAT CGTTTTGTTG CGTATTATCC AAGGATTATC AGTTGGCGGC GAGTACACAA CATCAATTGT TTATCTCGTT GAGAATGCCC CTGATAAACG ACGAGCCTTC TTTGCTATTT GGGGTCTATG GGGAGCAGTA TTGGGAATCC TCTTGGCTTC TGCCATAGCC AGTTTGCTTG CCAATATTCT TGACCCTCAA CAGCTAGACA TCTGGGGTTG GAGAGTGCCT TTTGCGCTCG GTTCACTTGT CGCATTAATA GGACTTTTAA TACGACGTGG TCTTGTAACT GATGTATGTA CTGAAGAGGC AATAGACCCA GTACAGCAGG TTTTCGGCAA ATACCGTATG CAGGTATTAC GCTTGTTCTT GCTTAATATT GGTGGCGGTG TTGGCTTCTA TGCAGCTTTT GTGTATGTTG TGAGTTACGT CAAGGAAATA GATATGGTGC CCGAACGAAT AGCTCTGAAT ATAAATACAG TTTCTATGGC AATACTTTTA ATACTTTATC CATTAACCGC TTGGCTTTCA GACCGCATTG GACGTAAGCC CTTGTTGATC GCTGGTGGTG GCATGTTGAT GTTTGGCTCG ATTCCACTTT TTCACTTGAT TCACACCACT GATCCATTAC GAATTTTCTT TGGGCAGCTC GGATTTGTGA TTGCACTCGC AACTCTTTCA GGAGGATTAA ATGTCGCGAA TGTGGAGCTT ATGCCTAAGG CGGTTCGCTG TACCGGCCTG GCCTTTGCCT ACAACACTTC TATGGGGATT TTTGGTGGTA CAACACCATT AATTGCGACC TGGTTAATTC AGGGAAGTGG AAATCCAATT AGCCCTGCTT ACTGGTTAGC GGGCAGTGCT TCGATCACTT TATTAACAAG TATCTTTTGG GTTAGAGAAA CGAGACTTTC AAGCCTGTCT TGA
|
Protein sequence | MIVKAQETSN IRVILAGLVG NVIEWYDFAL YGYFASVIGK QFFPSSNPSV SLIAAFGAFA VGFLVRPFGG LLFGRIADLL GRKQALILTL LAMAIPTVLM ACMPNYSRIG IAAPIIIVLL RIIQGLSVGG EYTTSIVYLV ENAPDKRRAF FAIWGLWGAV LGILLASAIA SLLANILDPQ QLDIWGWRVP FALGSLVALI GLLIRRGLVT DVCTEEAIDP VQQVFGKYRM QVLRLFLLNI GGGVGFYAAF VYVVSYVKEI DMVPERIALN INTVSMAILL ILYPLTAWLS DRIGRKPLLI AGGGMLMFGS IPLFHLIHTT DPLRIFFGQL GFVIALATLS GGLNVANVEL MPKAVRCTGL AFAYNTSMGI FGGTTPLIAT WLIQGSGNPI SPAYWLAGSA SITLLTSIFW VRETRLSSLS
|
| |