Gene Information |
Locus tag | EcSMS35_2108 |
Symbol | putP |
ID | 6147531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2115993 |
End bp | 2117501 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616984 |
Product | sodium/proline symporter |
Protein accession | YP_001744159 |
Protein GI | 170682638 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
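
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Hypothetical helpers; the numeric values are taken from the record above.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2115993, 2117501)
print(length, protein_length(length))  # 1509 502
```

Under these assumptions the computed values agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.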
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.275652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATTA GCACACCGAT GTTGGTGACA TTTTGTGTCT ATATCTTTGG CATGATATTG ATTGGGTTTA TCGCCTGGCG ATCAACGAAA AACTTCGACG ACTATATTCT GGGCGGTCGT AGTCTTGGGC CATTCGTGAC GGCATTATCG GCGGGGGCGT CGGATATGAG CGGCTGGCTG TTAATGGGGT TGCCAGGCGC TGTTTTTCTT TCCGGGATTT CCGAAAGCTG GATCGCCATT GGCCTGACAT TAGGCGCGTG GATCAACTGG AAGCTGGTGG CCGGGCGGTT GCGTGTGCAT ACCGAATACA ACAATAACGC CTTAACATTG CCGGATTATT TCACCGGGCG CTTTGAAGAT AAAAGCCGCA TTTTGCGCAT TATCTCCGCG CTGGTTATTT TGCTGTTCTT CACCATTTAT TGCGCTTCGG GCATTGTGGC AGGCGCGCGT CTGTTTGAAA GTACCTTTGG CATGAGCTAC GAAACGGCTC TGTGGGCGGG GGCTGCGGCG ACGATCCTTT ACACCTTTAT TGGCGGTTTC CTCGCGGTGA GCTGGACTGA CACTGTACAG GCCAGCCTGA TGATTTTTGC CCTGATCCTG ACGCCGGTTA TCGTAATCAT TAGCGTTGGT GGGTTTGGCG ACTCTCTGGA AGTGATCAAA CAAAAGAGCA TCGAAAACGT GGATATGCTC AAAGGGCTGA ACTTTGTCGC CATTATCTCA CTGATGGGCT GGGGACTGGG TTACTTCGGG CAGCCGCACA TCCTGGCGCG TTTTATGGCG GCGGATTCTC ACCACAGCAT TGTCCATGCG CGTCGTATCA GTATGACCTG GATGATCCTC TGCCTGGCAG GGGCGGTGGC TGTCGGCTTC TTTGGTATTG CTTACTTTAA TGAGCACCCG GCGGTAGCAG GTGCGGTAAA TCAGAACGCT GAACGCGTGT TTATCGAACT GGCGCAAATT CTGTTTAACC CGTGGATTGC CGGGATTCTG CTGTCGGCGA TTCTGGCGGC GGTAATGTCA ACCTTAAGCT GCCAGCTGCT GGTATGTTCC AGTGCGATTA CCGAAGATTT ATACAAAGCG TTTCTGCGTA AACATGCCAG CCAGAAAGAG CTGGTGTGGG TAGGGCGTGT GATGGTGCTG GTGGTGGCGC TGGTGGCGAT TGCGCTGGCG GCAAACCCGG AAAACCGCGT GCTGGGCTTA GTGAGCTACG CGTGGGCAGG CTTTGGCGCG GCGTTTGGTC CGGTGGTGCT GTTCTCGGTG ATGTGGTCAC GCATGACGCG CAACGGTGCG CTGGCGGGGA TGATCATCGG TGCGCTGACG GTTATCGTCT GGAAACAGTT CGGCTGGCTG GGACTGTACG AAATTATTCC GGGCTTCATC TTCGGCAGTA TCGGGATTGT AGTGTTTAGT TTGCTGGGTA AAGCGCCGTC AGCGGCGATG CAAAAACGCT TTGCCGAGGC CGATGCGCAC TATCATTCGG CTCCGCCGTC ACGGTTGCAG GAAGGGTAA
|
Protein sequence | MAISTPMLVT FCVYIFGMIL IGFIAWRSTK NFDDYILGGR SLGPFVTALS AGASDMSGWL LMGLPGAVFL SGISESWIAI GLTLGAWINW KLVAGRLRVH TEYNNNALTL PDYFTGRFED KSRILRIISA LVILLFFTIY CASGIVAGAR LFESTFGMSY ETALWAGAAA TILYTFIGGF LAVSWTDTVQ ASLMIFALIL TPVIVIISVG GFGDSLEVIK QKSIENVDML KGLNFVAIIS LMGWGLGYFG QPHILARFMA ADSHHSIVHA RRISMTWMIL CLAGAVAVGF FGIAYFNEHP AVAGAVNQNA ERVFIELAQI LFNPWIAGIL LSAILAAVMS TLSCQLLVCS SAITEDLYKA FLRKHASQKE LVWVGRVMVL VVALVAIALA ANPENRVLGL VSYAWAGFGA AFGPVVLFSV MWSRMTRNGA LAGMIIGALT VIVWKQFGWL GLYEIIPGFI FGSIGIVVFS LLGKAPSAAM QKRFAEADAH YHSAPPSRLQ EG
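
The sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch, not the database's own method, of how one might clean a sequence, estimate its GC fraction, and translate it. It assumes the listed gene sequence is already the coding strand (the record's strand is "-", but the sequence starts with ATG and ends with TAA). It uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. All function names are hypothetical.

```python
# Codon table built from the standard NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
prefix = clean_sequence("ATGGCTATTA GCACACCGAT GTTGGTGACA")
print(translate(prefix))  # MAISTPMLVT
```

Passing the full gene sequence through `clean_sequence` and `translate` should give a protein that can be compared with the listed protein sequence, and `gc_content` can be compared with the reported GC content.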