Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1253 |
Symbol | putP |
ID | 6967057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1262403 |
End bp | 1263911 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385244 |
Product | sodium/proline symporter |
Protein accession | YP_002269739 |
Protein GI | 209396363 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.325594 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATTA GCACACCGAT GTTGGTGACA TTTTGTGTCT ATATCTTTGG CATGATATTG ATTGGGTTTA TCGCCTGGCG ATCAACGAAA AACTTTGACG ACTATATTCT GGGCGGACGT AGCCTTGGGC CATTCGTGAC GGCATTATCG GCGGGTGCAT CGGATATGAG CGGCTGGCTG TTAATGGGGT TGCCGGGCGC GGTTTTTCTT TCCGGGATTT CCGAAAGTTG GATCGCCATT GGCCTGACAT TAGGCGCGTG GATTAACTGG AAGCTGGTGG CCGGGCGGTT GCGTGTGCAT ACCGAATACA ACAATAACGC CTTAACATTA CCGGATTATT TCACCGGGCG TTTTGAAGAT AAAAGCCGCA TTTTGCGCAT TATCTCCGCG CTGGTGATTT TGCTGTTCTT CACCATTTAT TGCGCTTCGG GCATTGTGGC AGGCGCGCGT CTGTTTGAAA GTACCTTTGG CATGAGCTAC GAAACGGCTC TGTGGGCGGG AGCTGCGGCG ACGATCCTTT ACACCTTTAT TGGCGGTTTC CTCGCGGTGA GCTGGACTGA CACTGTACAG GCCAGCCTGA TGATTTTTGC CCTGATCCTG ACGCCGGTTA TCGTCATTAT CAGTGTCGGT GGCTTTGGTG ACTCGCTGGA AGTGATCAAA CAAAAGAGCA TCGAAAACGT GGATATGCTC AAAGGGCTGA ACTTTGTCGC CATTATCTCG CTGATGGGGT GGGGGCTGGG TTACTTCGGG CAACCGCACA TCCTGGCGCG TTTTATGGCG GCGGATTCTC ACCACAGCAT CGTCCATGCG CGTCGTATCA GTATGACCTG GATGATCCTC TGCCTGGCAG GGGCGGTGGC TGTCGGCTTC TTTGGTATCG CTTACTTTAA CGAGCACCCG TCGGTAGCTG GTGCGGTAAA CCAGAACGCC GAGCGCGTGT TTATCGAACT GGCGCAAATT CTGTTTAACC CGTGGATTGC CGGGATTCTG CTGTCGGCGA TTCTGGCGGC GGTAATGTCA ACGTTAAGTT GCCAGCTGCT GGTATGTTCC AGTGCGATTA CCGAAGATTT GTACAAAGCG TTTCTGCGTA AACATGCCAG CCAGAAAGAA CTGGTGTGGG TAGGGCGTGT GATGGTGCTG GTGGTGGCGC TGGTGGCGAT TGCGTTGGCA GCGAACCCGG AAAACCGCGT GCTGGGCTTA GTGAGCTACG CGTGGGCAGG CTTTGGCGCG GCGTTTGGTC CGGTGGTGCT GTTTTCGGTG ATGTGGTCAC GCATGACGCG CAACGGTGCG CTGGCGGGGA TGATCATCGG TGCGCTGACG GTGATCGTCT GGAAACAGTT CGGCTGGCTG GGACTGTACG AAATTATTCC GGGCTTTATC TTTGGCAGTA TTGGGATTGT AGTGTTTAGT TTGCTGGGTA AAGCACCGTC AGCGGCGATG CAAAAACGCT TTGCCGAGGC CGATGCGCAC TATCATTCGG CTCCGCCGTC ACGGTTGCAG GAAGGCTAA
|
Protein sequence | MAISTPMLVT FCVYIFGMIL IGFIAWRSTK NFDDYILGGR SLGPFVTALS AGASDMSGWL LMGLPGAVFL SGISESWIAI GLTLGAWINW KLVAGRLRVH TEYNNNALTL PDYFTGRFED KSRILRIISA LVILLFFTIY CASGIVAGAR LFESTFGMSY ETALWAGAAA TILYTFIGGF LAVSWTDTVQ ASLMIFALIL TPVIVIISVG GFGDSLEVIK QKSIENVDML KGLNFVAIIS LMGWGLGYFG QPHILARFMA ADSHHSIVHA RRISMTWMIL CLAGAVAVGF FGIAYFNEHP SVAGAVNQNA ERVFIELAQI LFNPWIAGIL LSAILAAVMS TLSCQLLVCS SAITEDLYKA FLRKHASQKE LVWVGRVMVL VVALVAIALA ANPENRVLGL VSYAWAGFGA AFGPVVLFSV MWSRMTRNGA LAGMIIGALT VIVWKQFGWL GLYEIIPGFI FGSIGIVVFS LLGKAPSAAM QKRFAEADAH YHSAPPSRLQ EG
|
| |