Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4351 |
Symbol | proP |
ID | 5594377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4354523 |
End bp | 4356025 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923449 |
Product | proline/glycine betaine transporter |
Protein accession | YP_001460894 |
Protein GI | 157163576 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00883] metabolite-proton symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAA GGAAAAAAGT AAAACCGATT ACCCTTCGTG ATGTCACCAT TATTGATGAC GGTAAACTGC GTAAAGCCAT TACCGCAGCA TCACTGGGTA ATGCAATGGA ATGGTTTGAT TTTGGTGTTT ATGGTTTTGT TGCTTACGCA TTAGGTAAAG TTTTTTTCCC GGGGGCTGAC CCCAGCGTGC AGATGGTTGC TGCACTTGCC ACGTTCTCCG TTCCCTTTCT GATTCGACCG CTTGGCGGAC TCTTCTTTGG TATGTTGGGC GATAAATATG GTCGCCAGAA GATCCTCGCT ATCACTATTG TGATTATGTC GATCAGTACG TTCTGTATTG GCTTAATACC GTCCTACGAC ACGATTGGTA TTTGGGCACC GATTCTGCTG TTGATCTGTA AGATGGCACA AGGTTTCTCG GTCGGCGGTG AATATACCGG GGCGTCGATA TTTGTTGCGG AATACTCCCC TGACCGTAAA CGTGGCTTTA TGGGCAGCTG GCTGGACTTT GGTTCTATTG CCGGGTTTGT GCTGGGTGCT GGCGTAGTGG TGTTAATTTC GACCATTGTC GGCGAAGCGA ACTTCCTCGA CTGGGGCTGG CGTATTCCGT TCTTTATTGC TCTGCCGTTA GGGATTATCG GGCTTTACCT GCGCCATGCG CTGGAAGAAA CTCCGGCGTT CCAGCAGCAT GTCGATAAAC TGGAACAGGG CGACCGCGAA GGTTTGCAGG ATGGCCCGAA AGTCTCGTTT AAAGAGATTG CCACCAAACA CTGGCGCAGC CTGTTGACAT GTATTGGTCT GGTAATTGCC ACCAACGTGA CTTACTACAT GTTGCTGACC TATATGCCGA GTTATTTGTC GCATAACCTG CATTACTCCG AAGACCACGG GGTGCTGATT ATTATCGCCA TTATGATCGG TATGCTGTTT GTCCAGCCGG TGATGGGCTT GCTGAGTGAC CGTTTTGGCC GTCGTCCGTT TGTGCTACTT GGTAGTGTTG CCCTGTTTGT GTTGGCGATC CCGGCGTTTA TTCTGATTAA CAGTAACGTC ATCGGCCTGA TTTTTGCCGG GTTACTGATG CTGGCGGTGA TCCTTAACTG CTTTACGGGC GTTATGGCTT CTACCTTGCC AGCGATGTTC CCGACGCATA TCCGTTACAG CGCGCTGGCG GCGGCATTTA ATATTTCGGT GCTGGTTGCC GGTCTGACGC CAACACTGGC GGCCTGGCTG GTCGAAAGCT CGCAGAATCT GATGATGCCT GCCTATTACC TGATGGTAGT GGCGGTGATT GGTTTAATCA CCGGCGTAAC CATGAAAGAG ACGGCAAATC GTCCGTTGAA AGGTGCGACA CCGGCGGCGT CAGATATACA GGAAGCGAAG GAAATTCTCG TCGAGCATTA CGATAATATC GAGCAGAAAA TCGATAATAT TGACCACGAG ATTGCCGATT TGCAGGCGAA ACGTACCCGC CTGGTGCAGC AACATCCGCG AATTGATGAA TAA
|
Protein sequence | MLKRKKVKPI TLRDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD PSVQMVAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYD TIGIWAPILL LICKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA GVVVLISTIV GEANFLDWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE GLQDGPKVSF KEIATKHWRS LLTCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI IIAIMIGMLF VQPVMGLLSD RFGRRPFVLL GSVALFVLAI PAFILINSNV IGLIFAGLLM LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLVA GLTPTLAAWL VESSQNLMMP AYYLMVVAVI GLITGVTMKE TANRPLKGAT PAASDIQEAK EILVEHYDNI EQKIDNIDHE IADLQAKRTR LVQQHPRIDE
|
| |