Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0433 |
Symbol | proY |
ID | 6144255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 443258 |
End bp | 444634 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615329 |
Product | putative proline-specific permease |
Protein accession | YP_001742536 |
Protein GI | 170680647 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1113] Gamma-aminobutyrate permease and related permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATGGAAA GTAAGAACAA GCTAAAGCGT GGGCTAAGTA CCCGACACAT ACGCTTTATG GCACTGGGTT CAGCAATTGG CACCGGGCTG TTTTACGGTT CGGCAGATGC CATCAAAATG GCCGGTCCAA GCGTGTTGTT GGCCTATATT ATCGGTGGTA TCGCGGCGTA TATCATTATG CGCGCGCTGG GGGAAATGTC GGTACATAAC CCGGCCGCCA GCTCTTTCTC GCGTTATGCG CAGGAAAACC TCGGACCGCT GGCAGGTTAC ATTACCGGCT GGACCTACTG CTTTGAAATC CTAATTGTCG CCATCGCCGA TGTGACCGCT TTTGGTATCT ATATGGGGGT CTGGTTCCCG ACGGTGCCGC ACTGGATTTG GGTACTGAGC GTGGTGCTGA TCATTTGCGC CGTAAACCTG ATGAGCGTGA AGGTATTCGG TGAGCTGGAG TTCTGGTTCT CGTTCTTTAA AGTCGCCACT ATCATCATCA TGATTGTCGC CGGTTTCGGC ATCATCATCT GGGGAATTGG CAACGGCGGG CAACCGACCG GTATTCATAA CCTGTGGAGC AACGGCGGCT TCTTCAGTAA CGGCTGGCTT GGTATGGTGA TGTCGTTGCA AATGGTGATG TTTGCTTACG GTGGGATCGA AATTATCGGG ATTACCGCCG GTGAAGCGAA AGATCCTGAG AAATCGATTC CGCGTGCGAT TAACTCCGTG CCGATGCGTA TTCTGGTGTT CTACGTCGGT ACGCTGTTCG TCATAATGTC TATCTACCCG TGGAATCAGG TTGGCACTGC CGGTAGCCCG TTCGTGCTGA CGTTCCAGCA TATGGGCATT ACCTTTGCCG CCAGCATTCT TAACTTTGTT GTGCTGACCG CTTCGCTGTC GGCAATTAAC AGTGACGTAT TTGGCGTAGG CCGTATGCTC CACGGTATGG CAGAGCAGGG CAGCGCGCCA AAAATTTTCA GCAAAACCTC GCGTCGCGGT ATTCCGTGGG TTACGGTGCT GGTGATGACT ACCGCGCTGC TATTTGCGGT GTATCTGAAC TACATCATGC CGGAAAACGT CTTCCTGGTG ATCGCTTCGC TGGCAACCTT CGCCACGGTG TGGGTGTGGA TTATGATCCT GCTGTCGCAA ATCGCCTTCC GTCGCCGTTT GCCTCCAGAA GAAGTTAAGG CGCTGAAATT TAAAGTGCCG GGTGGGGTAG CAACGACCAT CGGCGGTTTG ATTTTCCTGC TCTTTATTAT CGGGTTGATT GGTTATCACC CGGATACGCG TATCTCGCTG TACGTTGGTT TCGCGTGGAT TGTTGTGCTG TTGATTGGCT GGATGTTTAA ACGCCGCCAC GATCGTCAGC TGGCTGAAAA CCAGTAA
|
Protein sequence | MMESKNKLKR GLSTRHIRFM ALGSAIGTGL FYGSADAIKM AGPSVLLAYI IGGIAAYIIM RALGEMSVHN PAASSFSRYA QENLGPLAGY ITGWTYCFEI LIVAIADVTA FGIYMGVWFP TVPHWIWVLS VVLIICAVNL MSVKVFGELE FWFSFFKVAT IIIMIVAGFG IIIWGIGNGG QPTGIHNLWS NGGFFSNGWL GMVMSLQMVM FAYGGIEIIG ITAGEAKDPE KSIPRAINSV PMRILVFYVG TLFVIMSIYP WNQVGTAGSP FVLTFQHMGI TFAASILNFV VLTASLSAIN SDVFGVGRML HGMAEQGSAP KIFSKTSRRG IPWVTVLVMT TALLFAVYLN YIMPENVFLV IASLATFATV WVWIMILLSQ IAFRRRLPPE EVKALKFKVP GGVATTIGGL IFLLFIIGLI GYHPDTRISL YVGFAWIVVL LIGWMFKRRH DRQLAENQ
|
| |