Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3785 |
Symbol | |
ID | 6146935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3852287 |
End bp | 3853756 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618611 |
Product | inner membrane transporter YhiP |
Protein accession | YP_001745751 |
Protein GI | 170682598 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACAA CAACACCCAT GGGGATGCTG CAGCAACCTC GCCCATTTTT CATGATCTTT TTTGTCGAGT TATGGGAGCG ATTCGGCTAC TACGGCGTGC AGGGCGTACT GGCTGTTTTC TTCGTTAAGC AGCTTGGATT CTCGCAAGAA CAGGCTTTTG TCACTTTTGG TGCTTTTGCT GCGCTGGTCT ATGGCCTCAT TTCCATTGGC GGCTATGTCG GCGACCACCT GCTGGGGACC AAACGCACCA TTGTTCTCGG AGCACTTGTG CTGGCGATTG GCTACTTCAT GACCGGCCTG TCGCTACTTA AGCCTGACCT GATTTTCATC GCCCTGGGGA CTATCGCTGT CGGTAACGGC CTGTTTAAAG CTAACCCAGC CAGCTTGCTT TCGAAGTGCT ATCCGCCGAA AGATCCGCGG CTTGATGGCG CATTTACCCT GTTCTATATG TCGATCAACA TCGGCTCGTT GATAGCGTTA TCGCTGGCCC CTGTGATCGC TGATAGATTC GGCTATTCAG TCACCTACAA CCTGTGCGGT GCGGGATTAA TTATCGCGCT ATTGGTTTAC ATCGCCTGTC GCGGAATGGT GAAAGATATT GGTTCTGAAC CCGACTTCTG CCCGATGAGC TTCAGTAAAC TGTTGTACGT GTTACTTGGC AGCGTGGTGA TGATCTTCGT CTGTGCATGG CTGATGCACA ACGTAGAAGT CGCCAATCTG GTGCTGATTG TTCTCTCCAT CGTCGTCACC ATCATTTTCT TTCGTCAGGC ATTCAAGCTG GATAAAACTG GGCGCAATAA AATGTTTGTC GCCTTTGTCC TGATGCTCGA AGCGGTGGTG TTTTACATTC TCTACGCCCA GATGCCAACG TCGCTGAACT TCTTTGCCAT CAACAACGTG CATCATGAAA TTCTCGGCTT TTCCATCAAC CCGGTCAGCT TCCAGGCGCT TAACCCGTTC TGGGTGGTAC TCGCCAGCCC AATACTGGCA GGCATTTACA CGCATCTGGG TAACAAAGGC AAAGACCTCT CGATGCCGAT GAAATTTACT CTCGGAATGT TTATGTGCTC GCTGGGCTTT TTGACGGCTG CAGCGGCTGG AATGTGGTTT GCGGATGCCC AGGGGCTGAC ATCGCCATGG TTTATCGTGC TGGTGTACTT ATTCCAGAGC CTGGGTGAAC TGTTTATTAG CGCCCTGGGC CTGGCGATGA TTGCTGCCCT GGTGCCGCAG CATTTGATGG GCTTTATTCT CGGGATGTGG TTCCTGACGC AGGCTGCCGC GTTCTTGCTG GGCGGCTATG TGGCAACATT TACCGCAGTA CCAGACAACA TTACCGATCC GCTTGAGACG TTGCCCGTCT ATACCAACGT GTTTGGTAAG ATTGGCCTGG TTACGCTGGG CGTTGCGGTG GTGATGCTGC TGATGGTGCC GTGGCTGAAA CGCATGATTG CGACACCCGA AAGCCATTAA
|
Protein sequence | MNTTTPMGML QQPRPFFMIF FVELWERFGY YGVQGVLAVF FVKQLGFSQE QAFVTFGAFA ALVYGLISIG GYVGDHLLGT KRTIVLGALV LAIGYFMTGL SLLKPDLIFI ALGTIAVGNG LFKANPASLL SKCYPPKDPR LDGAFTLFYM SINIGSLIAL SLAPVIADRF GYSVTYNLCG AGLIIALLVY IACRGMVKDI GSEPDFCPMS FSKLLYVLLG SVVMIFVCAW LMHNVEVANL VLIVLSIVVT IIFFRQAFKL DKTGRNKMFV AFVLMLEAVV FYILYAQMPT SLNFFAINNV HHEILGFSIN PVSFQALNPF WVVLASPILA GIYTHLGNKG KDLSMPMKFT LGMFMCSLGF LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELFISALG LAMIAALVPQ HLMGFILGMW FLTQAAAFLL GGYVATFTAV PDNITDPLET LPVYTNVFGK IGLVTLGVAV VMLLMVPWLK RMIATPESH
|
| |