Gene Information |
Locus tag | ECH74115_1568 |
Symbol | pepT |
ID | 6972211 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1532992 |
End bp | 1534260 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643385533 |
Product | peptidase T |
Protein accession | YP_002270027 |
Protein GI | 209398016 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2195] Di- and tripeptidases |
TIGRFAM ID | [TIGR01882] peptidase T |
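The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but this one reproduces the listed gene length) and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 1532992
end_bp = 1534260

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS that ends with one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1269 0 422
```

Under these assumptions the result agrees with the Gene Length (1269 bp) and Protein Length (422 aa) fields.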
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00000893895 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCTTCTTA ATAATGTTGT CACAAAAAGT GAGGGTGACT ACATGGATAA ACTACTTGAG CGATTTTTGA ACTACGTGTC TCTGGATACC CAATCAAAAG CAGGGGTGAG ACAGGTTCCC AGCACGGAAG GCCAATGGAA GTTATTGCAT CTGCTGAAAG AGCAGCTCGA AGAGATGGGG CTTATCAATG TGACCTTAAG TGAGAAGGGC ACTTTGATGG CGACGTTACC GGCTAACGTC CCTGGCGATA TCCCGGCGAT TGGCTTTATT TCTCATGTGG ATACCTCACC GGATTGCAGC GGCAAAAATG TGAATCCGCA AATTGTTGAA AACTATCGCG GTGGCGATAT TGCGCTGGGT ATCGGCGATG AAGTTTTATC ACCGGTTATG TTCCCGGTGC TGCATCAGCT ACTGGGTCAG ACGCTGATTA CCACCGATGG TAAAACCTTG TTAGGTGCCG ATGACAAAGC AGGTATTGCA GAAATCATGA CCGCGCTGGC GGTATTGCAA CAGAAAAACA TTCCGCATGG TGATATTCGC GTCGCCTTTA CCCCGGATGA AGAAGTGGGC AAAGGGGCGA AACATTTTGA TGTTGATGCC TTCGATGCCC GCTGGGCTTA CACTGTTGAT GGTGGTGGCG TAGGCGAACT GGAGTTTGAA AACTTCAACG CCGCATCGGT CAATATCAAA ATTGTCGGTA ACAATGTTCA TCCGGGCACG GCGAAAGGAG TGATGGTAAA TGCGCTGTCG CTGGCGGCAC GTATTCATGC GGAAGTTCCG GCGGATGAAA GCCCGGAAAT GACAGAAGGC TATGAAGGTT TCTATCACCT GGCGAGCATG AAAGGCACCG TTGAACGGGC CGATATGCAC TACATCATCC GTGATTTCGA CCGTAAACAG TTTGAAGCGC GTAAACGTAA AATGATGGAG ATCGCCAAAA AAGTGGGCAA AGGGTTACAT CCTGATTGCT ACATTGAATT GGTGATTGAA GACAGTTACT ACAATATGCG CGAGAAAGTG GTTGAGCATC CGCATATTCT CGATATCGCC CAGCAGGCGA TGCGTGACTG CGATATTGAA CCGGAACTGA AACCGATCCG CGGCGGTACC GACGGCGCGC AGTTGTCGTT TATGGGATTA CCGTGCCCGA ACCTGTTCAC TGGCGGTTAC AACTATCATG GTAAGCATGA GTTTGTGACT CTGGAAGGTA TGGAAAAAGC GGTGCAGGTG ATCGTCCGTA TTGCCGAGTT AACGGCGCAA CGGAAGTAA
Protein sequence | MLLNNVVTKS EGDYMDKLLE RFLNYVSLDT QSKAGVRQVP STEGQWKLLH LLKEQLEEMG LINVTLSEKG TLMATLPANV PGDIPAIGFI SHVDTSPDCS GKNVNPQIVE NYRGGDIALG IGDEVLSPVM FPVLHQLLGQ TLITTDGKTL LGADDKAGIA EIMTALAVLQ QKNIPHGDIR VAFTPDEEVG KGAKHFDVDA FDARWAYTVD GGGVGELEFE NFNAASVNIK IVGNNVHPGT AKGVMVNALS LAARIHAEVP ADESPEMTEG YEGFYHLASM KGTVERADMH YIIRDFDRKQ FEARKRKMME IAKKVGKGLH PDCYIELVIE DSYYNMREKV VEHPHILDIA QQAMRDCDIE PELKPIRGGT DGAQLSFMGL PCPNLFTGGY NYHGKHEFVT LEGMEKAVQV IVRIAELTAQ RK
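The relationship between the gene sequence, translation table 11, and the protein sequence can be sketched in plain Python. This is an illustrative example under stated assumptions, not the pipeline that produced the record. The codon table is built from the amino-acid string NCBI publishes for table 11, the initiator codon is read as Met when it is one of the table 11 start codons (the gene here begins with TTG), and the helper names `clean`, `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; as initiators they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
first_30 = "TTGCTTCTTA ATAATGTTGT CACAAAAAGT"
print(translate_cds(first_30))  # MLLNNVVTKS
```

The ten residues printed match the first group of the protein sequence above. Passing the full gene sequence to `translate_cds` and `gc_percent` gives values to compare against the Protein sequence and GC content fields.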