Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3754 |
Symbol | pepB |
ID | 6968488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3479847 |
End bp | 3481130 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643387544 |
Product | aminopeptidase B |
Protein accession | YP_002271997 |
Protein GI | 209396200 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0773649 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAAG CGATGAAGAT TACCCTCTCT ACCCAACCTG CCGACGCGCG CTGGGGAGAA AAAGCAACTT ACAGCATTAA TAATGACGGC ATTACCCTGC ATTTGAACGG GGCAGACGAT CTGGGGCTGA TCCAGCGTGC GGCGCGCAAG ATTGACGGTC TGGGCATCAA GCATGTTCAG TTAAGCGGTG AAGGCTGGGA TGCGGATCGA TGCTGGGCAT TCTGGCAAGG TTACAAAGCC CCGAAAGGCA CGCGTAAAGT GGAGTGGCCG GATCTGGACG ATGCCCAGCG CCAGGAACTG GATAACCGCC TGATGATCAT CGACTGGGTG CGTGACACCA TCAACGCACC GGCAGAAGAA TTGGGACCAT CGCAACTGGC ACAGCGTGCT GTTGATCTGA TCAGCAACGT CGCGAGCGAT CGTGTGACTT ATCGGATCAC CAAAGGCGAA GATCTGCGTG AGCAAGGTTA TATGGGGCTG CACACCGTCG GACGCGGTTC AGAACGTTCT CCGGTATTGC TGGCGCTGGA TTACAACCCA ACTGGCGATA AAGAAGCGCC AGTGTACGCG TGCCTGGTAG GTAAAGGTAT CACTTTTGAC TCCGGCGGCT ACAGCATCAA ACAGACTGCG TTTATGGACT CGATGAAGTC GGACATGGGC GGCGCGGCAA CGGTTACCGG GGCGCTGGCA TTTGCCATTA CGCGCGGACT GAACAAGCGC GTGAAGCTGT TCCTCTGCTG TGCGGATAAC CTGATTAGCG GCAATGCGTT CAAGCTGGGC GATATCATCA CTTATCGCAA CGGTAAAAAA GTTGAAGTGA TGAACACTGA TGCCGAAGGG CGCCTGGTGC TTGCCGATGG TCTGATTGAT GCCAGTGCGC AGAAACCGGA AATGATCATT GATGCGGCGA CCCTCACCGG GGCGGCGAAA ACTGCGCTGG GTAATGATTA TCACGCGCTG TTCAGTTTTG ACGATGCGCT TGCCGGTCGT TTGCTGGCGA GTGCCTCACA AGAGAACGAA CCATTCTGGC GTCTGCCGCT GGCGGAATTC CACCGCAGCC AGCTGCCGTC TAACTTTGCC GAACTGAACA ATACCGGAAG CGCGGCGTAT CCGGCAGGCG CGAGCACGGC AGCGGGCTTC CTGTCGCACT TTGTTGAGAA CTATCAGCAA GGCTGGCTGC ATATCGACTG CTCGGCGACT TACCGTAAAG CGCCGGTTGA ACAGTGGTCT GCGGGTGCTA CGGGACTTGG TGTGCGCACG ATTGCTAATC TGTTAACGGC GTAA
|
Protein sequence | MTEAMKITLS TQPADARWGE KATYSINNDG ITLHLNGADD LGLIQRAARK IDGLGIKHVQ LSGEGWDADR CWAFWQGYKA PKGTRKVEWP DLDDAQRQEL DNRLMIIDWV RDTINAPAEE LGPSQLAQRA VDLISNVASD RVTYRITKGE DLREQGYMGL HTVGRGSERS PVLLALDYNP TGDKEAPVYA CLVGKGITFD SGGYSIKQTA FMDSMKSDMG GAATVTGALA FAITRGLNKR VKLFLCCADN LISGNAFKLG DIITYRNGKK VEVMNTDAEG RLVLADGLID ASAQKPEMII DAATLTGAAK TALGNDYHAL FSFDDALAGR LLASASQENE PFWRLPLAEF HRSQLPSNFA ELNNTGSAAY PAGASTAAGF LSHFVENYQQ GWLHIDCSAT YRKAPVEQWS AGATGLGVRT IANLLTA
|
| |