Gene Information |
Locus tag | EcHS_A2674 |
Symbol | pepB |
ID | 5591590 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2692714 |
End bp | 2693997 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640921792 |
Product | aminopeptidase B |
Protein accession | YP_001459316 |
Protein GI | 157161998 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
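The coordinate and length fields above agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2692714, 2693997)
print(length)                     # 1284
print(protein_length_aa(length))  # 427
```

Both results match the Gene Length and Protein Length rows. The strand value does not affect the length, only the direction in which the sequence is read.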
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGAAG CGATGAAAAT TACCCTCTCT ACCCAACCTG CCGACGCGCG CTGGGGAGAA AAAGCAACTT ACAGCATTAA TAATGATGGC ATTACCCTGC ATTTGAACGG GGCAGACGAT CTGGGGCTGA TCCAGCGTGC GGCCCGCAAG ATTGACGGTC TGGGCATCAA GCATGTTCAG TTAAGCGGTG AAGGTTGGGA TGCGGATCGC TGCTGGGCAT TCTGGCAAGG TTACAAAGCC CCGAAAGGCA CGCGTAAAGT GGAGTGGCCG GATCTGGACG ATGCCCAGCG CCAGGAACTG GATAACCGCC TGATGATCAT CGACTGGGTG CGTGACACCA TCAACGCACC GGCAGAAGAA TTGGGACCAT CGCAACTGGC ACAGCGTGCT GTTGATCTGA TCAGCAACGT CGCGGGCGAT CGTGTGACTT ATCGGATCAC CAAAGGCGAA GATCTGCGTG AGCAAGGTTA TATGGGGCTG CACACCGTCG GACGCGGTTC AGAACGTTCT CCGGTATTGC TGGCGCTGGA TTACAACCCA ACTGGCGATA AAGAAGCGCC AGTGTACGCG TGCCTGGTAG GTAAAGGTAT CACTTTTGAC TCCGGCGGCT ACAGCATCAA ACAGACTGCG TTTATGGACT CGATGAAGTC GGACATGGGC GGCGCGGCAA CGGTTACCGG GGCGCTGGCA TTTGCCATTA CGCGCGGACT GAACAAGCGC GTGAAGCTGT TCCTCTGCTG TGCGGATAAC CTGATTAGCG GCAATGCGTT CAAGCTGGGC GATATCATCA CCTATCGCAA CGGTAAAAAA GTTGAAGTGA TGAACACTGA TGCCGAAGGG CGTCTGGTGC TTGCCGATGG TCTGATTGAT GCCAGTGCGC AGAAACCGGA AATGATCATT GATGCGGCGA CCCTCACCGG GGCGGCGAAA ACTGCGCTGG GTAATGATTA TCACGCGCTG TTCAGTTTTG ACGATGCGCT GGCCGGTCGC TTGCTGGCGA GTGCCGCGCA GGAGAACGAA CCGTTCTGGC GTCTGCCGCT GGCGGAGTTC CACCGCAGCC AGCTGCCGTC TAACTTTGCC GAACTGAACA ATACCGGAAG CGCGGCGTAT CCGGCAGGCG CGAGCACGGC GGCGGGCTTC CTGTCGCACT TTGTTGAGAA CTATCAGCAA GGCTGGTTGC ATATCGACTG CTCGGCGACT TACCGTAAAG CGCCGGTTGA ACAGTGGTCT GCGGGCGCTA CGGGACTTGG TGTGCGCACG ATAGCTAATC TGTTAACGGC GTAA
|
Protein sequence | MTEAMKITLS TQPADARWGE KATYSINNDG ITLHLNGADD LGLIQRAARK IDGLGIKHVQ LSGEGWDADR CWAFWQGYKA PKGTRKVEWP DLDDAQRQEL DNRLMIIDWV RDTINAPAEE LGPSQLAQRA VDLISNVAGD RVTYRITKGE DLREQGYMGL HTVGRGSERS PVLLALDYNP TGDKEAPVYA CLVGKGITFD SGGYSIKQTA FMDSMKSDMG GAATVTGALA FAITRGLNKR VKLFLCCADN LISGNAFKLG DIITYRNGKK VEVMNTDAEG RLVLADGLID ASAQKPEMII DAATLTGAAK TALGNDYHAL FSFDDALAGR LLASAAQENE PFWRLPLAEF HRSQLPSNFA ELNNTGSAAY PAGASTAAGF LSHFVENYQQ GWLHIDCSAT YRKAPVEQWS AGATGLGVRT IANLLTA
|
| |
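The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any length or composition check. The sketch below is one hedged way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample string is only the first two blocks of the Gene sequence field.

```python
def clean_sequence(field: str) -> str:
    """Join the whitespace-separated blocks into one uppercase string."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the Gene sequence field, used here as sample input.
sample = clean_sequence("ATGACAGAAG CGATGAAAAT")
print(len(sample))          # 20
print(gc_fraction(sample))  # GC fraction of the sample only
```

Applying the same two functions to the full Gene sequence field gives values that can be compared against the Gene Length and GC content rows.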