Gene Information |
Locus tag | EcSMS35_2675 |
Symbol | pepB |
ID | 6147186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2750709 |
End bp | 2751992 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641617546 |
Product | aminopeptidase B |
Protein accession | YP_001744711 |
Protein GI | 170681265 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
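The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS includes its terminal stop codon. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon, which is not translated into an amino acid.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codon_count = gene_length_bp // 3
    protein_length_aa = codon_count - 1  # drop the stop codon
    return gene_length_bp, protein_length_aa


# Values copied from the record above (Start bp, End bp).
gene_bp, protein_aa = cds_lengths(2750709, 2751992)
print(gene_bp, protein_aa)
```

Under these assumptions the result matches the Gene Length and Protein Length fields of the record.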
Plasmid Coverage Information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.334715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage Information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.941339 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACAGAAG CGATGAAAAT TACCCTCTCT ACCCAACCTG CCGATGCGCG CTGGGGAGAA AAAGCAACTT ACAGCATTAA TAATGATGGC ATTACCCTGC ACTTGAACGG AGCGGACGAT CTGGGGCTGA TCCAGCGTGC GGCGCGCAAG ATTGACGGTC TGGGCATCAA ACATGTTCAG CTAAGCGGTG AAGGCTGGGA TGCGGATCGT TGCTGGGCAT TCTGGCAAGG TTACAAAGCC CCGAAAGGCA CGCGTAAAGT GGAGTGGCCG GATCTGGACG ATGTCCAGCG CCAGGAACTG GATAACCGCC TGATGATCAT CGACTGGGTG CGTGACACCA TTAACGCACC GGCGGAAGAG CTGGGGCCAT CGCAACTGGC ACAGCGTGCT GTTGATCTGA TCAGCAATGT CGCGGGCGAT CGTGTGACTT ACCGGATCAC CAAAGGCGAA GATCTGCGTG ATCAAGGTTA TATGGGGCTG CACACGGTCG GACGCGGTTC AGAACGTTCT CCGGTATTGC TGGCGCTGGA TTACAACCCG ACTGGCGATA AAGAAGCGCC AGTGTACGCG TGCCTGGTAG GTAAAGGTAT CACTTTTGAC TCCGGCGGCT ACAGCATCAA ACAGACCGCA TTTATGGACT CGATGAAGTC GGACATGGGC GGCGCGGCAA CGGTTACCGG GGCGCTGGCG TTTGCCATTA CGCGCGGACT GAACAAGCGC GTGAAGCTGT TCCTCTGCTG TGCGGATAAC CTGATCAGCG GCAACGCGTT CAAACTGGGC GATATTATTA CCTATCGCAA CGGTAAAAAA GTTGAAGTGA TGAACACTGA TGCGGAAGGG CGTCTGGTGC TGGCCGATGG CCTGATTGAT GCCAGTGCGC AGAAACCGGA ACTGATCATT GATGCGGCGA CCCTCACCGG GGCGGCGAAA ACTGCGCTGG GTAATGATTA TCACGCGCTG TTCAGTTTTG ACGATGCGCT TGCCGGTCGC TTGCTGGCGA GTGCCGCGCA GGAGAATGAA CCGTTCTGGC GTCTGCCGCT GGCGGAGTTC CACCGCAGCC AGCTGCCGTC TAACTTTGCC GAACTGAACA ATACCGGAAG CGCGGCGTAC CCGGCAGGCG CGAGCACGGC GGCGGGCTTC CTGTCGCACT TTGTTGAGAA CTATCAGCAA GGCTGGCTGC ATATCGACTG CTCGGCGACT TACCGTAAAG CGCCGGTTGA ACAGTGGTCT GCGGGTGCTA CGGGACTTGG TGTGCGCACG ATTGCTAATC TGTTAACGGC GTAA
Protein sequence | MTEAMKITLS TQPADARWGE KATYSINNDG ITLHLNGADD LGLIQRAARK IDGLGIKHVQ LSGEGWDADR CWAFWQGYKA PKGTRKVEWP DLDDVQRQEL DNRLMIIDWV RDTINAPAEE LGPSQLAQRA VDLISNVAGD RVTYRITKGE DLRDQGYMGL HTVGRGSERS PVLLALDYNP TGDKEAPVYA CLVGKGITFD SGGYSIKQTA FMDSMKSDMG GAATVTGALA FAITRGLNKR VKLFLCCADN LISGNAFKLG DIITYRNGKK VEVMNTDAEG RLVLADGLID ASAQKPELII DAATLTGAAK TALGNDYHAL FSFDDALAGR LLASAAQENE PFWRLPLAEF HRSQLPSNFA ELNNTGSAAY PAGASTAAGF LSHFVENYQQ GWLHIDCSAT YRKAPVEQWS AGATGLGVRT IANLLTA
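The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a simple GC-fraction calculation and a codon split. The helper names (`join_blocks`, `gc_fraction`, `split_codons`) are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def split_codons(seq: str) -> list[str]:
    """Split a sequence into complete triplets, ignoring any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two ten-base blocks of the gene sequence shown above.
prefix = join_blocks("ATGACAGAAG CGATGAAAAT")
print(len(prefix), gc_fraction(prefix), split_codons(prefix))
```

Applied to the full gene sequence, the same helpers could be used to compare the computed GC fraction against the GC content field, and to confirm that the first codon is `ATG` and the last is a stop codon such as `TAA`.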