Gene Information |
Locus tag | EcHS_A2522 |
Symbol | ypdF |
ID | 5591250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2536949 |
End bp | 2538034 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640921643 |
Product | aminopeptidase |
Protein accession | YP_001459176 |
Protein GI | 157161858 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 81 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTAC TCGCTTCGCT GCGCGACTGG CTTAAGGCGC AACAACTGGA TGCAGTGCTT CTCTCCTCAC GGCAGAACAA ACAGCCGCAT CTGGGGATCT CCACCGGATC AGGTTATGTG GTGATTAGCC GTGAAAGTGC GCACATTCTG GTGGATTCGC GCTATTACGT TGAGGTGGAA GCCCGTGCGC AAGGCTACCA GCTGCATTTG CTTGACGCGA CGAACACGCT TACCACTATC GTCAATCAAA TCATTGCCGA TGAACAGTTG CAAACGCTCG GTTTTGAGGG CCAGCAGGTG AGTTGGGAAA CCGCGCACCG CTGGCAGTCT GAACTCAATG CGAAACTGGT TAGCGCCACG CCGGATGTGC TGCGGCAAAT CAAAACGCCA GAGGAGGTGG AGAAAATCCG CCTCGCCTGT GGGATTGCTG ATCGCGGTGC AGAGCATATT CGCCGCTTTA TTCAGGCGGG GATGAGCGAG CGCGAGATAG CCGCTGAACT GGAGTGGTTT ATGCGCCAGC AGGGCGCAGA AAAAGCCTCT TTTGACACCA TTGTCGCCAG TGGCTGGCGT GGGGCGCTGC CGCACGGCAA AGCCAGCGAC AAGATTGTTG CAGCGGGCGA GTTTGTCACT CTCGATTTCG GTGCGCTGTA TCAGGGCTAC TGCTCTGATA TGACGCGCAC CTTGCTGGTG AATGGCGAAG GGGTGAGCGC CGAATCTCAC CCGCTGTTTA ACGTCTATCA AATTGTCCTG CAGGCACAGC TCGCAGCAAT CTCCGCGATT CGCCCCGGCG TGCGCTGCCA GCAGGTTGAC GATGCCGCGC GCCGGGTCAT TACAGAAGCA GGTTATGGCG ACTATTTCGG TCATAACACC GGTCACGCTA TCGGCATTGA AGTTCATGAA GATCCGCGTT TTTCACCGCG GGACACCACG ACGCTACAGC CAGGCATGTT ACTGACCGTG GAGCCGGGGA TTTATTTGCC AGGGCAAGGG GGCGTGCGCA TCGAAGATGT TGTGCTGGTT ACCCCGCAAG GCGCAGAAGT GCTCTACGCC ATGCCGAAAA CAGTGTTGCT CACGGGAGAG GCATAA
|
Protein sequence | MTLLASLRDW LKAQQLDAVL LSSRQNKQPH LGISTGSGYV VISRESAHIL VDSRYYVEVE ARAQGYQLHL LDATNTLTTI VNQIIADEQL QTLGFEGQQV SWETAHRWQS ELNAKLVSAT PDVLRQIKTP EEVEKIRLAC GIADRGAEHI RRFIQAGMSE REIAAELEWF MRQQGAEKAS FDTIVASGWR GALPHGKASD KIVAAGEFVT LDFGALYQGY CSDMTRTLLV NGEGVSAESH PLFNVYQIVL QAQLAAISAI RPGVRCQQVD DAARRVITEA GYGDYFGHNT GHAIGIEVHE DPRFSPRDTT TLQPGMLLTV EPGIYLPGQG GVRIEDVVLV TPQGAEVLYA MPKTVLLTGE A
|
| |
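The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record, and it rests on two assumptions inferred from the numbers above: that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon while Protein Length does not. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and other characters are ignored."""
    bases = [c for c in seq.upper() if c in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for c in bases if c in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(2536949, 2538034)
    print(length)                  # 1086, matches "Gene Length"
    print(protein_length(length))  # 361, matches "Protein Length"
```

Passing the Gene sequence field to `gc_percent` should give a value that can be compared with the GC content row; the 10-base spacing in the field does not need to be removed first, since non-nucleotide characters are skipped.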