Gene ECH74115_3617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3617 
SymbolypdF 
ID6966630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3337797 
End bp3338882 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content57% 
IMG OID643387412 
Productaminopeptidase 
Protein accessionYP_002271871 
Protein GI209397138 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTAC TCGCTTCGCT GCGCGACTGG CTTAAGGCGC AACAACTGGA TGCGGTGCTT 
CTCTCCTCAC GGCAGAACAA ACAGCCGCAT CTGGGGATCT CCACCGGATC AGGCTATGTG
CTGATTAGCC GTGAAAGTGC GCACATTCTG GTGGACTCGC GCTATTACGC GGATGTAGAA
GCCCGCACGC AAGGTTACCA GCTGCATTTG CTTGACGCGA CGCACACGCT TACAACCATC
GCCAGGCAAA TCATTGCCGA TGAGCAGTTG CAAACGCTCG GTTTTGAAGG CCAGCAGGTG
AGTTGGGAAA CCGCGCATCG CTGGCAGTCT GAACTCAATG CGAAACTGGT TAGCGCCACG
CCGGATGTGC TGCGGCAAAT CAAAACGCCA GAGGAGGTGG AGAAAATCCG CCTTGCCTGT
GGGATTGCCG ATCGCGGTGC AGAGCATATT CGCCGCTTTA TTCAGGCGGG GATGAGCGAG
CGCGAGATAG CCGCTGAACT GGAGTGGTTT ATGCGCCAGC AGGGCGCAGA AAAAACCTCT
TTTGACACCA TTGTCGCCAG TGGCTGGCGT GGGGCGCTGC CGCACGGCAA AGCCAGCGAC
AAGATTGTTG CAGCGGGCGA GTTTGTCACT CTCGATTTCG GTGCGCTCTA TCAGGGCTAC
TGCTCTGATA TGACGCGCAC CTTGCTGGTG AATGGCGAAG GGGTGAGCGC CGAATCTCAC
CCGCTGTTTA ACGTCTATCA GATTGTTTTG CAGGCACAGC TCGCAGCAAT CTCTGCAATT
CGCCCCGGCG TGCGCTGCCA GCAGGTTGAC GAAGCCGCGC GTCGGGTGAT TACCGAGGCA
GGTTTTAGCC ACTATTTCGG TCATAACACC GCTCATGCTA TCGGCATTGA AGTTCATGAA
GATCCGCGTT TTTCACCGCG GGACACCACG ACGCTACAGC CAGGCATGTT ACTGACCGTG
GAGCCGGGGA TTTATTTGCC AGGGCAAGGG GGCGTGCGCA TCGAGGATGT TGTGCTGGTC
ACCCCGCAAG GCGCAGAAGT GCTCTACGCC ATGCCGAAAA CAGTGTTGCT CACGGGAGAG
GCATAA
 
Protein sequence
MTLLASLRDW LKAQQLDAVL LSSRQNKQPH LGISTGSGYV LISRESAHIL VDSRYYADVE 
ARTQGYQLHL LDATHTLTTI ARQIIADEQL QTLGFEGQQV SWETAHRWQS ELNAKLVSAT
PDVLRQIKTP EEVEKIRLAC GIADRGAEHI RRFIQAGMSE REIAAELEWF MRQQGAEKTS
FDTIVASGWR GALPHGKASD KIVAAGEFVT LDFGALYQGY CSDMTRTLLV NGEGVSAESH
PLFNVYQIVL QAQLAAISAI RPGVRCQQVD EAARRVITEA GFSHYFGHNT AHAIGIEVHE
DPRFSPRDTT TLQPGMLLTV EPGIYLPGQG GVRIEDVVLV TPQGAEVLYA MPKTVLLTGE
A