Gene ECH74115_1094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1094 
SymbolpepN 
ID6969524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1123449 
End bp1126061 
Gene Length2613 bp 
Protein Length870 aa 
Translation table11 
GC content54% 
IMG OID643385105 
Productaminopeptidase N 
Protein accessionYP_002269604 
Protein GI209396683 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0163604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.415463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT 
ACTGATATTG ACTTGACCTT TGACCTCGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC
CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCGCTTTGTC TCAACGGCGA AGACCTCAAA
CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA
CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCATTAATGA AATAAGCCCG
GCGGCGAATA CGGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT
GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT
TTTACCACCA AAATTATTGC CGATAAAACC AAATATCCCT TCCTGCTTTC CAACGGTAAC
CGCGTTGCGC AAGGCGAACT GGAAAACGGA CGCCATTGGG TACAGTGGCA GGACCCGTTC
CCGAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TTGATGTACT GCGCGATACC
TTTACCACGC GTTCTGGTCG CGAAGTAGCA CTGGAGCTGT ACGTCGATCG CGGCAACCTT
GATCGCGCGC CGTGGGCGAT GACCTCGCTG AAAAACTCCA TGAAATGGGA TGAAGAACGC
TTTGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG
GGCGCAATGG AGAATAAGGG TCTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC
GATACCGCCA CCGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC
CATAACTGGA CCGGTAACCG AGTGACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA
GGTTTAACCG TCTTCCGCGA TCAGGAATTC AGCTCTGACC TTGGTTCCCG CGCGGTAAAC
CGTATCAATA ATGTGCGCAC CATGCGCGGA TTGCAGTTTG CGGAAGACGC CAGCCCGATG
GCGCACCCAA TCCGCCCGGA CATGGTCATT GAGATGAACA ATTTCTACAC CCTGACCGTT
TACGAGAAGG GCGCGGAAGT GATTCGTATG ATCCACACTC TGCTTGGCGA AGAAAACTTC
CAGAAAGGGA TGCAGCTCTA TTTCGAGCGC CATGATGGCA GTGCAGCGAC CTGCGACGAT
TTCGTTCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CCCATTTCCG CCGTTGGTAC
AGCCAGTCCG GTACGCCGGT TGTGACCGTC AAAGACGACT ACAATCCGGA AACCGAGCAG
TACACCCTGA CCATCAGCCA GCGCACGCCA GCCACGCCGG ATCAAGCAGA AAAACAGCCG
CTGCATATTC CGTTTGCCAT CGAATTGTAT GACAACGAAG GCAAAGTGAT CCCGTTGCAG
AAAGGCGGTC ATCCGGTGAA TTCCGTGCTG AACGTCACCC AGGCGGAACA GACCTTTGTT
TTTGATAATG TCTACTTCCA GCCGGTGCCT GCGCTGCTGT GCGAATTCTC TGCGCCAGTG
AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGTCA TGCGCGTAAT
GATTTTTCCC GTTGGGATGC GGCGCAAAGT TTGCTGGCAA CCTACATCAA GCTGAACGTC
GCCCGTCATC AGCAAGGGCA GCCGCTGTCT CTGCCGGTAC ATGTGGCTGA CGCTTTCCGC
GCGGTGCTGC TCGATGAGAA GATTGATCCG GCGCTGGCGG CAGAAATCCT GACGCTGCCT
TCTGTCAATG AAATGGCTGA ACTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA
CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTACT GGCTATTTAC
AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCAAA ACGCACTCTG
CGTAATGCCT GCCTGCGCTT CCTTGCTTTT GGTGAAACGC ATCTGGCTGA CGTGCTGGTG
AGCAAGCAAT ACCATGAAGC AAACAATATG ACCGATGCGC TGGCAGCGCT TTCTGCGGCG
GTTGCTGCAC AGCTGCCTTG CCGTGACGCG CTGATGCAGG AGTACGACGA CAAGTGGCAT
CAGGATGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT
GTGCTGGAGA CGGTGCGCGG CCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC
CGCATTCGTT CGTTGATTGG TGCGTTTGCG GGCAGCAACC CGGCAGCGTT CCATGCCGAA
GATGGGAGCG GTTACCAGTT CCTGGTGGAA ATGCTTACCG ACCTGAACAG CCGTAACCCG
CAGGTAGCTT CACGCCTGAT TGAACCGCTG ATTCGCCTGA AACGTTATGA TGCCAAACGT
CAGGAGAAAA TGCGCGCGGC GCTGGAACAG TTGAAAGGGC TGGAAAATCT ATCTGGCGAT
CTGTACGAGA AGATCACCAA AGCACTGGCT TAA
 
Protein sequence
MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLCLNGEDLK 
LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIINEISP AANTALEGLY QSGDALCTQC
EAEGFRHITY YLDRPDVLAR FTTKIIADKT KYPFLLSNGN RVAQGELENG RHWVQWQDPF
PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER
FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF
HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM
AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD
FVQAMEDASN VDLSHFRRWY SQSGTPVVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP
LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV
KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR
AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY
NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQYHEANNM TDALAALSAA
VAAQLPCRDA LMQEYDDKWH QDGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN
RIRSLIGAFA GSNPAAFHAE DGSGYQFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR
QEKMRAALEQ LKGLENLSGD LYEKITKALA