Gene Information |
Locus tag | ECH74115_1094 |
Symbol | pepN |
ID | 6969524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1123449 |
End bp | 1126061 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385105 |
Product | aminopeptidase N |
Protein accession | YP_002269604 |
Protein GI | 209396683 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
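
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
# Hypothetical helpers; the names are illustrative, not from the source record.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1123449, 1126061)
print(length_bp)                  # 2613
print(protein_length(length_bp))  # 870
```

Both values match the Gene Length and Protein Length rows of the record.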
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0163604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.415463 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT ACTGATATTG ACTTGACCTT TGACCTCGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCGCTTTGTC TCAACGGCGA AGACCTCAAA CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCATTAATGA AATAAGCCCG GCGGCGAATA CGGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT TTTACCACCA AAATTATTGC CGATAAAACC AAATATCCCT TCCTGCTTTC CAACGGTAAC CGCGTTGCGC AAGGCGAACT GGAAAACGGA CGCCATTGGG TACAGTGGCA GGACCCGTTC CCGAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TTGATGTACT GCGCGATACC TTTACCACGC GTTCTGGTCG CGAAGTAGCA CTGGAGCTGT ACGTCGATCG CGGCAACCTT GATCGCGCGC CGTGGGCGAT GACCTCGCTG AAAAACTCCA TGAAATGGGA TGAAGAACGC TTTGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG GGCGCAATGG AGAATAAGGG TCTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC GATACCGCCA CCGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC CATAACTGGA CCGGTAACCG AGTGACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA GGTTTAACCG TCTTCCGCGA TCAGGAATTC AGCTCTGACC TTGGTTCCCG CGCGGTAAAC CGTATCAATA ATGTGCGCAC CATGCGCGGA TTGCAGTTTG CGGAAGACGC CAGCCCGATG GCGCACCCAA TCCGCCCGGA CATGGTCATT GAGATGAACA ATTTCTACAC CCTGACCGTT TACGAGAAGG GCGCGGAAGT GATTCGTATG ATCCACACTC TGCTTGGCGA AGAAAACTTC CAGAAAGGGA TGCAGCTCTA TTTCGAGCGC CATGATGGCA GTGCAGCGAC CTGCGACGAT TTCGTTCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CCCATTTCCG CCGTTGGTAC AGCCAGTCCG GTACGCCGGT TGTGACCGTC AAAGACGACT ACAATCCGGA AACCGAGCAG TACACCCTGA CCATCAGCCA GCGCACGCCA GCCACGCCGG ATCAAGCAGA AAAACAGCCG CTGCATATTC CGTTTGCCAT CGAATTGTAT GACAACGAAG GCAAAGTGAT CCCGTTGCAG AAAGGCGGTC ATCCGGTGAA TTCCGTGCTG AACGTCACCC AGGCGGAACA GACCTTTGTT TTTGATAATG TCTACTTCCA GCCGGTGCCT GCGCTGCTGT GCGAATTCTC TGCGCCAGTG AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGTCA TGCGCGTAAT GATTTTTCCC GTTGGGATGC GGCGCAAAGT TTGCTGGCAA CCTACATCAA GCTGAACGTC GCCCGTCATC AGCAAGGGCA GCCGCTGTCT CTGCCGGTAC ATGTGGCTGA CGCTTTCCGC 
GCGGTGCTGC TCGATGAGAA GATTGATCCG GCGCTGGCGG CAGAAATCCT GACGCTGCCT TCTGTCAATG AAATGGCTGA ACTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTACT GGCTATTTAC AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCAAA ACGCACTCTG CGTAATGCCT GCCTGCGCTT CCTTGCTTTT GGTGAAACGC ATCTGGCTGA CGTGCTGGTG AGCAAGCAAT ACCATGAAGC AAACAATATG ACCGATGCGC TGGCAGCGCT TTCTGCGGCG GTTGCTGCAC AGCTGCCTTG CCGTGACGCG CTGATGCAGG AGTACGACGA CAAGTGGCAT CAGGATGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT GTGCTGGAGA CGGTGCGCGG CCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC CGCATTCGTT CGTTGATTGG TGCGTTTGCG GGCAGCAACC CGGCAGCGTT CCATGCCGAA GATGGGAGCG GTTACCAGTT CCTGGTGGAA ATGCTTACCG ACCTGAACAG CCGTAACCCG CAGGTAGCTT CACGCCTGAT TGAACCGCTG ATTCGCCTGA AACGTTATGA TGCCAAACGT CAGGAGAAAA TGCGCGCGGC GCTGGAACAG TTGAAAGGGC TGGAAAATCT ATCTGGCGAT CTGTACGAGA AGATCACCAA AGCACTGGCT TAA
Protein sequence | MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLCLNGEDLK LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIINEISP AANTALEGLY QSGDALCTQC EAEGFRHITY YLDRPDVLAR FTTKIIADKT KYPFLLSNGN RVAQGELENG RHWVQWQDPF PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD FVQAMEDASN VDLSHFRRWY SQSGTPVVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQYHEANNM TDALAALSAA VAAQLPCRDA LMQEYDDKWH QDGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN RIRSLIGAFA GSNPAAFHAE DGSGYQFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR QEKMRAALEQ LKGLENLSGD LYEKITKALA
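
The relationship between the gene sequence and the protein sequence can be checked with a short translation routine. The sketch below is an assumption-laden illustration, not the database's own method: it builds the codon table from the amino-acid string that NCBI publishes for translation table 11 (whose codon assignments match the standard code), strips the 10-base grouping spaces used in this record, and stops at the first stop codon. The names `clean` and `translate` are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
import itertools

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0 until the first stop codon or the end of the input."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence in this record.
print(translate("ATGACTCAAC AGCCACAAGC C"))  # MTQQPQA
print(translate("GCACTGGCT TAA"))            # ALA
```

The two outputs agree with the first seven and last three residues of the protein sequence listed in the record.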