Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1041 |
Symbol | pepN |
ID | 5592250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 1052238 |
End bp | 1054850 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920207 |
Product | aminopeptidase N |
Protein accession | YP_001457772 |
Protein GI | 157160454 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.230058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT ACTGATATTG ACTTGACCTT TGACCTCGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCCCTTCGTC TCAACGGCGA AGACCTCAAA CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCATTAATGA AATAAGCCCG GCGGCGAATA CGGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT TTTACCACCA AAATTATTGC CGATAAAATT AAATATCCCT TCCTGCTTTC CAACGGTAAC CGCGTTGCGC AAGGCGAACT GGAAAACGGA CGCCATTGGG TACAGTGGCA GGACCCGTTC CCGAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TTGATGTACT GCGCGATACC TTTACCACGC GTTCTGGTCG CGAAGTAGCA CTGGAGCTGT ACGTCGATCG CGGCAACCTT GATCGCGCGC CGTGGGCGAT GACCTCGCTG AAAAACTCCA TGAAATGGGA TGAAGAACGC TTTGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG GGCGCAATGG AGAATAAGGG TCTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC GACACCGCCA CCGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC CATAACTGGA CCGGTAACCG AGTGACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA GGTTTAACCG TCTTCCGCGA TCAGGAGTTC AGCTCTGACC TTGGTTCCCG CGCAGTTAAC CGCATCAATA ATGTACGCAC CATGCGCGGA TTGCAGTTTG CAGAAGACGC CAGCCCGATG GCGCACCCGA TCCGCCCGGA TATGGTCATT GAGATGAACA ACTTCTACAC CCTGACCGTT TACGAGAAGG GCGCGGAAGT GATTCGCATG ATCCACACCC TGCTTGGCGA AGAAAACTTC CAGAAAGGGA TGCAGCTTTA TTTCGAGCGT CATGATGGTA GTGCAGCGAC CTGTGACGAC TTTGTGCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CCCATTTCCG CCGTTGGTAC AGCCAGTCCG GTACACCGAT TGTGACCGTC AAAGACGACT ACAATCCAGA AACTGAGCAG TACACCCTGA CCATCAGCCA GCGCACGCCA GCTACGCCGG ATCAGGCAGA AAAACAGCCG CTGCATATTC CGTTTGCCAT CGAACTGTAT GATAACGAAG GCAAAGTGAT CCCGTTGCAG AAAGGCGGTC ATCCGGTGAA TTCCGTGCTG AACGTCACTC AGGCGGAACA GACCTTTGTT TTTGATAATG TCTACTTCCA GCCGGTGCCT GCGCTGCTGT GCGAATTCTC TGCGCCAGTG AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGTCA TGCGCGTAAT GATTTCTCCC GCTGGGATGC GGCGCAAAGT TTGCTGGCAA CCTACATCAA GCTGAACGTC GCGCGTCATC AGCAAGGTCA GCCGCTGTCT CTGCCGGTGC ATGTGGCTGA TGCTTTCCGC GCGGTACTGC TTGATGAGAA GATTGATCCA GCGCTGGCGG CAGAAATCCT GACGCTGCCT TCTGTCAATG AAATGGCTGA ATTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTACT GGCTATTTAC AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCAAA ACGCACTCTG CGTAATGCCT GCCTGCGTTT CCTCGCTTTT GGTGAAACGC ATCTGGCTGA TGTGCTGGTG AGCAAGCAGT TCCACGAAGC AAACAATATG ACTGATGCGC TGGCGGCGCT TTCTGCGGCG GTTGCCGCAC AGCTGCCTTG CCGTGACGCG CTGATGCAGG AGTACGACGA CAAGTGGCAT CAGAACGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT GTGCTGGAGA CGGTGCGCGG CCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC CGTATTCGTT CGTTGATTGG CGCGTTTGCG GGCAGCAATC CGGCAGCGTT CCATGCCGAA GATGGCAGCG GTTACCTGTT CCTGGTGGAA ATGCTTACCG ACCTCAACAG CCGTAACCCG CAGGTGGCTT CACGTCTGAT TGAACCGCTG ATTCGCCTGA AACGTTACGA TGCCAAACGT CAGGAGAAAA TGCGCGCGGC GCTGGAACAG TTGAAAGGGC TGGAAAATCT CTCTGGCGAT CTGTACGAGA AGATAACTAA AGCACTGGCT TGA
|
Protein sequence | MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLRLNGEDLK LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIINEISP AANTALEGLY QSGDALCTQC EAEGFRHITY YLDRPDVLAR FTTKIIADKI KYPFLLSNGN RVAQGELENG RHWVQWQDPF PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD FVQAMEDASN VDLSHFRRWY SQSGTPIVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQFHEANNM TDALAALSAA VAAQLPCRDA LMQEYDDKWH QNGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN RIRSLIGAFA GSNPAAFHAE DGSGYLFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR QEKMRAALEQ LKGLENLSGD LYEKITKALA
|
| |