Gene Information |
Locus tag | EcSMS35_2187 |
Symbol | pepN |
ID | 6142897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2194212 |
End bp | 2196824 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617063 |
Product | aminopeptidase N |
Protein accession | YP_001744237 |
Protein GI | 170683651 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
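The length fields in this record follow from its coordinates. An inclusive interval from Start bp to End bp spans End - Start + 1 bases, and a complete CDS encodes one amino acid fewer than it has codons, because the final codon is a stop. The sketch below checks that arithmetic with the values listed above. The function names are illustrative only and do not belong to any database API, and treating the coordinates as inclusive is an assumption, though it agrees with the stated gene length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an interval, assuming inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for EcSMS35_2187 (pepN).
length_bp = gene_length(2194212, 2196824)
print(length_bp, protein_length(length_bp))  # 2613 870
```

Both results match the Gene Length (2613 bp) and Protein Length (870 aa) fields.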
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.113364 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.739046 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT ACTGATATTG ACTTGACCTT TGACCTGGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCGCTTCGTC TTGATGGCGA AGATCTTAAA CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCGTTAATGA AATTAGCCCG GCGACGAATA CCGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT TTTACCACCA AAATTATTGC CGATAAAACC CAATATCCCT TCCTGCTTTC CAACGGTAAC CGCGTTGCGC AAGGCGAACT GGAAAACGGA CGTCACTGGG TACAGTGGCA GGACCCGTTC CCTAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TCGACGTGCT GCGCGACACC TTCACCACGC GTTCTGGTCG CGAAGTTGCG CTGGAGCTGT ACGTCGATCG CGGCAACCTT GATCGTGCGC CGTGGGCGAT GACCTCGCTG AAAAACTCCA TGAAATGGGA TGAAGAGCGC TTCGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG GGCGCAATGG AGAATAAGGG TCTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC GATACCGCCA CCGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC CATAACTGGA CCGGTAACCG AGTGACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA GGTTTAACCG TCTTCCGCGA TCAGGAGTTC AGCTCTGACC TTGGTTCTCG CGCGGTAAAC CGTATCAACA ACGTGCGCAC CATGCGCGGA TTGCAGTTTG CGGAAGACGC CAGCCCGATG GCGCACCCGA TCCGCCCGGA TATGGTCATT GAGATGAACA ACTTCTACAC CCTGACCGTT TACGAGAAGG GCGCGGAAGT CATTCGCATG ATCCACACCC TGCTGGGCGA AGAAAACTTC CAGAAAGGGA TGCAGCTCTA TTTCGAGCGC CATGATGGCA GCGCGGCAAC CTGCGACGAC TTTGTTCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CTCATTTCCG CCGTTGGTAC AGCCAGTCCG GTACGCCGGT TGTGACCGTC AAAGACGACT ATAATCCGGA AACCGAGCAG TACACCCTGA CCATCAGCCA GCGTACGCCT GCTACGCCGG ATCAGGCAGA AAAACAGCCG CTGCATATTC CGTTTGCCAT CGAACTGTAT GACAACGAAG GCAAAGTGAT CCCGTTGCAA AAAGGCGGTC ATCCGGTGAA TTCGGTGCTG AACGTCACCC AGGCGGAACA GACCTTTGTC TTTGATAATG TCTACTTTCA GCCGGTGCCT GCGCTGCTGT GCGAATTCTC TGCGCCAGTG AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGCCA TGCGCGTAAT GATTTCTCCC GCTGGGATGC GGCGCAAAGC CTGCTGGCAA CCTACATCAA GCTAAACGTC GCCCGTCATC AGCAAGGGCA GCCGCTGTCT CTGCCGGTGC ATGTGGCTGA CGCTTTCCGC 
GCGGTGCTGC TCGATGAGAA GATTGATCCG GCGCTGGCGG CAGAAATCCT GACGCTGCCT TCTGTCAATG AAATGGCTGA ACTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTGCT GGCTATTTAC AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCGAA ACGCACTCTG CGTAATGCCT GCCTGCGCTT CCTTGCTTTT GGTGAAACGC ATCTGGCTGA CGTGCTGGTG AGCAAGCAGT TCCACGAAGC GAACAATATG ACCGATGCGC TGGCGGCGCT TTCTGCGGCG GTTGCCGCAC AACTGCCATG CCGTGATGCG CTGATGCAGG AATACGACGA TAAGTGGCAT CTGGACGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT GTGCTGGAGA CGGTGCGCGG TCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC CGCATTCGTT CGTTGATTGG CGCGTTTGCG GGCAGCAACC CGGCAGCGTT CCATGCCGAA GATGGCAGCG GTTATCAGTT CCTTGTGGAA ATGCTTACCG ACCTCAACAG CCGTAACCCG CAGGTGGCAT CGCGCCTGAT TGAGCCGCTG ATTCGCCTGA AACGTTACGA TGCCAAACGT CAGGAGAAAA TGCGCGCGGC GCTGGAGCAG TTGAAAGGGC TGGAAAATCT CTCTGGCGAT CTGTACGAGA AGATCACCAA AGCACTGGCT TGA
|
Protein sequence | MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLRLDGEDLK LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIVNEISP ATNTALEGLY QSGDALCTQC EAEGFRHITY YLDRPDVLAR FTTKIIADKT QYPFLLSNGN RVAQGELENG RHWVQWQDPF PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD FVQAMEDASN VDLSHFRRWY SQSGTPVVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQFHEANNM TDALAALSAA VAAQLPCRDA LMQEYDDKWH LDGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN RIRSLIGAFA GSNPAAFHAE DGSGYQFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR QEKMRAALEQ LKGLENLSGD LYEKITKALA
|
| |
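The gene and protein sequences above are related by translation table 11, and the GC content field is a base-composition summary of the gene sequence. The sketch below is a minimal way to reproduce both from the 10-base blocks as printed. It assumes the sequence is shown in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand) and that internal codons in table 11 use the same amino-acid assignments as the standard code. All names in it are hypothetical helpers, not part of any library.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: translation table 11 shares these amino-acid assignments
# with the standard code and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGACTCAAC AGCCACAAGC CAAATACCGT"
print(translate(prefix))  # MTQQPQAKYR
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` should allow the same comparison against the Protein sequence and GC content fields.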