Gene EcSMS35_2187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2187 
SymbolpepN 
ID6142897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2194212 
End bp2196824 
Gene Length2613 bp 
Protein Length870 aa 
Translation table11 
GC content54% 
IMG OID641617063 
Productaminopeptidase N 
Protein accessionYP_001744237 
Protein GI170683651 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.113364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.739046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT 
ACTGATATTG ACTTGACCTT TGACCTGGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC
CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCGCTTCGTC TTGATGGCGA AGATCTTAAA
CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA
CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCGTTAATGA AATTAGCCCG
GCGACGAATA CCGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT
GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT
TTTACCACCA AAATTATTGC CGATAAAACC CAATATCCCT TCCTGCTTTC CAACGGTAAC
CGCGTTGCGC AAGGCGAACT GGAAAACGGA CGTCACTGGG TACAGTGGCA GGACCCGTTC
CCTAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TCGACGTGCT GCGCGACACC
TTCACCACGC GTTCTGGTCG CGAAGTTGCG CTGGAGCTGT ACGTCGATCG CGGCAACCTT
GATCGTGCGC CGTGGGCGAT GACCTCGCTG AAAAACTCCA TGAAATGGGA TGAAGAGCGC
TTCGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG
GGCGCAATGG AGAATAAGGG TCTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC
GATACCGCCA CCGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC
CATAACTGGA CCGGTAACCG AGTGACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA
GGTTTAACCG TCTTCCGCGA TCAGGAGTTC AGCTCTGACC TTGGTTCTCG CGCGGTAAAC
CGTATCAACA ACGTGCGCAC CATGCGCGGA TTGCAGTTTG CGGAAGACGC CAGCCCGATG
GCGCACCCGA TCCGCCCGGA TATGGTCATT GAGATGAACA ACTTCTACAC CCTGACCGTT
TACGAGAAGG GCGCGGAAGT CATTCGCATG ATCCACACCC TGCTGGGCGA AGAAAACTTC
CAGAAAGGGA TGCAGCTCTA TTTCGAGCGC CATGATGGCA GCGCGGCAAC CTGCGACGAC
TTTGTTCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CTCATTTCCG CCGTTGGTAC
AGCCAGTCCG GTACGCCGGT TGTGACCGTC AAAGACGACT ATAATCCGGA AACCGAGCAG
TACACCCTGA CCATCAGCCA GCGTACGCCT GCTACGCCGG ATCAGGCAGA AAAACAGCCG
CTGCATATTC CGTTTGCCAT CGAACTGTAT GACAACGAAG GCAAAGTGAT CCCGTTGCAA
AAAGGCGGTC ATCCGGTGAA TTCGGTGCTG AACGTCACCC AGGCGGAACA GACCTTTGTC
TTTGATAATG TCTACTTTCA GCCGGTGCCT GCGCTGCTGT GCGAATTCTC TGCGCCAGTG
AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGCCA TGCGCGTAAT
GATTTCTCCC GCTGGGATGC GGCGCAAAGC CTGCTGGCAA CCTACATCAA GCTAAACGTC
GCCCGTCATC AGCAAGGGCA GCCGCTGTCT CTGCCGGTGC ATGTGGCTGA CGCTTTCCGC
GCGGTGCTGC TCGATGAGAA GATTGATCCG GCGCTGGCGG CAGAAATCCT GACGCTGCCT
TCTGTCAATG AAATGGCTGA ACTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA
CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTGCT GGCTATTTAC
AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCGAA ACGCACTCTG
CGTAATGCCT GCCTGCGCTT CCTTGCTTTT GGTGAAACGC ATCTGGCTGA CGTGCTGGTG
AGCAAGCAGT TCCACGAAGC GAACAATATG ACCGATGCGC TGGCGGCGCT TTCTGCGGCG
GTTGCCGCAC AACTGCCATG CCGTGATGCG CTGATGCAGG AATACGACGA TAAGTGGCAT
CTGGACGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT
GTGCTGGAGA CGGTGCGCGG TCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC
CGCATTCGTT CGTTGATTGG CGCGTTTGCG GGCAGCAACC CGGCAGCGTT CCATGCCGAA
GATGGCAGCG GTTATCAGTT CCTTGTGGAA ATGCTTACCG ACCTCAACAG CCGTAACCCG
CAGGTGGCAT CGCGCCTGAT TGAGCCGCTG ATTCGCCTGA AACGTTACGA TGCCAAACGT
CAGGAGAAAA TGCGCGCGGC GCTGGAGCAG TTGAAAGGGC TGGAAAATCT CTCTGGCGAT
CTGTACGAGA AGATCACCAA AGCACTGGCT TGA
 
Protein sequence
MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLRLDGEDLK 
LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIVNEISP ATNTALEGLY QSGDALCTQC
EAEGFRHITY YLDRPDVLAR FTTKIIADKT QYPFLLSNGN RVAQGELENG RHWVQWQDPF
PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER
FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF
HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM
AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD
FVQAMEDASN VDLSHFRRWY SQSGTPVVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP
LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV
KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR
AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY
NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQFHEANNM TDALAALSAA
VAAQLPCRDA LMQEYDDKWH LDGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN
RIRSLIGAFA GSNPAAFHAE DGSGYQFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR
QEKMRAALEQ LKGLENLSGD LYEKITKALA