Gene EcE24377A_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1033 
SymbolpepN 
ID5586539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1056534 
End bp1059146 
Gene Length2613 bp 
Protein Length870 aa 
Translation table11 
GC content54% 
IMG OID640924738 
Productaminopeptidase N 
Protein accessionYP_001462152 
Protein GI157155731 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00218696 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT 
ACTGATATTG ACTTGACCTT TGACCTCGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC
CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCGCTTCGTC TCAACGGCGA AGACCTCAAA
CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA
CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCATTAATGA AATAAGCCCG
GCGGCGAATA CGGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT
GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT
TTTACCACCA AAATTATTGC CGATAAAACC AAATATCCCT TCCTGCTTTC CAACGGTAAC
CGCGTTGCGC AAGGTGAACT GGAAAACGGA CGCCATTGGG TACAGTGGCA GGACCCGTTC
CCGAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TTGATGTACT GCGCGACACC
TTTACCACGC GTTCTGGTCG CGAAGTGGCG CTGGAGCTGT ACGTCGATCG CGGCAACCTT
GATCGCGCGC CGTGGGCGAT GACCTCGCTG AAAAACTCAA TGAAATGGGA TGAAGAACGC
TTCGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG
GGCGCAATGG AGAATAAGGG TTTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC
GACACCGCCA CTGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC
CATAACTGGA CCGGTAACCG AGTCACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA
GGTTTAACCG TCTTCCGCGA TCAGGAATTC AGCTCTGACC TTGGTTCCCG CGCGGTAAAC
CGTATCAATA ATGTGCGCAC CATGCGCGGA TTGCAGTTTG CGGAAGACGC CAGCCCGATG
GCGCACCCGA TCCGCCCGGA TATGGTCATT GAGATGAACA ACTTCTACAC CCTGACCGTT
TACGAGAAGG GCGCGGAAGT GATTCGCATG ATCCACACCC TGCTGGGCGA AGAAAACTTC
CAGAAAGGGA TGCAACTCTA TTTCGAGCGC CATGATGGCA GCGCGGCAAC CTGCGACGAC
TTTGTGCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CCCATTTCCG CCGTTGGTAC
AGCCAGTCCG GCACGCCGAT TGTGACCGTC AAAGACGACT ACAATCCGGA AACCGAGCAG
TACACCCTGA CCATCAGCCA GCGCACGCCA GCCACGCCGG ATCAGGCAGA AAAACAGCCG
CTGCATATTC CATTTGCCAT CGAATTGTAT GACAACGAAG GCAAAGTGAT CCCGTTGCAG
AAAGGCGGTC ATCCGGTGAA TTCCGTGCTG AACGTCACCC AGGCGGAACA GACCTTTGTT
TTTGATAATG TCTACTTCCA GCCGGTGCCT GCGCTGCTAT GCGAATTCTC TGCGCCAGTG
AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGTCA TGCGCGTAAT
GATTTCTCCC GTTGGGATGC GGCGCAAAGT TTGCTGGCAA CCTACATCAA GCTGAACGTC
GCCCGTCATC AGCAAGGGCA GCCGCTGTCT CTGCCGGTAC ATGTGGCTGA CGCTTTCCGC
GCGGTGCTGC TCGATGAGAA GATTGATCCG GCGCTGGCGG CAGAAATCCT GACGCTGCCT
TCTGTCAATG AAATGGCTGA ACTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA
CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTGCT GGCTATTTAC
AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCAAA ACGCACTCTG
CGTAATGCCT GCCTGCGCTT CCTTGCTTTT GGTGAAACGC ATCTGGCTGA CGTGCTGGTG
AGCAAGCAAT ACCATGAAGC AAACAATATG ACCGATGCGC TGGCAGCGCT TTCTGCGGCG
GTTGCTGCAC AGCTGCCTTG CCGTGACGCG CTGATGCAGG AGTACGACGA TAAGTGGCAT
CAGGACGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT
GTGCTGGAGA CGGTGCGTGG TCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC
CGTATTCGTT CGTTGATTGG TGCGTTTGCG GGCAGCAACC CGGCAGCGTT CCATGCCGAA
GATGGCAGCG GTTACCAGTT CCTGGTGGAA ATGCTTACCG ACCTCAACAG CCGTAACCCG
CAGGTAGCTT CACGTCTGAT TGAACCGCTG ATTCGCCTGA AACGTTATGA TGCCAAACGT
CAGGAGAAAA TGCGCGCGGC GCTGGAACAG TTGAAAGGGC TGGAAAATCT CTCTGGCGAT
CTGTACGAGA AGATCACCAA AGCACTGGCT TGA
 
Protein sequence
MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLRLNGEDLK 
LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIINEISP AANTALEGLY QSGDALCTQC
EAEGFRHITY YLDRPDVLAR FTTKIIADKT KYPFLLSNGN RVAQGELENG RHWVQWQDPF
PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER
FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF
HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM
AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD
FVQAMEDASN VDLSHFRRWY SQSGTPIVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP
LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV
KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR
AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY
NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQYHEANNM TDALAALSAA
VAAQLPCRDA LMQEYDDKWH QDGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN
RIRSLIGAFA GSNPAAFHAE DGSGYQFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR
QEKMRAALEQ LKGLENLSGD LYEKITKALA