Gene ECD_00936 details


Gene Information

Locus tag: ECD_00936
Symbol: pepN
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 995711
End bp: 998323
Gene Length: 2613 bp
Protein Length: 870 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: aminopeptidase N
Protein accession: ACT42831
Protein GI: 253977161
COG category:
COG ID:
TIGRFAM ID:
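
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the record does not state either convention explicitly. The helper name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a
    stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(cds_lengths(995711, 998323))  # (2613, 870)
```

Under these assumptions the result agrees with the Gene Length (2613 bp) and Protein Length (870 aa) fields.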


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00647733
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT 
ACTGATATTG ACTTGACCTT TGACCTCGAC GCGCAAAAGA CGGTCGTTAC CGCGGTCAGC
CAGGCTGTCC GTCATGGTGC ATCAGATGCT CCCCTTCGTC TCAACGGCGA AGACCTCAAA
CTGGTTTCTG TTCATATTAA TGATGAGCCG TGGACCGCCT GGAAAGAAGA AGAGGGCGCA
CTGGTCATCA GTAATTTGCC GGAGCGTTTT ACGCTTAAGA TCATTAATGA AATAAGCCCG
GCGGCGAATA CGGCGCTGGA AGGGCTTTAT CAGTCAGGCG ATGCGCTTTG CACCCAGTGT
GAAGCCGAAG GTTTCCGCCA TATTACGTAT TATCTCGACC GCCCGGACGT GCTGGCGCGT
TTTACCACCA AAATTATTGC CGATAAAATT AAATATCCCT TCCTGCTTTC CAACGGTAAC
CGCGTTGCGC AAGGCGAACT GGAAAACGGA CGCCATTGGG TACAGTGGCA GGACCCGTTC
CCGAAACCGT GCTACCTGTT TGCGCTGGTG GCAGGCGACT TTGATGTACT GCGCGATACC
TTTACCACGC GTTCTGGTCG CGAAGTAGCA CTGGAGCTGT ACGTCGATCG CGGCAACCTT
GATCGCGCGC CATGGGCGAT GACCTCGCTG AAAAACTCCA TGAAATGGGA TGAAGAACGC
TTTGGCCTGG AGTATGACCT CGACATCTAT ATGATCGTCG CGGTGGATTT CTTCAATATG
GGCGCAATGG AGAATAAGGG TCTGAATATC TTTAACTCCA AATATGTGCT GGCCCGCACC
GACACCGCCA CCGACAAAGA TTACCTCGAT ATTGAACGCG TTATCGGCCA TGAATATTTC
CATAACTGGA CCGGTAACCG AGTGACCTGC CGCGACTGGT TCCAGCTCAG CCTGAAAGAA
GGTTTAACCG TCTTCCGCGA TCAGGAGTTC AGCTCTGACC TTGGTTCCCG CGCAGTTAAC
CGCATCAATA ATGTACGCAC CATGCGCGGA TTGCAGTTTG CAGAAGACGC CAGCCCGATG
GCGCACCCGA TCCGCCCGGA TATGGTCATT GAGATGAACA ACTTCTACAC CCTGACCGTT
TACGAGAAGG GCGCGGAAGT GATTCGCATG ATCCACACCC TGCTTGGCGA AGAAAACTTC
CAGAAAGGGA TGCAGCTTTA TTTCGAGCGT CATGATGGTA GTGCAGCGAC CTGTGACGAC
TTTGTGCAGG CGATGGAAGA TGCGTCGAAT GTCGATCTCT CCCATTTCCG CCGTTGGTAC
AGCCAGTCCG GTACACCGAT TGTGACCGTC AAAGACGACT ACAATCCAGA AACTGAGCAG
TACACCCTGA CCATCAGCCA GCGCACGCCA GCTACGCCGG ATCAGGCAGA AAAACAGCCG
CTGCATATTC CGTTTGCCAT CGAACTGTAT GATAACGAAG GCAAAGTGAT CCCGTTGCAG
AAAGGCGGTC ATCCGGTGAA TTCCGTGCTG AACGTCACTC AGGCGGAACA GACCTTTGTT
TTTGATAATG TCTACTTCCA GCCGGTGCCT GCGCTGCTGT GCGAATTCTC TGCGCCAGTG
AAACTGGAAT ATAAGTGGAG CGATCAGCAA CTGACCTTCC TGATGCGTCA TGCGCGTAAT
GATTTCTCCC GCTGGGATGC GGCGCAAAGT TTGCTGGCAA CCTACATCAA GCTGAACGTC
GCGCGTCATC AGCAAGGTCA GCCGCTGTCT CTGCCGGTGC ATGTGGCTGA TGCTTTCCGC
GCGGTACTGC TTGATGAGAA GATTGATCCA GCGCTGGCGG CAGAAATCCT GACGCTGCCT
TCTGTCAATG AAATGGCTGA ATTGTTCGAT ATCATCGACC CGATTGCTAT TGCCGAAGTA
CGCGAAGCAC TCACTCGTAC TCTGGCGACT GAACTGGCGG ATGAGCTACT GGCTATTTAC
AACGCGAATT ACCAGAGCGA GTACCGTGTT GAGCATGAAG ATATTGCAAA ACGCACTCTG
CGTAATGCCT GCCTGCGTTT CCTCGCTTTT GGTGAAACGC ATCTGGCTGA TGTGCTGGTG
AGCAAGCAGT TCCACGAAGC AAACAATATG ACTGATGCGC TGGCGGCGCT TTCTGCGGCG
GTTGCCGCAC AGCTGCCTTG CCGTGACGCG CTGATGCAGG AGTACGACGA CAAGTGGCAT
CAGAACGGTC TGGTGATGGA TAAATGGTTT ATCCTGCAAG CCACCAGCCC GGCGGCGAAT
GTGCTGGAGA CGGTGCGCGG CCTGTTGCAG CATCGCTCAT TTACCATGAG CAACCCGAAC
CGTATTCGTT CGTTGATTGG CGCGTTTGCG GGCAGCAATC CGGCAGCGTT CCATGCCGAA
GATGGCAGCG GTTACCTGTT CCTGGTGGAA ATGCTTACCG ACCTCAACAG CCGTAACCCG
CAGGTGGCTT CACGTCTGAT TGAACCGCTG ATTCGCCTGA AACGTTACGA TGCCAAACGT
CAGGAGAAAA TGCGCGCGGC GCTGGAACAG TTGAAAGGGC TGGAAAATCT CTCTGGCGAT
CTGTACGAGA AGATAACTAA AGCACTGGCT TGA
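
The GC content field can be rechecked from the gene sequence itself. The sketch below is one possible way to do so in Python, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The function name `gc_percent` is hypothetical, and how the source database rounds its reported percentage is not known.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and newlines between the 10-base blocks) is
    ignored, and lower-case input is accepted.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Example with the first line of the gene sequence above.
first_line = "ATGACTCAAC AGCCACAAGC CAAATACCGT CACGATTATC GTGCGCCGGA TTACCAGATT"
print(round(gc_percent(first_line), 1))
```

Applying it to the full gene sequence should give a value comparable to the GC content reported in the Gene Information section.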
 
Protein sequence
MTQQPQAKYR HDYRAPDYQI TDIDLTFDLD AQKTVVTAVS QAVRHGASDA PLRLNGEDLK 
LVSVHINDEP WTAWKEEEGA LVISNLPERF TLKIINEISP AANTALEGLY QSGDALCTQC
EAEGFRHITY YLDRPDVLAR FTTKIIADKI KYPFLLSNGN RVAQGELENG RHWVQWQDPF
PKPCYLFALV AGDFDVLRDT FTTRSGREVA LELYVDRGNL DRAPWAMTSL KNSMKWDEER
FGLEYDLDIY MIVAVDFFNM GAMENKGLNI FNSKYVLART DTATDKDYLD IERVIGHEYF
HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRAVN RINNVRTMRG LQFAEDASPM
AHPIRPDMVI EMNNFYTLTV YEKGAEVIRM IHTLLGEENF QKGMQLYFER HDGSAATCDD
FVQAMEDASN VDLSHFRRWY SQSGTPIVTV KDDYNPETEQ YTLTISQRTP ATPDQAEKQP
LHIPFAIELY DNEGKVIPLQ KGGHPVNSVL NVTQAEQTFV FDNVYFQPVP ALLCEFSAPV
KLEYKWSDQQ LTFLMRHARN DFSRWDAAQS LLATYIKLNV ARHQQGQPLS LPVHVADAFR
AVLLDEKIDP ALAAEILTLP SVNEMAELFD IIDPIAIAEV REALTRTLAT ELADELLAIY
NANYQSEYRV EHEDIAKRTL RNACLRFLAF GETHLADVLV SKQFHEANNM TDALAALSAA
VAAQLPCRDA LMQEYDDKWH QNGLVMDKWF ILQATSPAAN VLETVRGLLQ HRSFTMSNPN
RIRSLIGAFA GSNPAAFHAE DGSGYLFLVE MLTDLNSRNP QVASRLIEPL IRLKRYDAKR
QEKMRAALEQ LKGLENLSGD LYEKITKALA
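
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows one way to reproduce that in plain Python. It builds the codon table from the standard NCBI base ordering (T, C, A, G), whose amino-acid assignments are shared by tables 1 and 11. The two tables differ only in which codons may act as start codons; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    bases = "".join(seq.split()).upper()  # drop spaces and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases of the gene give the first 10 residues of the protein.
print(translate_cds("ATGACTCAAC AGCCACAAGC CAAATACCGT"))  # MTQQPQAKYR
```

Running `translate_cds` on the full gene sequence and comparing the result with the protein sequence (after removing its spaces and line breaks) is a quick check that the two blocks of this record agree.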