Gene YpAngola_A1982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1982 
SymbolpepN 
ID5800452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2066086 
End bp2068701 
Gene Length2616 bp 
Protein Length871 aa 
Translation table11 
GC content47% 
IMG OID641339905 
Productaminopeptidase N 
Protein accessionYP_001606455 
Protein GI162418948 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.100486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACAAC AGCCGCAAGC TAAATACCGT CACGATTATC GTGCGCCGGA TTACACCATC 
ACCGATATCG ATCTGGACTT CGCACTGGAT GCGCAAAAAA CGACGGTAAC GGCGGTCAGT
AAAGTAAAAC GTCAAGGAAC AGACGTGACG CCGTTGATTC TAAATGGTGA AGATCTGACA
CTAATCAGTG TCAGCGTTGA TGGGCAAGCA TGGCCACATT ATCGCCAACA GGATAACACG
CTGGTTATTG AGCAATTACC CGCAGATTTT ACACTGACAA TTGTCAATGA TATTCATCCG
GCAACCAACA GCGCGTTGGA AGGGTTATAC CTGTCTGGTG AAGCGCTCTG CACACAGTGT
GAAGCCGAAG GGTTTCGCCA CATTACTTAT TATTTAGACC GTCCTGATGT GCTGGCTCGT
TTTACCACTC GCATCGTGGC GGATAAGTCT CGCTATCCAT ATCTGCTCTC CAACGGTAAC
CGTGTGGGGC AGGGTGAACT GGATGATGGT CGTCATTGGG TCAAGTGGGA AGATCCCTTT
CCGAAGCCTT CTTACCTGTT TGCTTTGGTC GCCGGTGATT TCGATGTGTT ACAGGATAAA
TTTATTACCC GTTCGGGTCG TGAAGTCGCC CTCGAAATTT TTGTTGACCG GGGTAATTTA
GATCGTGCTG ATTGGGCCAT GACATCCCTG AAAAACTCAA TGAAGTGGGA TGAAACTCGC
TTTGGTCTGG AATATGACCT CGACATCTAT ATGATTGTCG CTGTTGATTT TTTCAACATG
GGGGCGATGG AGAATAAAGG GTTAAATGTA TTTAACTCAA AATATGTGCT GGCGAAAGCA
GAAACAGCCA CGGACAAAGA TTATCTAAAT ATTGAAGCTG TTATCGGCCA TGAATATTTC
CATAACTGGA CGGGTAACCG CGTCACTTGC CGCGATTGGT TCCAACTAAG TCTGAAAGAG
GGCTTAACCG TATTCAGAGA TCAGGAGTTC AGCTCCGATT TAGGTTCACG CTCTGTCAAC
CGTATCGAAA ACGTGCGGGT AATGCGGGCA GCGCAGTTTG CGGAAGATGC CAGCCCGATG
GCACATGCTA TTCGCCCGGA TAAAGTGATA GAGATGAATA ACTTTTATAC ACTCACCGTG
TATGAAAAAG GTTCAGAAGT TATCCGAATG ATGCATACGC TACTGGGTGA ACAGCAATTC
CAAGCGGGTA TGCGGCTCTA TTTTGAACGT CATGATGGCA GTGCTGCGAC CTGCGATGAT
TTTGTACAGG CAATGGAGGA TGTATCAAAT GTCGATCTGT CTCTCTTCCG GCGTTGGTAC
AGCCAATCAG GGACACCATT ATTGACCGTG CATGACGATT ATGATGTTGA AAAACAGCAG
TATCATTTAT TCGTTAGCCA AAAAACGTTA CCGACAGCGG ATCAGCCAGA GAAATTGCCA
CTGCACATTC CGTTAGACAT TGAACTGTAT GATAGCAAAG GTAATGTCAT CCCATTGCAA
CATAATGGCT TGCCGGTTCA CCACGTGCTG AATGTGACTG AAGCTGAACA GACCTTTACT
TTTGATAATG TGGCACAGAA ACCGATTCCA TCGCTGTTGC GTGAGTTTTC TGCGCCCGTA
AAATTGGATT ACCCTTACAG TGATCAGCAA CTCACGTTCC TGATGCAACA TGCCCGCAAT
GAATTCTCTC GCTGGGATGC TGCGCAAAGC TTGCTGGCAA CTTACATTAA GTTGAATGTT
GCTAAATATC AGCAGCAACA GCCACTGAGT TTACCGGCGC ATGTGGCAGA TGCTTTCCGC
GCAATATTGC TGGATGAACA TCTTGATCCT GCCTTGGCCG CGCAGATTTT GACGTTGCCC
TCAGAAAATG AAATGGCCGA ACTGTTTACG ACTATCGATC CGCAAGCGAT CAGTACGGTA
CATGAAGCGA TCACGCGTTG TCTGGCTCAG GAACTGTCGG ATGAACTGTT GGCCGTATAT
GTCGCTAATA TGACGCCGGT TTACCGTATT GAGCATGGTG ATATTGCCAA ACGTGCTTTA
CGCAATACTT GCCTCAATTA TCTGGCCTTT GGTGATGAGG AATTTGCCAA TAAGCTTGTT
TCTTCACAAT ATCATCAAGC CGATAATATG ACGGATTCAT TGGCCGCATT GGCAGCGGCG
GTTGCCGCTC AGTTACCTTG CCGTGATGAG TTGTTGGCGG CCTTTGATGT GCGCTGGAAT
CATGACGGTT TGGTTATGGA TAAGTGGTTT GCTCTACAAG CGACCAGCCC GGCGGCGAAT
GTCCTGGTAC AGGTACGTAC CTTACTGAAA CACCCTGCAT TCAGTTTGAG TAATCCAAAC
CGTACCCGTT CGTTGATTGG TAGTTTTGCT TCCGGTAACC CGGCTGCATT CCATGCGGCT
GATGGTAGTG GTTATCAGTT CTTGGTTGAG ATACTCAGTG ACTTAAATAC CCGTAACCCA
CAAGTTGCTG CGCGGTTAAT TGAGCCGTTG ATCCGCCTGA AACGTTATGA TGCAGGGCGT
CAGGCGCTGA TGCGTAAGGC CTTGGAGCAA TTGAAAACGC TGGATAATTT ATCGGGTGAT
CTGTACGAGA AGATAACCAA AGCCCTGGCG GCATAA
 
Protein sequence
MTQQPQAKYR HDYRAPDYTI TDIDLDFALD AQKTTVTAVS KVKRQGTDVT PLILNGEDLT 
LISVSVDGQA WPHYRQQDNT LVIEQLPADF TLTIVNDIHP ATNSALEGLY LSGEALCTQC
EAEGFRHITY YLDRPDVLAR FTTRIVADKS RYPYLLSNGN RVGQGELDDG RHWVKWEDPF
PKPSYLFALV AGDFDVLQDK FITRSGREVA LEIFVDRGNL DRADWAMTSL KNSMKWDETR
FGLEYDLDIY MIVAVDFFNM GAMENKGLNV FNSKYVLAKA ETATDKDYLN IEAVIGHEYF
HNWTGNRVTC RDWFQLSLKE GLTVFRDQEF SSDLGSRSVN RIENVRVMRA AQFAEDASPM
AHAIRPDKVI EMNNFYTLTV YEKGSEVIRM MHTLLGEQQF QAGMRLYFER HDGSAATCDD
FVQAMEDVSN VDLSLFRRWY SQSGTPLLTV HDDYDVEKQQ YHLFVSQKTL PTADQPEKLP
LHIPLDIELY DSKGNVIPLQ HNGLPVHHVL NVTEAEQTFT FDNVAQKPIP SLLREFSAPV
KLDYPYSDQQ LTFLMQHARN EFSRWDAAQS LLATYIKLNV AKYQQQQPLS LPAHVADAFR
AILLDEHLDP ALAAQILTLP SENEMAELFT TIDPQAISTV HEAITRCLAQ ELSDELLAVY
VANMTPVYRI EHGDIAKRAL RNTCLNYLAF GDEEFANKLV SSQYHQADNM TDSLAALAAA
VAAQLPCRDE LLAAFDVRWN HDGLVMDKWF ALQATSPAAN VLVQVRTLLK HPAFSLSNPN
RTRSLIGSFA SGNPAAFHAA DGSGYQFLVE ILSDLNTRNP QVAARLIEPL IRLKRYDAGR
QALMRKALEQ LKTLDNLSGD LYEKITKALA A