Gene YpAngola_A4086 details


Gene Information

Locus tag: YpAngola_A4086
Symbol:
ID: 5802565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4353221
End bp: 4355263
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 52%
IMG OID: 641341865
Product: oligopeptidase A
Protein accession: YP_001608371
Protein GI: 162420283
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
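
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4353221
end_bp = 4355263

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 2043 bp and protein length of 680 aa.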


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.902618
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAACC CGCTGTTGAC TCCGTTCTCC CTGCCACCGT TTTCTGCTAT TCGGCCTGAA 
GATATCGTGC CTGCGGTGAA ATCCGCGCTG GATGAATGCC GTCAAGCGGT AGAGCGTGTG
GTTGCCCAAT CAGGGCCGTT CACCTGGGAT AATCTGTGTC AGCCACTGGC CGAATCCGAT
GACCGCTTAT CGCGCATTTG GTCACCCGTA GGCCACTTGA ACTCAGTAAA AAATAGCCCT
GAGCTGCGTA CCGCTTATGA ACAAAGCTTG CCATTGCTGT CGGAATACGG CACTTGGGTG
GGGCAGCATA AAGGTTTGTA TCAGGCGTAT GTCAGCCTGA AAGAGGGGCC GGGTTTTGCC
GCCTTGACCG CACCGCAGCG CAAAGCGGTA GAAAATGCTC TGCGTGACTT CCAGCTATCC
GGTATTGGTC TGGCGCCTGA ACAACAAAAG CGTTACGGCG AAATCGTGGC TCGCTTGTCG
GAGCTTGGCT CGACTTACAG CAATAACGTG CTTGATGCCA CCATGGGGTG GAGCAAACTG
ATTACCGATG TTGAGCAACT GAAAGGTTTG CCAGAAAGCG CGCTCGCAGC GGCCAAAGCC
ATGGCAGAAG CCAAAGAGCA GGACGGCTGG TTGCTGACAC TGGATATGCC AAGCTATCTG
CCGGTACTGA CTTATGCCGA TAACGTGGAA TTGCGCGAAG AGATGTACCG TGCATTTGCC
ACCCGTGCTT CTGATCAGGG GCCAAACGCG GGGAAATGGG ATAACAGCGA GATCATGGCG
GAAATTCTGA CACTGCGTCA TGAATTAGCG CAGTTGCTCG GTTTTAACAG TTATGCCGAT
AAATCGCTGG CCACCAAAAT GGCAGAAAAC CCACAGCAGG TATTGGGCTT CCTGAACGAT
CTGGCGAAGC GCGCCCGCCC GCAAGCAGAA GAAGAGCTGG CTCAGTTACG TGCTTTTGCT
AAAGAGCATT ATGGTGTCAG TGAACTCCAG GCTTGGGATA TCACCTATTA TTCCGAGAAA
CAGAAACAGC ATCTGTTTGC TATCAGTGAT GAACAACTTC GTCCTTATTT CCCGGAACAG
CGGGTGGTGG AAGGTTTATT CGAAGTGGTG AAACGCATTT ATGGCATTAC AGCCAAAGAG
CGCCATGATG TCGATACCTG GCATCCGGAT GTCCGCTTCT TCGATTTGTT TGATGCCGAT
GGTGAACTGC GCGGTAGCTT CTACCTTGAT TTGTATGCGC GCGAAAACAA GCGTGGCGGA
GCCTGGATGG ATGACTGCGT AGGTAGCCTG CGTTTGGCTA ATGGCCAACT GCAAAAACCA
GTCGCTTATC TGACTTGCAA TTTTAACGGG CCCGTTGGCG GCAAACCGGC GCTGTTTACT
CACAATGAAG TGACCACCTT GTTCCATGAG TTCGGCCATG GTTTACATCA TATGCTGACC
AAAATTGATA CCGCAGGCGT TTCTGGTATC AATGGCGTGC CTTGGGATGC AGTCGAGCTG
CCAAGTCAGT TTATGGAAAA CTGGTGCTGG GAGCCGGAGG CGCTGGCCTT TATTTCTGGT
CATTACCAAA CTCATGAGCC TTTGCCGCAA GAGATGCTGG ATAAACTACT GGCGGCGAAA
AACTATCAGG CGGCGTTGTT TATTCTGCGC CAACTGGAGT TTGGTCTGTT CGATTTCCGG
ATGCATTATG AGTTCGACCC GCTGACCGGT GCGCAGATCC TGCCTATTTT GTATGAAGTG
AAAAAACAGG TTGCTGTGGT GCCATCACCG GAATGGGGCC GCTTCCCTCA TGCCTTCAGC
CATATTTTTG CTGGCGGTTA TGCGGCCGGT TATTACAGCT ATTTATGGGC TGAAGTGCTC
TCGGCGGATG CGTTCTCACG CTTTGAAGAA GAAGGGATTT TTAATGCCGC TACCGGTCAG
TCCTTCCTCG ACAACATTCT GTCTCAAGGT GGCTCAGAGG AGCCAATGAC ACTGTTCAAA
CGCTTCCGTG GCCGTGAACC GCAGTTAGAT GCCATGTTGC GTCATTACGG TATTAAGGGC
TAA
 
Protein sequence
MTNPLLTPFS LPPFSAIRPE DIVPAVKSAL DECRQAVERV VAQSGPFTWD NLCQPLAESD 
DRLSRIWSPV GHLNSVKNSP ELRTAYEQSL PLLSEYGTWV GQHKGLYQAY VSLKEGPGFA
ALTAPQRKAV ENALRDFQLS GIGLAPEQQK RYGEIVARLS ELGSTYSNNV LDATMGWSKL
ITDVEQLKGL PESALAAAKA MAEAKEQDGW LLTLDMPSYL PVLTYADNVE LREEMYRAFA
TRASDQGPNA GKWDNSEIMA EILTLRHELA QLLGFNSYAD KSLATKMAEN PQQVLGFLND
LAKRARPQAE EELAQLRAFA KEHYGVSELQ AWDITYYSEK QKQHLFAISD EQLRPYFPEQ
RVVEGLFEVV KRIYGITAKE RHDVDTWHPD VRFFDLFDAD GELRGSFYLD LYARENKRGG
AWMDDCVGSL RLANGQLQKP VAYLTCNFNG PVGGKPALFT HNEVTTLFHE FGHGLHHMLT
KIDTAGVSGI NGVPWDAVEL PSQFMENWCW EPEALAFISG HYQTHEPLPQ EMLDKLLAAK
NYQAALFILR QLEFGLFDFR MHYEFDPLTG AQILPILYEV KKQVAVVPSP EWGRFPHAFS
HIFAGGYAAG YYSYLWAEVL SADAFSRFEE EGIFNAATGQ SFLDNILSQG GSEEPMTLFK
RFRGREPQLD AMLRHYGIKG
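
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code; it does not handle alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Amino acid assignments for translation table 11, with codons
# enumerated in T, C, A, G order at each position ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as printed in the record.
first_line = (
    "ATGACAAACC CGCTGTTGAC TCCGTTCTCC "
    "CTGCCACCGT TTTCTGCTAT TCGGCCTGAA"
)
print(translate(first_line))

# Last four codons of the gene sequence, ending in the TAA stop codon.
print(translate("ATTAAGGGC TAA"))
```

The first call reproduces the first 20 residues of the protein sequence, and the second reproduces its final three residues before the stop codon.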