Gene YpAngola_A3232 details


Gene Information

Locus tag: YpAngola_A3232
Symbol: ptrA
ID: 5801708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3424911
End bp: 3427799
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 47%
IMG OID: 641341060
Product: protease III precursor
Protein accession: YP_001607583
Protein GI: 162420459
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
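
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; neither convention is stated in this record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3424911
END_BP = 3427799
GENE_LENGTH_BP = 2889
PROTEIN_LENGTH_AA = 962


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks print True for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 3424911 to 3427799 covers 2889 bp, which corresponds to 963 codons: 962 amino acids plus the stop codon.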


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.730606
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAAAC AGCTTGCCCG CATTGTCGGT ATCGTTTTCT TTTTAGGTTT ACTGGCTCCA 
TCGAGTTGGG CTGCAATTCC TGCGTGGCAA CCTTTGGCGG AAACCATTTA CAAAAGTGAA
CATGACTTAC GTAAATATCA GGCAATAAAA TTATCCAATG GTATGACGGT ACTACTGGTT
TCCGATACAC AAGCCCCTAA ATCTTTGGCG GCCTTAGCGC TACCGGTAGG GTCACTGGAA
GACCCAGATA ATCAGCTCGG CTTGGCCCAC TACTTGGAGC ATATGGTGCT CATGGGGTCC
AAACATTTCC CTGAACCTGG CAGTTTTTCC GAATTCCTAA AAAAGCATGG GGGGAGCCAT
AATGCGAGCA CGGCTTCTTA CCGAACCGCT TTCTATCTTG AAATTGAAAA TGATGCATTG
GCACCTGCGG TTGAGCGGTT GGCGGATGCC ATTGCTGAAC CCTTGTTAGA CCCCATTAAT
GCGGATCGTG AGCGTAATGC TGTCAATGCT GAATTGACGA TGGCCCGTTC ACGCGATGGT
ATGCGTATGG CGCAAGTTAA TGCGGAAACG CTTAATCCGG CACATCCAAG CGCACGTTTT
TCTGGTGGTA ACCTTGAAAC TCTTAGGGAC AAGCCCGATG GCAAGCTACA TGATGAATTG
GTGAGCTTCT ATCACCGCTA CTATTCTGCC AATCTCATGG TCGGGGTGTT GTACAGCAAC
CAATCATTAG AGCAGCTAGC ACAATTAGCC GCAGACACCT TCGGCCGTAT CCCTAATAGG
GATGCAAAAG TACCCACGAT TACGGTCCCA GTCGTGACAC CTGATCAAAC GGGAATCATC
ATTCACTATG TTCCTGCTCA ACCTCGTAAA CAGATAAAAG TCGATTTCCG TATTGCGAAC
AATAGTGCGG ATTTTCGTAG TAAAACAGAT ACCTATATTA GCTACTTGAT CAGTAATCGC
AGTAAAAACA CCTTATCTGA CTGGCTACAA AAACAGGGGC TGGCAGATGC AATCAATGCG
GGTGCTGACC CAATGTTGGA TAGAAATGGG GGGGTGTTCT CTATCACTGT CTCACTAACC
GATAAGGGGC TGGCACAACG TGATGTTGTC GTCGCGGCAA TTTTTGATTA CATCAATATG
CTGCATAAAG AGGGGATTAA AAAAAGCTAT TTCGATGAAA TTGCACATGT ATTGAATCTC
GATTTCCGTT ATCCCTCTAT CACCCGTGAT ATGGACTATA TCGAATGGCT CGTGGATATG
ATGTTACGAG TTCCTGTTGC ACATACGCTT GATGCGCCTT ATCTGGCTGA TCAGTATGAT
CCCAAAGCAA TTGCATCCCG ATTGGCCGAG ATGACGCCTG AAAATGCCCG CATCTGGTTT
GTCAGCCCTG AAGAACCGCA TAACAAAGTA GCTTATTTTG TCGATGCACC TTATCAGGTC
GATAAAATAG GCGTACAGCG GATGAAAGAA TGGCAGCAAC TGGGGCAAAA GATTGCGCTA
AGTTTGCCAG CACTGAACCC GTACATTCCT GATAATTTCA CTCTGATCAA AGCAGACAAG
AACATTACCC GCCCACAGAA CGTGGCAGAC CAGCCAGGAT TGCGGGTGTT TTACATGCCA
AGTCAGTATT TTGCTGACGA ACCCAAAGCC GACATTACGG TTGCTTTCCG CAACCCTCAT
GCATTGAACT CAGCTCGCCA TCAGGTGCTT TTTGCCCTGA CGGATTACCT TGCGGGTCTC
TCACTTGATC AACTGAGTTA TCAGGCCTCT ATTGGTGGGA TCAGTTTCTC TACCGCACCG
AACAATGGCT TGTATGTTAA TGCTGGTGGT TTTACACAGC GTATGCCGCA GTTACTGACA
TCTTTGGTTT CAGGCTATGC CAGTTTTACT CCGACAGAAG AGCAATTGGT ACAGGCTAAA
TCTTGGTATC GTGAGCAATT AGACGTGGCA GAGAAAGGCA AAGCTTATGA GTTGGCTATT
CAGCCCGCTA AGCTGCTATC GAATGTGCCT TATTCGGAGC GAAGTGAGCG ACGTAAACTA
CTTGATAGCA TCAGTGTGCA GGATGTGCTG ACCTATAGAG ATGATTTACT GAAACAGTCT
GCGATAGAAG TTTTGGCCGT GGGCAATATG ACAGCCGAAC AAGTCACTGA ACTCACTGAA
TCGCTGAAAA AACAGCTGAA CTTAATCGGA ACCACGTGGT GGGTCGGTGA AGATGTCATT
ATTGAGAAAA CACAGTTGGC CAATATGGAA CGGGTCGGCA GTAGTTCTGA CGCGGCTTTG
GCAGCAGTTT ATGTCCCTAC TGGCTATACA GAAATTGCTG GTATGGCACG TAGTGCCTTG
TTAGGGCAGA TCATTCAACC ATGGTTCTAT GATCAATTAC GGACAGAAGA ACAGCTTGGT
TATGCTGTAT TTTCTTTCCC AATGTCGGTT GGTCATCAAT GGGGCATCGG CTTCTTACTG
CAAAGTAATA GTAAGGAACC TAATTATCTT TACCAGCGAT ACCTCGCATT TTATCCGCAA
GCTGAAAAAC GCCTGCGCGA AATGAAGCCC GATGATTTTG AACAATATAA GCAGGGGCTG
GTCAATCAAC TATTGCAAAG GCCACAGACA TTAGATGAAG AGGCAGAGCG CTACCGTAAA
GACTTCAATC TCAATAATTT TGCATTTGAT AGCCGTGAGA AGATGATTGC TCAAGTGAAA
CAGCTTACGG CTAATGAACT GGCGGATTTC TTCCAGCAAG CGGTCATTAA ACCACAAGGT
CTGGCACTGC TTTCTCAAGT TAAAGGTCAG GGGCAGGCCG GAGGATTCGC AGTACCGGAG
GGATGGACGA CTTATCCAAC CACTTCCGCT TTACAGGCTA CGCTGCCGCA GAAGGTATTG
GCACCATGA
 
Protein sequence
MHKQLARIVG IVFFLGLLAP SSWAAIPAWQ PLAETIYKSE HDLRKYQAIK LSNGMTVLLV 
SDTQAPKSLA ALALPVGSLE DPDNQLGLAH YLEHMVLMGS KHFPEPGSFS EFLKKHGGSH
NASTASYRTA FYLEIENDAL APAVERLADA IAEPLLDPIN ADRERNAVNA ELTMARSRDG
MRMAQVNAET LNPAHPSARF SGGNLETLRD KPDGKLHDEL VSFYHRYYSA NLMVGVLYSN
QSLEQLAQLA ADTFGRIPNR DAKVPTITVP VVTPDQTGII IHYVPAQPRK QIKVDFRIAN
NSADFRSKTD TYISYLISNR SKNTLSDWLQ KQGLADAINA GADPMLDRNG GVFSITVSLT
DKGLAQRDVV VAAIFDYINM LHKEGIKKSY FDEIAHVLNL DFRYPSITRD MDYIEWLVDM
MLRVPVAHTL DAPYLADQYD PKAIASRLAE MTPENARIWF VSPEEPHNKV AYFVDAPYQV
DKIGVQRMKE WQQLGQKIAL SLPALNPYIP DNFTLIKADK NITRPQNVAD QPGLRVFYMP
SQYFADEPKA DITVAFRNPH ALNSARHQVL FALTDYLAGL SLDQLSYQAS IGGISFSTAP
NNGLYVNAGG FTQRMPQLLT SLVSGYASFT PTEEQLVQAK SWYREQLDVA EKGKAYELAI
QPAKLLSNVP YSERSERRKL LDSISVQDVL TYRDDLLKQS AIEVLAVGNM TAEQVTELTE
SLKKQLNLIG TTWWVGEDVI IEKTQLANME RVGSSSDAAL AAVYVPTGYT EIAGMARSAL
LGQIIQPWFY DQLRTEEQLG YAVFSFPMSV GHQWGIGFLL QSNSKEPNYL YQRYLAFYPQ
AEKRLREMKP DDFEQYKQGL VNQLLQRPQT LDEEAERYRK DFNLNNFAFD SREKMIAQVK
QLTANELADF FQQAVIKPQG LALLSQVKGQ GQAGGFAVPE GWTTYPTTSA LQATLPQKVL
AP
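
The gene and protein sequences above are related by translation table 11. The following is a minimal sketch of that translation, assuming the sequences are pasted with their 10-character spacing intact. The codon assignments used are those of the standard genetic code, which table 11 shares for elongation. Table 11 also allows alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here. The names `clean` and `translate` are illustrative, not part of any database tool.

```python
# Codon table in the conventional TCAG ordering (standard code; table 11
# uses the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCATAAAC AGCTTGCCCG CATTGTCGGT"))  # MHKQLARIVG
# Final 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCACCATGA"))  # AP
```

The two fragments reproduce the first ten and last two residues of the protein sequence listed above.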