Gene Information |
Locus tag | YpAngola_A3232 |
Symbol | ptrA |
ID | 5801708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | - |
Start bp | 3424911 |
End bp | 3427799 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641341060 |
Product | protease III precursor |
Protein accession | YP_001607583 |
Protein GI | 162420459 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.730606 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCATAAAC AGCTTGCCCG CATTGTCGGT ATCGTTTTCT TTTTAGGTTT ACTGGCTCCA TCGAGTTGGG CTGCAATTCC TGCGTGGCAA CCTTTGGCGG AAACCATTTA CAAAAGTGAA CATGACTTAC GTAAATATCA GGCAATAAAA TTATCCAATG GTATGACGGT ACTACTGGTT TCCGATACAC AAGCCCCTAA ATCTTTGGCG GCCTTAGCGC TACCGGTAGG GTCACTGGAA GACCCAGATA ATCAGCTCGG CTTGGCCCAC TACTTGGAGC ATATGGTGCT CATGGGGTCC AAACATTTCC CTGAACCTGG CAGTTTTTCC GAATTCCTAA AAAAGCATGG GGGGAGCCAT AATGCGAGCA CGGCTTCTTA CCGAACCGCT TTCTATCTTG AAATTGAAAA TGATGCATTG GCACCTGCGG TTGAGCGGTT GGCGGATGCC ATTGCTGAAC CCTTGTTAGA CCCCATTAAT GCGGATCGTG AGCGTAATGC TGTCAATGCT GAATTGACGA TGGCCCGTTC ACGCGATGGT ATGCGTATGG CGCAAGTTAA TGCGGAAACG CTTAATCCGG CACATCCAAG CGCACGTTTT TCTGGTGGTA ACCTTGAAAC TCTTAGGGAC AAGCCCGATG GCAAGCTACA TGATGAATTG GTGAGCTTCT ATCACCGCTA CTATTCTGCC AATCTCATGG TCGGGGTGTT GTACAGCAAC CAATCATTAG AGCAGCTAGC ACAATTAGCC GCAGACACCT TCGGCCGTAT CCCTAATAGG GATGCAAAAG TACCCACGAT TACGGTCCCA GTCGTGACAC CTGATCAAAC GGGAATCATC ATTCACTATG TTCCTGCTCA ACCTCGTAAA CAGATAAAAG TCGATTTCCG TATTGCGAAC AATAGTGCGG ATTTTCGTAG TAAAACAGAT ACCTATATTA GCTACTTGAT CAGTAATCGC AGTAAAAACA CCTTATCTGA CTGGCTACAA AAACAGGGGC TGGCAGATGC AATCAATGCG GGTGCTGACC CAATGTTGGA TAGAAATGGG GGGGTGTTCT CTATCACTGT CTCACTAACC GATAAGGGGC TGGCACAACG TGATGTTGTC GTCGCGGCAA TTTTTGATTA CATCAATATG CTGCATAAAG AGGGGATTAA AAAAAGCTAT TTCGATGAAA TTGCACATGT ATTGAATCTC GATTTCCGTT ATCCCTCTAT CACCCGTGAT ATGGACTATA TCGAATGGCT CGTGGATATG ATGTTACGAG TTCCTGTTGC ACATACGCTT GATGCGCCTT ATCTGGCTGA TCAGTATGAT CCCAAAGCAA TTGCATCCCG ATTGGCCGAG ATGACGCCTG AAAATGCCCG CATCTGGTTT GTCAGCCCTG AAGAACCGCA TAACAAAGTA GCTTATTTTG TCGATGCACC TTATCAGGTC GATAAAATAG GCGTACAGCG GATGAAAGAA TGGCAGCAAC TGGGGCAAAA GATTGCGCTA AGTTTGCCAG CACTGAACCC GTACATTCCT GATAATTTCA CTCTGATCAA AGCAGACAAG AACATTACCC GCCCACAGAA CGTGGCAGAC CAGCCAGGAT TGCGGGTGTT TTACATGCCA AGTCAGTATT TTGCTGACGA ACCCAAAGCC GACATTACGG TTGCTTTCCG CAACCCTCAT GCATTGAACT CAGCTCGCCA TCAGGTGCTT TTTGCCCTGA CGGATTACCT TGCGGGTCTC TCACTTGATC AACTGAGTTA TCAGGCCTCT ATTGGTGGGA TCAGTTTCTC TACCGCACCG 
AACAATGGCT TGTATGTTAA TGCTGGTGGT TTTACACAGC GTATGCCGCA GTTACTGACA TCTTTGGTTT CAGGCTATGC CAGTTTTACT CCGACAGAAG AGCAATTGGT ACAGGCTAAA TCTTGGTATC GTGAGCAATT AGACGTGGCA GAGAAAGGCA AAGCTTATGA GTTGGCTATT CAGCCCGCTA AGCTGCTATC GAATGTGCCT TATTCGGAGC GAAGTGAGCG ACGTAAACTA CTTGATAGCA TCAGTGTGCA GGATGTGCTG ACCTATAGAG ATGATTTACT GAAACAGTCT GCGATAGAAG TTTTGGCCGT GGGCAATATG ACAGCCGAAC AAGTCACTGA ACTCACTGAA TCGCTGAAAA AACAGCTGAA CTTAATCGGA ACCACGTGGT GGGTCGGTGA AGATGTCATT ATTGAGAAAA CACAGTTGGC CAATATGGAA CGGGTCGGCA GTAGTTCTGA CGCGGCTTTG GCAGCAGTTT ATGTCCCTAC TGGCTATACA GAAATTGCTG GTATGGCACG TAGTGCCTTG TTAGGGCAGA TCATTCAACC ATGGTTCTAT GATCAATTAC GGACAGAAGA ACAGCTTGGT TATGCTGTAT TTTCTTTCCC AATGTCGGTT GGTCATCAAT GGGGCATCGG CTTCTTACTG CAAAGTAATA GTAAGGAACC TAATTATCTT TACCAGCGAT ACCTCGCATT TTATCCGCAA GCTGAAAAAC GCCTGCGCGA AATGAAGCCC GATGATTTTG AACAATATAA GCAGGGGCTG GTCAATCAAC TATTGCAAAG GCCACAGACA TTAGATGAAG AGGCAGAGCG CTACCGTAAA GACTTCAATC TCAATAATTT TGCATTTGAT AGCCGTGAGA AGATGATTGC TCAAGTGAAA CAGCTTACGG CTAATGAACT GGCGGATTTC TTCCAGCAAG CGGTCATTAA ACCACAAGGT CTGGCACTGC TTTCTCAAGT TAAAGGTCAG GGGCAGGCCG GAGGATTCGC AGTACCGGAG GGATGGACGA CTTATCCAAC CACTTCCGCT TTACAGGCTA CGCTGCCGCA GAAGGTATTG GCACCATGA
Protein sequence | MHKQLARIVG IVFFLGLLAP SSWAAIPAWQ PLAETIYKSE HDLRKYQAIK LSNGMTVLLV SDTQAPKSLA ALALPVGSLE DPDNQLGLAH YLEHMVLMGS KHFPEPGSFS EFLKKHGGSH NASTASYRTA FYLEIENDAL APAVERLADA IAEPLLDPIN ADRERNAVNA ELTMARSRDG MRMAQVNAET LNPAHPSARF SGGNLETLRD KPDGKLHDEL VSFYHRYYSA NLMVGVLYSN QSLEQLAQLA ADTFGRIPNR DAKVPTITVP VVTPDQTGII IHYVPAQPRK QIKVDFRIAN NSADFRSKTD TYISYLISNR SKNTLSDWLQ KQGLADAINA GADPMLDRNG GVFSITVSLT DKGLAQRDVV VAAIFDYINM LHKEGIKKSY FDEIAHVLNL DFRYPSITRD MDYIEWLVDM MLRVPVAHTL DAPYLADQYD PKAIASRLAE MTPENARIWF VSPEEPHNKV AYFVDAPYQV DKIGVQRMKE WQQLGQKIAL SLPALNPYIP DNFTLIKADK NITRPQNVAD QPGLRVFYMP SQYFADEPKA DITVAFRNPH ALNSARHQVL FALTDYLAGL SLDQLSYQAS IGGISFSTAP NNGLYVNAGG FTQRMPQLLT SLVSGYASFT PTEEQLVQAK SWYREQLDVA EKGKAYELAI QPAKLLSNVP YSERSERRKL LDSISVQDVL TYRDDLLKQS AIEVLAVGNM TAEQVTELTE SLKKQLNLIG TTWWVGEDVI IEKTQLANME RVGSSSDAAL AAVYVPTGYT EIAGMARSAL LGQIIQPWFY DQLRTEEQLG YAVFSFPMSV GHQWGIGFLL QSNSKEPNYL YQRYLAFYPQ AEKRLREMKP DDFEQYKQGL VNQLLQRPQT LDEEAERYRK DFNLNNFAFD SREKMIAQVK QLTANELADF FQQAVIKPQG LALLSQVKGQ GQAGGFAVPE GWTTYPTTSA LQATLPQKVL AP
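The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length), and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers written for this illustration, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_length = span_length(3424911, 3427799)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 2889 962
```

Both results agree with the Gene Length (2889 bp) and Protein Length (962 aa) fields. The strand is irrelevant to this check, since only the span and the codon count are used.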