Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpAngola_A2346 |
Symbol | sppA |
ID | 5800816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pestis Angola |
Kingdom | Bacteria |
Replicon accession | NC_010159 |
Strand | + |
Start bp | 2459295 |
End bp | 2461145 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641340231 |
Product | protease 4 |
Protein accession | YP_001606775 |
Protein GI | 162421479 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000106232 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00400443 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGCACTT TGTGGCGAAT TATTGCTGGT TTTTTCAAGT GGACCTGGCG TCTACTGAAT TTCACCAGAG AGCTTATCCT TAATCTTTTT TTGGCACTGC TTATTTTGAT CGGTGTCGGT ATTTATTTTC AATTTCAAAG TAAGCCTGTG GAGCCAGTTA AAGGTGCCCT GCTGGTAAAT TTAAGCGGTG TGATCGTAGA TCAACCTGCT ATCAATAATA AACTACGCCA ATGGGGACGT GAGCTATTAG GGGCATCCAG TAACCGCCTA CAGGAAAACT CCCTGTTTGA TATTGTAGAA ACCATTCGAC TGGCAAAAGA CGATGACAAT ATTAACGGCC TGGTTCTGTC TCTGAGTGAC TTAACGGGGG CAGATCAATC CTCGCTGCAA TATATTGGTA AGGCTCTACG GGAATTTCGC GACACCGGCA AGAAAATTTA TGCCGTCGGC GACAGCTACA ATCAAACCCA ATACTATTTA GCCAGTTTTG CTAATAAGAT TTATCTCTCG CCACAAGGGG CTGTTGACCT ACACGGTTTT GCCAGCAATA ACCTTTATTA CAAGTCACTG CTGGAAAATC TTAAAGTCAC GACCAATATC TTCCGCGTAG GGACTTATAA ATCAGCAGTT GAACCGATGA TCCGTAATGA TATGTCCGCC GCTGCCCGTG AAGCCGATAG CCGCTGGGTG GGTGGCTTGT GGCAAAACTA CCTCACCACG GTCTCCGCTA ACCGCCGACT CACCCCAGAA CAACTGTTCC CTGGGGCCGC AGGGGTGATC AGTGGCTTAC AGGTAGCGGG TGGCTCACAA GCTAAATACG CACTGGATAG CAAACTGGTA GACCAATTAG CAGCCCGACC AGAAGTAGAA AGTGCACTGG TTGAAGCCTT TGGCTGGAAT AAAAAGACTA ATGACTTCAA CTACATCAGT ATTTATGACT ATCAGCCAAC ACCCGCACCA CAGCAAGGGG AACAGATAGC GGTTCTCTTC GCTAACGGAG CAATTATTGA CGGCCCGCAA CCCCCTGGGA ACGTGGGGGG AGATACGCTG GCAGCACAAA TTCGCCAGGC CCGTTTAGAT CCAAAGATTA AAGCGGTTAT CTTGCGTGTA AACAGCCCTG GCGGCAGTGT GAGCGCTTCT GAACTGATCC GCGCCGAATT GGCTGCATTA CGTGCTGCCC ATAAGCCATT GGTGGTATCA ATGGGTGGAA TGGCGGCTTC TGGTGGATAT TGGATCTCAA CACCAGCGAA CTATATCGTT GCCAGCCCAA GCACTCTGAC CGGCTCCATT GGTATTTTCG GTGTGATTAA CACCTTCCAA AATTCACTGG CAAGTATTGG TGTCCATACA GACGGTGTCG CGACATCACC ATTAGCTGAC GTGTCACTGA CCAAAGCATT GCCGCCTGAG TTCTCTCAGA TGATGCAAAT CAATATCGAA AATGGGTATA AAACCTTTAT TGATTTGGTT GCAACTTCTC GCCACAAGAC CCCTGAGCAA GTGGATCAAA TTGCACAAGG CCATGTGTGG ATTGGCCTCG ATGCTAAAAG TAACGGTTTA GTCGATCAAC TCGGCGATTT TGATGATGCC GTGAAAAAGG CGGCAGAACT GGCCAAACTG AAGACTTGGC AATTGAATTG GTTTGTTGAT GAGCCAAGCT TAAGCGACCT TATTTTGGGT CAAATGAGCG CGTCGGTGCA CGCTATGTTG CCAGCGGCTA TTCAGACGTG GTTACCCGCG CCTTTATCTG CTATGGCGCT TGCGGTTAAA GATCAACCTG GTTTATTCAA TACCCTGAAT GATCCGCAGA ATCGTTATGC TCTGTGCCTA ACCTGTGGTG ATGTCCGCTA A
|
Protein sequence | MRTLWRIIAG FFKWTWRLLN FTRELILNLF LALLILIGVG IYFQFQSKPV EPVKGALLVN LSGVIVDQPA INNKLRQWGR ELLGASSNRL QENSLFDIVE TIRLAKDDDN INGLVLSLSD LTGADQSSLQ YIGKALREFR DTGKKIYAVG DSYNQTQYYL ASFANKIYLS PQGAVDLHGF ASNNLYYKSL LENLKVTTNI FRVGTYKSAV EPMIRNDMSA AAREADSRWV GGLWQNYLTT VSANRRLTPE QLFPGAAGVI SGLQVAGGSQ AKYALDSKLV DQLAARPEVE SALVEAFGWN KKTNDFNYIS IYDYQPTPAP QQGEQIAVLF ANGAIIDGPQ PPGNVGGDTL AAQIRQARLD PKIKAVILRV NSPGGSVSAS ELIRAELAAL RAAHKPLVVS MGGMAASGGY WISTPANYIV ASPSTLTGSI GIFGVINTFQ NSLASIGVHT DGVATSPLAD VSLTKALPPE FSQMMQINIE NGYKTFIDLV ATSRHKTPEQ VDQIAQGHVW IGLDAKSNGL VDQLGDFDDA VKKAAELAKL KTWQLNWFVD EPSLSDLILG QMSASVHAML PAAIQTWLPA PLSAMALAVK DQPGLFNTLN DPQNRYALCL TCGDVR
|
| |