Gene YpAngola_A0987 details


Gene Information

Locus tag: YpAngola_A0987
Symbol: degP
ID: 5799450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1009817
End bp: 1011262
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 50%
IMG OID: 641338976
Product: serine endoprotease
Protein accession: YP_001605548
Protein GI: 162418209
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
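
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record itself states neither convention. The function name `check_gene_record` is hypothetical and not part of any database API.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return (span_ok, coding_ok) for an unspliced CDS record.

    Assumes 1-based inclusive coordinates and a gene length that
    counts the terminal stop codon (both are assumptions).
    """
    # Inclusive span: end - start + 1 bases.
    span_ok = (end_bp - start_bp + 1) == gene_length_bp
    # Whole number of codons, with one codon reserved for the stop.
    coding_ok = (gene_length_bp % 3 == 0
                 and gene_length_bp // 3 - 1 == protein_length_aa)
    return span_ok, coding_ok


# Values taken from the record above.
print(check_gene_record(1009817, 1011262, 1446, 481))  # (True, True)
```

Under these assumptions, 1011262 - 1009817 + 1 = 1446 bp, and 1446 / 3 = 482 codons, that is 481 amino acids plus one stop codon.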


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0018316
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA CAACTTTAGT ATTAAGTGCA TTGGCATTGA GCATTGGTTT CGCCATGGGC 
CCGGTTTCTT CCGTCGTTGC GGCAGAGACG GCAGCATCGA GTAGCCAGCA GCTCCCTAGC
CTGGCGCCAA TGCTAGAGAA AGTAATGCCT TCAGTGGTCA GTATCAACGT TGAAGGTAGT
GCGCCTGTAA GCAGTGCTGG TGCACGCGGT ATGCCACCAC AATTCCAGCA GTTTTTTGGT
GATAACTCGC CATTCTGTCA GGACGGTTCA CCGTTCCAAG GCTCGCCAAT GTGTCAAGGG
GATCTGGGCG GACTAGGGCA GGGAATGCCA AGTAAGCGGG AATTCCGTTC GCTTGGTTCA
GGTGTCATTA TTGATGCGGG CAAGGGGTAT GTCGTTACCA ATAACCACGT GGTCGATAAT
GCGAACAAGA TCAGCGTAAA ACTGAGCGAT GGCCGCAGTT TTGATGCCAA GGTGATCGGT
AAAGATCCAC GTACCGATAT CGCACTGTTA CAACTGAAAG ACGCTAAAAA TCTGACTGCG
ATTAAGATTG CCAATTCGGA TCAACTGCGT GTCGGTGATT ATACCGTCGC TATCGGGAAC
CCGTATGGCT TGGGTGAAAC CGTGACATCC GGTATTGTCT CTGCTTTAGG GCGCAGTGGT
TTGAATGTAG AAAACTATGA AAACTTTATC CAGACTGATG CGGCGATTAA CCGTGGTAAT
TCCGGCGGCG CATTAATCAA CCTGAACGGT GAGTTGATTG GTATTAACAC CGCTATTCTG
GCACCGGATG GCGGTAACAT TGGTATTGGC TTTGCTATCC CAAGCAACAT GGTGAAGAAC
CTGACATCAC AGATGGTTGA GTTTGGTCAG GTAAAACGCG GTGAACTGGG CATTATGGGG
ACCGAGCTAA ACTCTGAACT GGCAAAAGCC ATGAAGGTTG ATGCGCAGAA AGGTGCCTTT
ATCAGCCAGG TCGTGCCTAA ATCTGCTGCG GCAAAAGCGG GTATCAAAGC GGGCGATATC
ATTGTCAGTA TGAATGGGAA AGCCATCAAT AGTTTTGCAG GGTTCCGCGC CGAGATCGGC
ACGTTACCTG TTGGCAGCAA AATGACCTTG GGTCTGCTGC GTGATGGCAA ACCGATCAAT
GTGGATGTCG TCCTGGAGCA GAGCAGCCAC AGTCAGGTGG AATCCGGCAA TCTCTACACC
GGTATTGAGG GGGCTGAACT GAGTAACAGC GACGTTAGCG GCAAGAAAGG GGTGAAAGTT
GATAGCGTAA AACCAGGCAC TGCTGCGGCG CGTATCGGCC TGAAAAAAGG TGATATCATC
ATGGGGATTA ACCAGCAACC AGTCCAGAAC CTAGGTGAGC TGCGGAAAAT CCTCGATGCT
AAACCACCGG TATTGGCGTT GAATATTCAA CGTGGTGATA CTTCACTCTA TTTATTGATG
CAGTAA
 
Protein sequence
MKKTTLVLSA LALSIGFAMG PVSSVVAAET AASSSQQLPS LAPMLEKVMP SVVSINVEGS 
APVSSAGARG MPPQFQQFFG DNSPFCQDGS PFQGSPMCQG DLGGLGQGMP SKREFRSLGS
GVIIDAGKGY VVTNNHVVDN ANKISVKLSD GRSFDAKVIG KDPRTDIALL QLKDAKNLTA
IKIANSDQLR VGDYTVAIGN PYGLGETVTS GIVSALGRSG LNVENYENFI QTDAAINRGN
SGGALINLNG ELIGINTAIL APDGGNIGIG FAIPSNMVKN LTSQMVEFGQ VKRGELGIMG
TELNSELAKA MKVDAQKGAF ISQVVPKSAA AKAGIKAGDI IVSMNGKAIN SFAGFRAEIG
TLPVGSKMTL GLLRDGKPIN VDVVLEQSSH SQVESGNLYT GIEGAELSNS DVSGKKGVKV
DSVKPGTAAA RIGLKKGDII MGINQQPVQN LGELRKILDA KPPVLALNIQ RGDTSLYLLM
Q
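
Both sequences are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The sketch below shows one way to do that and to translate the gene sequence for comparison with the protein sequence. The helper names `join_blocks` and `translate` are hypothetical. The codon table is written out in the standard NCBI base order (T, C, A, G); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Unknown codons (for example those containing N) become 'X'.
    """
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAAAAAA CAACTTTAGT ATTAAGTGCA TTGGCATTGA GCATTGGTTT CGCCATGGGC"
print(translate(join_blocks(first_line)))  # MKKTTLVLSALALSIGFAMG
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Applying the same two calls to the full gene sequence gives a way to compare it against the full protein listing.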