Gene YpAngola_A4010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4010 
Symbol 
ID5802489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4266817 
End bp4267812 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content53% 
IMG OID641341796 
ProductU32 family peptidase 
Protein accessionYP_001608303 
Protein GI162420445 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTGC TGTGTCCTGC GGGTAACTTA CCTGCATTAA AGGCCGCAAT CGACAATGGT 
GCCGATGCGG TTTATATCGG TTTGAAAGAT GATACTAACG CCCGCCATTT TGCCGGCCTC
AATTTCACCG ATAAAAAGCT ACAAGAAGCC GTTAACTACG TTCATAGCCG TAAACGTAAA
TTACATATCG CCATTAACAC CTTCGCCCAC CCTGACGGCT ATGCACGCTG GCAACGGGCG
GTGGATATGG CCGCACAACT TGGAGCTGAT GCGCTGATCT TGGCAGACCT GGCGATGCTG
GAATATGCTG CTGAACGTTA TCCGCAGGTT GAGCGCCATG TCTCGGTACA AGCTTCTGCC
ACCAACGATG AAGCCATCCG CTTTTATCAA CGCCATTTCG ATGTCGCCCG GGTGGTATTG
CCGCGTGTGC TCTCAATGCA TCAGGTAAAA CAGTTATCAC GTACCAGCCC GGTTCCGCTG
GAAGTCTTCG CTTTTGGTAG TTTGTGTATT ATGGCTGAAG GCCGTTGCTA CCTTTCTTCC
TATTTGACCG GTGAATCACC CAATACCGTG GGTGCTTGCT CGCCCGCACG CTTTGTACGT
TGGCAACAAA CCCCACAAGG GATGGAATCC CGCCTCAACG ACGTGTTGAT TGACCGCTAC
GAAGACCATG AAAATGCCGG TTATCCTACA CTGTGCAAAG GCCGCTATCG GGTAGATGGC
CAGCGCTACC ATGCACTGGA AGAGCCGACG AGCCTGAATA CCCTTGAGCT ATTACCCGAA
TTATTTGCTG CCAATATTGC CTCAGTAAAA ATTGAAGGAC GCCAGCGCAG CCCAGCCTAC
GTCAGCCAGG TTGCCAAAGT CTGGCGACAG GCGATTGACC TCTATCAGGC GAATCCGGCG
CAATTTGCCG CCAAAGCAGA ATGGATGGAG CAACTGGGCG CAATGTCCGA AGGTACCCAG
ACAACACTGG GCGCTTATCA TCGTAAGTGG CAGTAG
 
Protein sequence
MELLCPAGNL PALKAAIDNG ADAVYIGLKD DTNARHFAGL NFTDKKLQEA VNYVHSRKRK 
LHIAINTFAH PDGYARWQRA VDMAAQLGAD ALILADLAML EYAAERYPQV ERHVSVQASA
TNDEAIRFYQ RHFDVARVVL PRVLSMHQVK QLSRTSPVPL EVFAFGSLCI MAEGRCYLSS
YLTGESPNTV GACSPARFVR WQQTPQGMES RLNDVLIDRY EDHENAGYPT LCKGRYRVDG
QRYHALEEPT SLNTLELLPE LFAANIASVK IEGRQRSPAY VSQVAKVWRQ AIDLYQANPA
QFAAKAEWME QLGAMSEGTQ TTLGAYHRKW Q