Gene YpAngola_A4069 details


Gene Information

Locus tag: YpAngola_A4069
Symbol:
ID: 5802548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4331913
End bp: 4333412
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 51%
IMG OID: 641341850
Product: M16 family peptidase
Protein accession: YP_001608356
Protein GI: 162418358
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
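
The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported 1500 bp length fits that convention). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 4331913
END_BP = 4333412


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1500 499
```

The result agrees with the Gene Length (1500 bp) and Protein Length (499 aa) fields.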


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.187129
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGGCA CCAAAATTCG TCTTATGGTT GGTGGGTTAT TATTGGCGGC GGCCAGCAAT 
AATGTGCAGG CTGAAGCACT ACAACCCGAC CCTGCCTGGC AGCAGGGAAA GTTAGATAAT
GGGTTTTCAT GGCAGTTGCT GGCAACGCCG CAGCGCCCAA GTGATCGGAT CGAGCTACGC
TTGATCGTCA ATACCGGATC ATTATCCGAA AATACACAGG AAGTGGGCTT TGCCCATTTA
CTGCCTCGTC TCGCATTAAT GAGCAGCGCG AGTTTTACCC CCGCCCAACT CCAGTCGCTA
TGGCAGCAAG GGATCGATAA CGAACGCCCA TTGCCACCGG CTATTACATC GTATGATTTC
ACCTTATACA GCCTGAGCTT ACCCAATAAC CGTCCTGATC TGCTGAAGGA TGCGTTGGCA
TGGTTATCTG ATACGGCGGG TAATCTGGCG GTAAGTGAGC AAACGGTTAA TGCAGCCTTA
AATACGGCAA CCGACCCCAT TGCGACTTTC CCACAAAATA TTCAGGAACC GTGGTGGCGT
TATCGCCTCA AAGGTTCTTC TTTAATCGGC CACGATCCCG GTCAGCCAGT GACGCAACCG
GTCGATGTAG AGAAGCTAAA GCAGTTTTAT CAGCAATGGT ATACCCCAGA TGCGATGACG
CTGTATGTGG TCGGTAATGT TGATAGCCGT AGTATCGCCG CCCAAATCAG TAAAGCGTTT
TCTGAGCTGA AAGGTAAGCG TACCGCGCCA GCGGCAGTTG CAACCTTAGC CCCGCTACCA
CCAGAACCGG TGAGTTTGAT GAATGAACAG GCGGCACAGG ACACGTTATC GTTGATGTGG
GATACCCCAT GGCACCCCAT TCAAGACTCT ATGGCCCTGA GCCGCTATTG GCGCAGCGAT
TTAGCCCGTG AAGCGCTGTT TTGGCATATC AAGCAGGTGT TGGAGAAAAA TAATCAGAAG
AACCTAAAGC TGGGCTTTGA TTGCCGGGTC CAGTATCAAC GCGCCCAATG TGCTATTCAC
TTAAATACCC CCGTTGAGAA CCTGACAGCC AATATGACAT TTGTTGCCCG TGAGTTGGCG
GCATTGCGTG CTAACGGCCT GAGCCAGGCT GAGTTTGATG CGTTGATGAC ACAGAAAAAC
GACCAACTCA GTAAGTTGTT CGCCACCTAT GCGCGTACTG ATACCGATAT TTTGATGAGC
CAACGTCTGC GCTCACAGCA AAGTGGTGTT GTGGACATCG CGCCGGAACA GTATCAAAAA
TTGCGGCAGG CATTCTTGTC TGGGTTGACG CTGGCAGAGC TGAATCGGGA GTTAAAACAG
CAACTTTCAC AAGATACCAC CTTGGTTCTG ATGCAACCGA AAGGTGAACC TGAAGTTAAT
GTGAAGGCGT TGCAGGAGAT CTATAACGGC ATTATGGCAC CACAGACGGT GGCGGAAGAA
GAGGTTGCCC CTGCTGAAGC GGTAGAAACT GCACCTGTTA TGCCGACAAC CGCGCAATAA
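
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `join_blocks` and `gc_percent` are hypothetical, and the calculation is a plain G+C count over all bases, which may not be exactly how the database derived its rounded figure. Only the first line of the sequence is used as sample input, to keep the example short.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed (six blocks of 10 bases).
first_line = "ATGCAGGGCA CCAAAATTCG TCTTATGGTT GGTGGGTTAT TATTGGCGGC GGCCAGCAAT"
seq = join_blocks(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 25-line listing to `join_blocks` should give a 1500-base string if the listing is complete.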
 
Protein sequence
MQGTKIRLMV GGLLLAAASN NVQAEALQPD PAWQQGKLDN GFSWQLLATP QRPSDRIELR 
LIVNTGSLSE NTQEVGFAHL LPRLALMSSA SFTPAQLQSL WQQGIDNERP LPPAITSYDF
TLYSLSLPNN RPDLLKDALA WLSDTAGNLA VSEQTVNAAL NTATDPIATF PQNIQEPWWR
YRLKGSSLIG HDPGQPVTQP VDVEKLKQFY QQWYTPDAMT LYVVGNVDSR SIAAQISKAF
SELKGKRTAP AAVATLAPLP PEPVSLMNEQ AAQDTLSLMW DTPWHPIQDS MALSRYWRSD
LAREALFWHI KQVLEKNNQK NLKLGFDCRV QYQRAQCAIH LNTPVENLTA NMTFVARELA
ALRANGLSQA EFDALMTQKN DQLSKLFATY ARTDTDILMS QRLRSQQSGV VDIAPEQYQK
LRQAFLSGLT LAELNRELKQ QLSQDTTLVL MQPKGEPEVN VKALQEIYNG IMAPQTVAEE
EVAPAEAVET APVMPTTAQ
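
The protein sequence can be cross-checked against the gene sequence by translating the latter. The sketch below is a minimal illustration, not the database's own procedure. It builds a codon table from the amino-acid string that NCBI publishes for translation table 11 (whose codon assignments match the standard code; table 11 differs in its permitted start codons, which this sketch does not model). The function name `translate` is hypothetical, and only the first line of the gene sequence is translated.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a (possibly blocked) DNA string, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGCAGGGCA CCAAAATTCG TCTTATGGTT GGTGGGTTAT TATTGGCGGC GGCCAGCAAT"
print(translate(first_line))  # MQGTKIRLMVGGLLLAAASN
```

The output matches the first 20 residues of the protein sequence above (`MQGTKIRLMV GGLLLAAASN`).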