Gene YpAngola_A2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2572 
Symbol 
ID5801043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2691381 
End bp2692406 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content50% 
IMG OID641340441 
ProductDNA-binding transcriptional repressor PurR 
Protein accessionYP_001606983 
Protein GI162419802 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.407544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA TTAAAGATGT GGCCAAACAC GCAGGTGTGT CCACAACCAC CGTTTCGCAC 
GTTATCAACA AGACTCGTTT CGTCGCCGAA AATACGAAGG CTGCCGTGTG GGCCGCCATT
AAAGAGCTGC ATTATTCACC CAGTGCTGTC GCCCGCAGTT TGAAAGTGAA TCACACCAAG
TCGATTGGTT TGCTGGCAAC GTCCAGTGAA GCGCCCTATT TTGCTGAGGT GATCGAAGCG
GTAGAAAATA GCTGCTATAG CAAAGGTTAC ACGCTGATTT TATGTAATTC TCATAATAAT
CTGGATAAAC AAAAAGCCTA TCTGGCGATG TTGGCACAAA AGCGTGTCGA TGGTTTATTG
GTGATGTGCT CCGAATATCC AGATCAACTG CTGGGGATGC TAGAGGATTA TCGCAATATT
CCTATGGTCG TGATGGACTG GGGGACTGCC CGTGGCGATT TTACTGACTC TATCATTGAT
AACGCCTTCG AAGGAGGCTA TTTAGCGGGC CGTTATCTGA TTGAACGCGG GCATCGTGAT
ATTGGTGCGA TACCCGGCCA GTTGGCGCGT AATACCGGCG GTGGCCGTCA TCAAGGCTTC
CTTAAAGCCT TGGAAGAAGC CAATATCCCT GTTCGTGAAG AGTGGATTGT TCAGGGCGAT
TTTGAGCCAG AATCCGGTTA TAAAGCTATG CATCAAATTC TGACACAAAA ACATCGCCCA
ACGGCGGTGT TCTGTGGCGG CGATATCATG GCGATGGGCG CAATCTGTGC CGCCGACGAA
CTGGGCCTGC GGGTACCACA AGATATTTCG GTGATCGGGT ATGATAATGT GCGTAACGCG
CGCTACTTCT CACCAGCACT GACCACTATC CATCAACCTA AAGAGAGATT AGGCGAAACC
GCCTTTGCCA TGTTGCTGGA CCGCATTGTC AGTAAGCGTG AAGATCCGCA GACCATAGAG
GTACACCCTA AACTGGTGGA GCGTCGTTCC GTTGCTGATG GCCCTTTCCG TGATTATCGC
CGTTGA
 
Protein sequence
MATIKDVAKH AGVSTTTVSH VINKTRFVAE NTKAAVWAAI KELHYSPSAV ARSLKVNHTK 
SIGLLATSSE APYFAEVIEA VENSCYSKGY TLILCNSHNN LDKQKAYLAM LAQKRVDGLL
VMCSEYPDQL LGMLEDYRNI PMVVMDWGTA RGDFTDSIID NAFEGGYLAG RYLIERGHRD
IGAIPGQLAR NTGGGRHQGF LKALEEANIP VREEWIVQGD FEPESGYKAM HQILTQKHRP
TAVFCGGDIM AMGAICAADE LGLRVPQDIS VIGYDNVRNA RYFSPALTTI HQPKERLGET
AFAMLLDRIV SKREDPQTIE VHPKLVERRS VADGPFRDYR R