Gene YpAngola_A2845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2845 
Symbol 
ID5801317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2985388 
End bp2986968 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content50% 
IMG OID641340697 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001607227 
Protein GI162420929 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.208852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTTA ACAAACCTGT CAGCCGGCAG GAGTATCCCA TTGAGCGCGA TATCACCTTG 
CAGTCCACCA CTGATATTCA TGGAAATATT GCTTACGCTA ATGCTGCTTT TGTTCGTGCC
AGTGGCTTTG AATATCAAGA GTTGCAGGGT CAACCTCACA ATATGGTAAG GCACCCCGAT
ATGCCACCCG CGGCTTTTGC TGATATGTGG CAAACATTGA ATGCCGGTCT TTCTTGGTCT
TCATTGGTGA AGAATCGCCG TAAAAATGGA GATTATTACT GGGTCCGGGC TAATGCGACG
CCACTGCGCC ATAATGGCCG TTTAACCGGT TATATTTCGG TACGTATTGC GCCAACTCGC
GATGAGGTAA AACAAGCTGA AGCATTGTAT ACCGATTTCA ACACCGGTAA GGCAAAACGG
CGGCATATTG CCTTGTATCG TGGGCTGATC GTGCGCACGG GGTGGCTCTC CCCACTTTCT
TTGTTCCAAA CTTTACCATT ACGCTGGCGC CTGCGGAGTG CCTTGCTAAG TTCCGCCATC
ATTCCAACCG CCGCCGCCAG TGTGATGGAG GTAGCGGGTC AACCTCTCTT GATTCTCGGC
TCGGTGGCAT TGACCTGGAG CGTATTGGCT TCACTGTGGT TGGAGCGGCA GATTGCCCGC
CCCATCGCGG CTATTTTGCA ACAGGCGCAG GACGTTTCCT CCGGAGAAGC AGGCGATTAT
GTACAACTAA ATCGTGTCGA TGAAATCGGT TATTTAATGC GTAGCGTCAA TCAACTCGGC
CTGAACCTAC GTTCTTTAAC CGATGATGTC AGTGGTCAGG TTGATGGGAT TAATACCGCC
AGCAACGAAA TTGCTGCTGG CAATCGCGAA CTAAAAGTAC GAACCGAACA AGCAACCGAT
AATCTGCAAC ACATTGTCAG TGCCACAGAA CAATTAGCTG CGACAGTGCA GAACAGCGCT
AACAGTGCCA ATGAAACAAC ATCACTCGCC GAAATGAGCA GCCATGCCGC AGAAAAAGGC
AGCGAGCTCA TGCACCAAGT CATTGAAACT ATGGGCACTA TCAATGACTC CAGCCATCGC
ATTGTAGATA TTATCAGTGT GATTGAAGGC ATTGCTTTTC AGACCAATAT TCTGGCACTC
AACGCGGCCG TAGAAGCGGC CCGGGCGGGT GAACAAGGTC GTGGATTTGC CGTAGTTGCC
AGCGAGGTAC GCCATTTGGC ACAACGTTCG TCAACCGCAG CAAAAGAGAT TAAACATTTA
ATTGAGACCA GTGTAGAGCG GGTACGTGAC GGAAGTACTT TGGTGCAAAG TGCCGGTGCC
ACTATGGACA ATATTGTCAA ACAGGCAAGC CAGGTCTCTA CGTTGATCAG TGAAATCAGC
ACCTCGACTC ATGAACAGAC ACAGGCCCTT GGCCAAATCC GGCAATCGAT CAGTCGGCTA
GATCAAATGA CTTACCAGAA TGCCGCGATG GTTGAGCAAT ATGCTGGCGC GGCAGAAGAA
CTGGCACATC GGACATTGCG ACTAACCGCG GCGGTCCGTA TTTACCGCCC ATTGAATAAC
GTAGGAAAGT TAAGGGATTA G
 
Protein sequence
MRVNKPVSRQ EYPIERDITL QSTTDIHGNI AYANAAFVRA SGFEYQELQG QPHNMVRHPD 
MPPAAFADMW QTLNAGLSWS SLVKNRRKNG DYYWVRANAT PLRHNGRLTG YISVRIAPTR
DEVKQAEALY TDFNTGKAKR RHIALYRGLI VRTGWLSPLS LFQTLPLRWR LRSALLSSAI
IPTAAASVME VAGQPLLILG SVALTWSVLA SLWLERQIAR PIAAILQQAQ DVSSGEAGDY
VQLNRVDEIG YLMRSVNQLG LNLRSLTDDV SGQVDGINTA SNEIAAGNRE LKVRTEQATD
NLQHIVSATE QLAATVQNSA NSANETTSLA EMSSHAAEKG SELMHQVIET MGTINDSSHR
IVDIISVIEG IAFQTNILAL NAAVEAARAG EQGRGFAVVA SEVRHLAQRS STAAKEIKHL
IETSVERVRD GSTLVQSAGA TMDNIVKQAS QVSTLISEIS TSTHEQTQAL GQIRQSISRL
DQMTYQNAAM VEQYAGAAEE LAHRTLRLTA AVRIYRPLNN VGKLRD