Gene YpAngola_A4053 details

Gene Information

Locus tag: YpAngola_A4053
Symbol:
ID: 5802532
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4312569
End bp: 4313663
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 49%
IMG OID: 641341836
Product: hypothetical protein
Protein accession: YP_001608342
Protein GI: 162419815
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID:
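
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends in a single stop codon; the function names are illustrative and do not come from the database.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = feature_length(4312569, 4313663)
print(gene_length)                           # 1095
print(expected_protein_length(gene_length))  # 364
```

Both values agree with the Gene Length (1095 bp) and Protein Length (364 aa) fields of this record.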


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAGC AACCACTACA GAACTCGCTT TTTGAGGACG ATTACCTGCT ACGGGAACTT 
GGGCAGGTAG CACATGTTCC ACTGGTCGCT TTAACAGAGT TAGTTGCTAA CGCCTGGGAT
GCCGGAGCGT CACGAGTAGA CTTGGTTTTA CCTACCGATA TAGGTGGAAT ATTGACTGTG
ACAGACGATG GTCATGGTAT GACCCCGGAG CAATTTAAAA AGCGCTGGAT GACTTTGCGC
TATACCCGCG AGAAGTATCA AGGTTTCAAT GTAGAGTTCC CTACCGGACG AACTGCCCGC
CCGCGAAAGG CTTACGGTCG CAATGGCTTG CTTTGTTTTG CTGACGAATA CGAAGTGAAA
ACCTGGCGCG ATGGCATTTT GGCAACCTTC GTTGTTGGCA CCGGAGCTGG TTCTAGTCCT
TTCGTGTGTC GTAGTGAGAC TGAGGAGAAA CACATAGGAT CCGGTACACG ATTGCAGGTA
CAGGTAACAC GCAAACTGCC AGATGCCGAT GAAATACTCA CTGTATTATC CGCCCGCTTC
ATCCACGATC CTGAATTCGA GGTGCGTGTT AATGGCAAGT TACGACCCTT TACTGAGATT
GATGGGCGAA TAAGTGAAGA GACCCTGTGC CTCAGTGCAG GACGTCATGC CAAAGTGATT
GTCATCGACT CAACTCGTCT GAATCATTCT TCTTTGCACC AAGGAATTGC ATTTTGGGTA
CAGCACAGAT TAGTTGGGAC CCCGTCATGG GCAGTTGGAC AGATAGCCAA TTTCGACGGA
CGTACACGCT TTGCCCGGCG TTACAAGGTT ATCGTTGATA CAGAAGGTTT CGAATCTGAT
GTTGAACAAG ACTGGACCGC TTTTTGTCCC ACTGATTCTG TAAATGAATT ATATCAGGCT
ACAGCTGAGC ACATTCGTGA TGTTGCACAA AGGCTTGCTG TGGAATTTAC TGAAATTACA
TCAGAGGATG CTCTGATGCA AAACCGCTCA GAGTTAGCAA CCTTGGGTCA GGGGGCCCGC
TTGGGAGTGG CTGAATTCAC TACAGCGGTA GCTCAAGAGC ATCCGACTGT CTCCCCAGAT
GTCAACGACG GATGA
 
Protein sequence
MSEQPLQNSL FEDDYLLREL GQVAHVPLVA LTELVANAWD AGASRVDLVL PTDIGGILTV 
TDDGHGMTPE QFKKRWMTLR YTREKYQGFN VEFPTGRTAR PRKAYGRNGL LCFADEYEVK
TWRDGILATF VVGTGAGSSP FVCRSETEEK HIGSGTRLQV QVTRKLPDAD EILTVLSARF
IHDPEFEVRV NGKLRPFTEI DGRISEETLC LSAGRHAKVI VIDSTRLNHS SLHQGIAFWV
QHRLVGTPSW AVGQIANFDG RTRFARRYKV IVDTEGFESD VEQDWTAFCP TDSVNELYQA
TAEHIRDVAQ RLAVEFTEIT SEDALMQNRS ELATLGQGAR LGVAEFTTAV AQEHPTVSPD
VNDG
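
As a rough illustration of how the protein sequence and the GC content field relate to the gene sequence, the sketch below strips the block spacing, computes the G+C fraction, and translates codons with the amino-acid assignments of translation table 11. The helper names are hypothetical. The sketch does not handle alternative start codons, which table 11 permits; this gene begins with ATG, so that case does not arise here.

```python
# Codon assignments for NCBI translation table 11, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First and last codons of the gene sequence shown above.
print(translate("ATGTCTGAGC AACCACTACA G"))  # MSEQPLQ
print(translate("GTCAACGACG GATGA"))         # VNDG
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field of this record.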