Gene YpAngola_A1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1943 
Symbol 
ID5800413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2015381 
End bp2017132 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content51% 
IMG OID641339867 
Producthypothetical protein 
Protein accessionYP_001606417 
Protein GI162421872 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value2.94909e-10 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.195502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCAATA ACCGACTCGA ATGGCAGTCA CTGCTGCCGA ACACAACGCC ATATGAAGCG 
TTATTTGCTA CTGCCTCCCA ATTGGAGCCT GTTGCTTTTT CAGCCTATCA GCCCCGGCTA
GAGAATGGGA TGACGCTGTT CTGTCATCCG CAATCTCAAC CGCGGTTTAT GCTAATCAAA
GCACAGGAGA GTACCGACTA TCTGGCATTG ATCGCACAAG CGGTTAAAGC GCTACAACCG
GAGGCCTCCA ACCTGCTTGG TGGCGATTAT CTGGTCCATG GCCACCACGT CACCTGGCAA
TCAGCACAGC ACGGTGATGA ACGGTTTGCT GCGCAATCAA CCTGTCTCTA TCAAGAATGG
GTCGAGCCGG AACAGTTGTT TGGCTGTGTC AGGATGTATA AAGATCAAAT TACGCTGCAA
CCGGGTTTAG TTCATCAGGT AAATGGTGGT GTATTGATCT TGTCGGTACG GGCCTTGCAA
GCTCAGCCAT TGATGTGGCT GCGTTTGAAG CAAATGATAG TTCAACAACG CTTTGATTGG
CTCTCTACCG ATGAAACCCG CCCGCTGCCA GTGCATATTC CCTCCATGCC ACTGGACCTG
CGTTTGATTC TGGTTGGTGA CCGTTTAGGG CTTGCCGATT TCCATGATAT GGAACCCGAG
CTAGGTGATC TGGCGATTTA TGGTGAGTTC GAGGCAGAAC TCCCCCTGGT AGATATCGAA
GGCATGGCTC TATGGTGCGG TTACATCAAC ACCCTATTGC AACAAAAGCA GCTCCCTGCC
TTATCTGCCG ATGCCTGGCC GGTCCTGTTC CGCCAAGCCG TGCGTTACAG TGGCGACCAA
GGCAGCCTGC CGCTCTGCCC GCAATGGCTT ACTCAGCAGC TCGCAGAAGC GGCGCTTTAT
GCTGAAAATG ACACCATCTC AGCTAACGCG CTAGAAGCCG CACTGAATGC CCGAAATTGG
CGGCAAAATT ATCTTGCCGA ACGGATGCAA GATGAGATCG AGCTGGGGCA AATTCTGGTT
GAGACTGAAG GGCACGTTGT TGGCCAGATT AATGCCTTAT CCGTGCTGGA ATATCCCGGT
CACCCTCACG CGTTCGGCGA ACCCGCACGG ATCAGTTGTG TGGTTCATTT AGGCGATGGC
GAATTTGTTG ACGTTGAACG TAAAGCGGAA TTGGGCGGTA ATATTCATGC TAAAGGCATG
ATGATTATGC AGGCATTCTT GATGTCTGAA TTGGAACTTG ACCAACCACT GCCATTCTCC
GCATCCATTG TGTTTGAGCA GTCCTACGGT GAGGTTGATG GCGATAGTGC ATCACTAGCA
GAGCTTTGTG CGCTGGTCAG TGCCCTTTCA CAGCAGCCTA TCAATCAGCA AATTGCGGTT
ACAGGCTCCG TTGACCAATT TGGTAACGTA CAACCTATTG GTGGCGTGAA CGAAAAAATT
GAAGGTTTCT TCGAAACTTG CCAACGCCGT GGTTTGACGG GCAATCAGGG GGTGATTTTA
CCTACCACGA ATGTTCGTCA CTTGTGCCTG AATCAAGCAG TGGTTGAAGC GGTACAGAAA
GGTCAATTCC ACCTGTGGGC TGTCGATACC GTGGCTGAAG CGCTACCATT ATTGACTGGG
GTCACTTATG ACGATGAGCA GCAACCCAGC CTGTTAGGCG CTATCCAGGA GCGTATCGCA
CAGATAAACC CACAGGATAA ACGCCGCTGG CCATGGCCAT TACGCTGGCT AAACTGGTTC
AACCAGGGCT AA
 
Protein sequence
MTNNRLEWQS LLPNTTPYEA LFATASQLEP VAFSAYQPRL ENGMTLFCHP QSQPRFMLIK 
AQESTDYLAL IAQAVKALQP EASNLLGGDY LVHGHHVTWQ SAQHGDERFA AQSTCLYQEW
VEPEQLFGCV RMYKDQITLQ PGLVHQVNGG VLILSVRALQ AQPLMWLRLK QMIVQQRFDW
LSTDETRPLP VHIPSMPLDL RLILVGDRLG LADFHDMEPE LGDLAIYGEF EAELPLVDIE
GMALWCGYIN TLLQQKQLPA LSADAWPVLF RQAVRYSGDQ GSLPLCPQWL TQQLAEAALY
AENDTISANA LEAALNARNW RQNYLAERMQ DEIELGQILV ETEGHVVGQI NALSVLEYPG
HPHAFGEPAR ISCVVHLGDG EFVDVERKAE LGGNIHAKGM MIMQAFLMSE LELDQPLPFS
ASIVFEQSYG EVDGDSASLA ELCALVSALS QQPINQQIAV TGSVDQFGNV QPIGGVNEKI
EGFFETCQRR GLTGNQGVIL PTTNVRHLCL NQAVVEAVQK GQFHLWAVDT VAEALPLLTG
VTYDDEQQPS LLGAIQERIA QINPQDKRRW PWPLRWLNWF NQG