Gene YpAngola_A0751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0751 
SymboluvrA 
ID5799213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp760773 
End bp763616 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content52% 
IMG OID641338749 
Productexcinuclease ABC subunit A 
Protein accessionYP_001605327 
Protein GI162418877 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.10562 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAATA TTGAAGTTCG GGGCGCTCGC ACCCACAATC TTAAGAATAT CAACCTGATT 
ATCCCGCGCG ACAAACTGAT TGTTGTCACC GGCCTATCAG GTTCAGGCAA ATCCTCACTG
GCTTTTGATA CCTTGTATGC CGAAGGTCAA CGCCGTTATG TTGAGTCTCT CTCCGCCTAT
GCACGCCAAT TTCTGTCGCT GATGGAAAAA CCGGATGTTG ACCATATTGA AGGCCTGTCT
CCGGCTATCT CTATCGAGCA AAAATCAACC TCCCATAACC CACGGTCAAC TGTCGGTACT
ATCACTGAAA TCCATGATTA CCTGCGCTTG CTGTTCGCTC GAGTCGGCGA GCCGCGCTGC
CCTGATCATG ATGTCCCATT AGCGGCGCAA ACAGTCAGCC AAATGGTTGA TAACGTGATA
AGCCAGCCGG AAGGCCGCCG TCTGATGCTG TTGGCACCGG TGGTAAAAGA TCGCAAAGGT
GAACACACTA AAATACTGGA AAATCTGGCC GCTCAAGGTT ACATCCGGGC GCGGATCGAT
GGCGAGGTTT GTGATCTGTC TGATCCGCCC AAATTAGAGC TGCAAAAGAA ACACACCATT
GAAGTGGTGG TTGACCGTTT CAAGGTACGC GAAGATCTGG CGCAACGTTT GGCAGAATCA
TTTGAAACCG CATTGGCGTT ATCCGGCGGT ACCGCCGTGG TAGCCGATAT GGACGATCCG
CATGTGGAGG AGTTATTATT TTCCGCTAAC TTTGCCTGCC CGATTTGCGG TTACAGCATG
AGAGAGTTGG AACCACGCCT GTTCTCCTTT AACAACCCGG CGGGTGCCTG CCCAACGTGT
GATGGCTTGG GTGTCCAGCA GTTTTTTGAT CCAGACCGCG TGCTGCAAAA CCCTGAGCTC
TCTTTAGCTG GCGGAGCGAT TCGCGGCTGG GATCGCCGTA ACTTCTATTA CTTCCAGATG
CTACGTTCAC TGGCCGAGCA TTATAAATTT GATATCGAAG CGCCGTTTAA CTCACTGGAC
AGCGCCGTCC AGCAAGCCGT GCTATACGGT TCAGGCAAAG ATACCATCGA GTTCAAGTAC
ATTAATGATC GCGGTGATAC TACCGTTCGC CGCCACCCTT TTGAGGGGGT GTTGCACAAT
ATGGAACGCC GTTATAAAGA GACGGAATCC AGTGCGGTGC GCGAAGAGTT AGCCAAATTT
ATCAGCAACC GCTCCTGTGC GTCATGCAGC GGTACCCGTC TGCGCAGAGA GGCTCGTTAT
GTCTTCGTGG AAAACACCAC CCTGCCAGAG ATTTCTGAAC TGAGCATCGG CCATGCACTG
AGTTTCTTCC AGAATATGAA GCTCAGTGGT CAGCGTGCAC AAATCGCTGA AAAGATACTG
AAAGAAATTG GCGATAGGCT GAAATTCCTG GTTAACGTCG GGCTGAATTA TCTGTCTTTA
TCCCGATCTG CCGAAACCCT GTCCGGTGGC GAAGCACAGC GTATCCGTCT GGCTAGCCAG
ATTGGCGCGG GTTTGGTCGG CGTTATGTAT GTGCTGGATG AGCCGTCTAT CGGTTTACAT
CAGCGCGATA ACGAACGCTT GCTGGAGACA TTGATTCACC TACGTAATCT GGGTAACACC
GTGATTGTGG TGGAACATGA TGAAGATGCC ATCCGGGCCG CAGATCATGT GATTGATATC
GGCCCTGGGG CCGGCGTGCA CGGAGGTGAA GTTGTCGCCG AAGGAACGGT AGATGACATC
ATGGCCGCAC CGGCGTCACT CACGGGCCAG TTCCTCAGCG GTAAGCGGAG CATCGCCATC
CCAGAGAAAC GGGTCAGTGC TGATCCGAGC AAAGTCTTAA AGCTGATAGG GGCTACGGGC
AATAACCTGA AAGATGTCAC TCTGACGCTG CCTGTCGGGC TATTCAGTTG CATCACCGGG
GTCTCCGGCT CGGGGAAATC GACGCTGATC AACGATACTT TATACAGTAT TGCCCAACGC
CAACTGAACG GCGCGACCAT CACCGAACCC GCACCATACC GCGAGATCCA AGGGCTAGAA
CATTTCGATA AAGTCATCGA CATTGATCAA AGCCCGATTG GCCGTACGCC GCGTTCTAAC
CCAGCCACCT ATACCGGCAT CTTTACCCCC ATTCGCGAGT TATTTGCCGG AGTGCCGGAA
TCACGCACCC GCGGTTATAC GCCAGGCCGT TTCAGTTTTA ACGTCAAAGG TGGGCGCTGT
GAAGCCTGCC AGGGCGATGG GGTAATAAAA GTAGAAATGC ACTTTCTGCC TGATATTTAT
GTTCCTTGCG ATCACTGTAA AGGTAAGCGT TATAACCGTG AAACGCTGGA AGTAAAATAT
AAAGGTAAAA GTATTCACGA AGTTCTGGCG ATGACTATTG AAGAGGCCCG CGAGTTCTTT
GATGCCGTAC CTGCTCTGGC ACGTAAGCTG CAAACCCTGA TAGATGTTGG CCTGTCCTAT
ATTTGTCTGG GCCAATCAGC CACAACGTTA TCTGGTGGGG AAGCACAGCG AGTGAAACTA
TCGCGCGAAC TGTCAAAACG CGGGACCGGC CAGACATTGT ATATTCTGGA TGAGCCTACT
ACCGGCCTGC ATTTTGCCGA TATTCAGCAA CTGCTAGCGG TATTGCATCA GTTACGGGAT
CAGGGCAATA CCATTGTAGT GATTGAACAC AATCTGGACG TAATTAAGAC GGCGGATTGG
ATTGTCGATC TGGGGCCAGA AGGCGGCAGT GGTGGCGGTG AAATTTTGGT CTCCGGCACA
CCAGAGACGG TGGCGGAATG CGCGGCATCA CACACGGCAC GGTTCCTTAA GCCAATGTTG
CAGCGTAAGC CACAAACCGT TTAA
 
Protein sequence
MDNIEVRGAR THNLKNINLI IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC
PDHDVPLAAQ TVSQMVDNVI SQPEGRRLML LAPVVKDRKG EHTKILENLA AQGYIRARID
GEVCDLSDPP KLELQKKHTI EVVVDRFKVR EDLAQRLAES FETALALSGG TAVVADMDDP
HVEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQFFD PDRVLQNPEL
SLAGGAIRGW DRRNFYYFQM LRSLAEHYKF DIEAPFNSLD SAVQQAVLYG SGKDTIEFKY
INDRGDTTVR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRSCASCS GTRLRREARY
VFVENTTLPE ISELSIGHAL SFFQNMKLSG QRAQIAEKIL KEIGDRLKFL VNVGLNYLSL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLET LIHLRNLGNT
VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGTVDDI MAAPASLTGQ FLSGKRSIAI
PEKRVSADPS KVLKLIGATG NNLKDVTLTL PVGLFSCITG VSGSGKSTLI NDTLYSIAQR
QLNGATITEP APYREIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGIFTP IRELFAGVPE
SRTRGYTPGR FSFNVKGGRC EACQGDGVIK VEMHFLPDIY VPCDHCKGKR YNRETLEVKY
KGKSIHEVLA MTIEEAREFF DAVPALARKL QTLIDVGLSY ICLGQSATTL SGGEAQRVKL
SRELSKRGTG QTLYILDEPT TGLHFADIQQ LLAVLHQLRD QGNTIVVIEH NLDVIKTADW
IVDLGPEGGS GGGEILVSGT PETVAECAAS HTARFLKPML QRKPQTV