Gene YpsIP31758_3761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3761 
SymboluvrA 
ID5385775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4236507 
End bp4239350 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content52% 
IMG OID640866785 
Productexcinuclease ABC subunit A 
Protein accessionYP_001402715 
Protein GI153950806 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.162646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAATA TTGAAGTTCG GGGCGCTCGC ACCCACAATC TTAAGAATAT CAACCTGATT 
ATCCCGCGCG ACAAACTGAT TGTTGTCACC GGCCTATCAG GTTCAGGCAA ATCCTCACTG
GCTTTTGATA CCTTGTATGC CGAAGGTCAA CGCCGTTATG TTGAGTCTCT CTCCGCCTAT
GCACGCCAAT TTCTGTCGCT GATGGAAAAA CCGGATGTTG ACCATATTGA AGGCCTGTCT
CCGGCTATCT CTATCGAGCA AAAATCAACC TCCCATAATC CACGGTCAAC TGTCGGTACT
ATCACTGAAA TCCATGATTA CCTGCGCTTG CTGTTCGCTC GAGTCGGCGA GCCGCGCTGC
CCTGATCATG ATGTCCCATT AGCGGCGCAA ACAGTCAGCC AAATGGTTGA TAACGTGATA
AGCCAGCCGG AAGGCCGCCG TCTGATGCTG CTGGCACCGG TGGTAAAAGA TCGCAAAGGT
GAACACACTA AAATACTGGA AAATCTGGCC GCTCAAGGTT ACATCCGGGC GCGGATCGAT
GGCGAGGTTT GTGATCTGTC TGATCCGCCC AAATTAGAGC TGCAAAAGAA ACACACCATT
GAAGTGGTGG TTGACCGTTT CAAGGTACGC GAAGATCTGG CGCAACGTTT GGCAGAATCA
TTTGAAACCG CACTGGCGTT ATCCGGCGGT ACCGCCGTGG TAGCCGATAT GGACGATCCG
CATGTGGAGG AGTTGTTATT TTCCGCTAAC TTTGCCTGCC CGATTTGCGG TTACAGCATG
AGAGAGTTGG AACCACGCCT GTTCTCCTTT AACAACCCGG CGGGTGCCTG CCCAACGTGT
GATGGCTTGG GTGTCCAGCA GTTTTTTGAT CCAGACCGCG TGCTGCAAAA CCCTGAGCTC
TCTTTAGCTG GCGGAGCGAT TCGCGGCTGG GATCGCCGTA ACTTCTATTA CTTCCAGATG
CTACGTTCAC TGGCCGAGCA TTATAAATTT GATATCGAAG CGCCGTTTAA CTCACTGGAC
AGCGCCGTCC AGCAAGCCGT GCTATACGGT TCAGGCAAAG ATACCATCGA GTTCAAGTAC
ATTAATGATC GCGGTGATAC TACCGTTCGC CGCCACCCTT TTGAGGGGGT GTTGCACAAT
ATGGAACGCC GTTATAAAGA GACGGAATCC AGTGCGGTGC GCGAAGAGTT AGCCAAATTT
ATCAGCAACC GCTCCTGTGC GTCATGCAGC GGTACCCGTC TGCGCAGAGA GGCTCGTTAT
GTCTTCGTGG AAAACACCAC CCTGCCAGAG ATTTCTGAAC TGAGCATCGG CCATGCACTG
AGTTTCTTCC AGAATATGAA GCTCAGTGGT CAGCGTGCAC AAATCGCTGA AAAGATACTG
AAAGAAATTG GCGATAGGCT GAAATTCCTG GTTAACGTCG GGCTGAATTA TCTGTCTTTA
TCCCGATCTG CCGAAACCCT GTCCGGTGGC GAAGCACAGC GTATCCGTCT GGCTAGCCAG
ATTGGCGCGG GTTTGGTCGG CGTTATGTAT GTGCTGGATG AGCCGTCTAT CGGTTTACAT
CAGCGCGATA ACGAACGCTT GCTGGAGACA TTGATTCACC TACGTAATCT GGGTAACACC
GTGATTGTGG TGGAACATGA TGAAGATGCC ATCCGGGCCG CAGATCATGT GATTGATATC
GGCCCTGGGG CCGGCGTGCA CGGAGGTGAA GTTGTCGCCG AAGGAACGGT AGATGACATC
ATGGCCGCAC CGGCGTCACT CACGGGCCAG TTCCTCAGCG GTAAGCGGAG CATCGCCATC
CCAGAGAAAC GGGTCAGTGC TGATCCGAGC AAAGTCTTAA AGCTGATAGG GGCTACGGGC
AATAACCTGA AAGATGTCAC TCTGACGCTG CCTGTCGGGC TATTCAGTTG CATCACCGGG
GTCTCCGGCT CGGGGAAATC GACGCTAATC AACGATACTT TATACAGTAT TGCCCAACGC
CAACTGAACG GCGCGACCAT CACCGAACCC GCACCATACC GCGAGATCCA AGGGCTAGAA
CATTTCGATA AAGTCATCGA CATTGATCAA AGCCCGATTG GCCGTACGCC GCGTTCTAAC
CCAGCCACCT ATACCGGCAT CTTTACCCCC ATTCGCGAGT TATTTGCCGG AGTGCCGGAA
TCACGCACCC GCGGTTATAC GCCAGGCCGT TTCAGTTTTA ACGTCAAAGG TGGGCGCTGC
GAAGCCTGCC AGGGCGATGG GGTAATTAAA GTAGAAATGC ACTTTCTGCC TGATATTTAT
GTTCCTTGCG ATCACTGTAA AGGTAAGCGT TATAACCGTG AAACGCTGGA AATAAAATAT
AAAGGTAAAA GTATTCACGA AGTTCTGGCG ATGACTATTG AAGAGGCCCG CGAGTTCTTT
GATGCCGTAC CTGCTCTGGC ACGTAAGCTG CAAACCCTAA TAGATGTTGG CCTGTCCTAT
ATTTGTCTGG GCCAATCAGC CACAACGTTA TCTGGTGGGG AAGCACAGCG GGTGAAACTA
TCGCGCGAAC TGTCAAAACG CGGGACCGGC CAGACATTGT ATATTCTGGA TGAGCCTACT
ACCGGCCTGC ATTTTGCCGA TATTCAGCAA CTGCTAGCGG TATTGCATCA GTTACGGGAT
CAGGGCAATA CCATTGTAGT GATTGAACAC AATCTGGACG TAATTAAGAC GGCGGATTGG
ATTGTCGATC TGGGGCCAGA AGGCGGCAGT GGTGGCGGTG AAATTTTGGT CTCCGGCACA
CCAGAGACGG TGGCGGAATG CGCGGTATCA CACACGGCAC GGTTCCTTAA GCCGATGTTG
CAGCGTAAGC CACAAACCGT TTAA
 
Protein sequence
MDNIEVRGAR THNLKNINLI IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC
PDHDVPLAAQ TVSQMVDNVI SQPEGRRLML LAPVVKDRKG EHTKILENLA AQGYIRARID
GEVCDLSDPP KLELQKKHTI EVVVDRFKVR EDLAQRLAES FETALALSGG TAVVADMDDP
HVEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQFFD PDRVLQNPEL
SLAGGAIRGW DRRNFYYFQM LRSLAEHYKF DIEAPFNSLD SAVQQAVLYG SGKDTIEFKY
INDRGDTTVR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRSCASCS GTRLRREARY
VFVENTTLPE ISELSIGHAL SFFQNMKLSG QRAQIAEKIL KEIGDRLKFL VNVGLNYLSL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLET LIHLRNLGNT
VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGTVDDI MAAPASLTGQ FLSGKRSIAI
PEKRVSADPS KVLKLIGATG NNLKDVTLTL PVGLFSCITG VSGSGKSTLI NDTLYSIAQR
QLNGATITEP APYREIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGIFTP IRELFAGVPE
SRTRGYTPGR FSFNVKGGRC EACQGDGVIK VEMHFLPDIY VPCDHCKGKR YNRETLEIKY
KGKSIHEVLA MTIEEAREFF DAVPALARKL QTLIDVGLSY ICLGQSATTL SGGEAQRVKL
SRELSKRGTG QTLYILDEPT TGLHFADIQQ LLAVLHQLRD QGNTIVVIEH NLDVIKTADW
IVDLGPEGGS GGGEILVSGT PETVAECAVS HTARFLKPML QRKPQTV