Gene SNSL254_A4598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4598 
SymboluvrA 
ID6485239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4464381 
End bp4467206 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content57% 
IMG OID642739823 
Productexcinuclease ABC subunit A 
Protein accessionYP_002043505 
Protein GI194446489 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAATAT TAACCTCGTC 
ATCCCCCGCG ACAAACTGAT TGTCGTGACC GGGCTTTCGG GTTCAGGCAA ATCCTCACTG
GCTTTCGACA CTCTGTATGC CGAAGGGCAG CGTCGTTACG TTGAATCGCT CTCCGCTTAC
GCGCGGCAGT TTTTGTCGCT CATGGAAAAA CCGGATGTCG ACCATATTGA GGGGCTATCG
CCCGCGATCT CAATTGAACA GAAATCGACA TCTCACAACC CGCGCTCTAC GGTGGGTACT
ATTACCGAGA TCCACGACTA CCTGCGCCTG CTGTTTGCCC GCGTGGGCGA GCCGCGTTGT
CCGGACCATG ACGTGCCGCT GGCGGCGCAA ACCGTTAGCC AGATGGTCGA TAACGTGCTG
TCACAGCCGG AAGGCAAACG CCTGATGCTG CTGGCGCCGA TTATTAAAGA GCGTAAAGGC
GAACACACCA AAACGCTGGA AAATCTGGCA AGCCAGGGTT ACATTCGCGC CCGTATTGAC
GGCGAAGTCT GCGATCTCTC CGATCCGCCG AAGCTGGAGC TACAAAAGAA ACATACCATT
GAGGTGGTGA TCGATCGCTT CAAAGTTCGC AACGATCTTT CCCAACGCCT GGCGGAGTCG
TTCGAAACGG CGCTGGAATT ATCCGGCGGC ACGGCGGTTG TTGCCGATAT GGACGATGAG
AAAGCGGAGG AGCTTCTGTT CTCCGCCAAT TTTGCCTGTC CGATTTGCGG CTACAGTATG
CGCGAACTGG AACCGCGTCT GTTCTCGTTC AACAACCCGG CAGGCGCCTG CCCGACCTGT
GACGGGCTCG GCGTTCAGCA ATATTTCGAT CCGGACCGCG TGATCCAGAA TCCCGACCTG
TCGCTGGCAG GCGGCGCGAT TCGTGGTTGG GATCGTCGCA ATTTTTATTA CTTTCAAATG
CTCAAGTCGC TGGCGGAACA CTATAAGTTC GACGTGGATG CGCCGTGGGC AAGCCTCAGC
GCCAACGTAC ATAAAGTCGT GCTGTACGGT TCCGGCAAAG AGAACATTGA ATTTAAATAT
ATGAACGATC GCGGCGATAC TTCCGTGCGC CGCCATCCGT TCGAAGGCGT GCTGCATAAT
ATGGAGCGCC GTTATAAAGA GACGGAATCC AGCGCGGTGC GCGAAGAGCT GGCGAAGTTC
ATCAGTAATC GCCCCTGCGC CAGCTGTGAA GGAACGCGAC TGAATCGCGA AGCGCGCCAT
GTATTTGTGG AAAATACGCC GCTGCCTGCT ATTTCCGATA TGAGCATTGG CCATGCGATG
GATTTTTTCA CTAATCTCAA GCTTTCCGGG CAACGAGCGA AAATCGCCGA AAAAGTGCTA
AAAGAGATCG GCGATCGCCT CAAGTTTCTG GTGAACGTCG GCCTGAACTA TCTCACGCTC
TCCCGCTCGG CAGAGACGCT TTCCGGCGGC GAAGCCCAGC GTATTCGTCT GGCGAGCCAG
ATAGGCGCCG GGTTAGTCGG CGTGATGTAT GTGCTGGATG AACCGTCCAT CGGTCTGCAC
CAGCGCGATA ATGAACGGCT GCTGGGTACG CTGATTCATC TGCGCAATCT TGGCAATACC
GTGATTGTGG TGGAACACGA TGAAGACGCC ATCCGCGCCG CCGACCATGT GATTGATATT
GGCCCCGGCG CGGGCGTTCA CGGCGGCGAG GTGGTGGCGG AAGGCCCGCT GGAAGCCATT
ATGGCGGTAC CGGAATCGCT GACCGGCCAG TACATGAGCG GTAAACGCAA AATTGAAGTG
CCGAAACAAC GCGTGCCGGC AAATCCAGAA AAAGTGCTCA AACTCACCGG CGCGCGCGGC
AACAACCTGA AAGATGTGAC CCTTACGCTA CCGGTAGGGC TGTTTACCTG TATCACCGGC
GTCTCGGGTT CCGGTAAATC GACGCTGATT AACGACACGC TGTTCCCCAT CGCCCAGCGT
CAGTTAAACG GGGCGACTAT CGCCGAACCG GCGCCGTATC GGGATATTCA GGGGCTGGAA
CATTTCGATA AAGTGATCGA TATCGACCAG AGCCCGATCG GGCGCACCCC GCGTTCCAAC
CCGGCGACCT ATACGGGCGT CTTTACCCCG GTTCGCGAGC TTTTTGCTGG CGTGCCGGAG
TCTCGCTCGC GCGGCTACAC GCCAGGACGA TTCAGCTTCA ACGTGCGCGG CGGTCGCTGC
GAAGCGTGCC AGGGCGATGG CGTCATTAAA GTCGAAATGC ACTTTCTGCC GGATATTTAC
GTGCCGTGCG ACCAGTGCAA AGGCAAGCGC TATAACCGGG AAACGCTGGA GATTAAGTAC
AAAGGCAAGA CCATCCACGA AGTACTGGAT ATGACCATTG AAGAAGCGCG TGAGTTCTTT
GATGCGGTTC CGGCGCTGGC GCGTAAGCTG CAGACGCTGA TGGACGTGGG GCTGACCTAT
ATCCGTCTTG GTCAGTCGGC GACAACGCTT TCCGGCGGCG AGGCCCAGCG CGTGAAGCTG
GCGCGCGAAC TGTCGAAGCG CGGCACCGGG CAAACGCTGT ATATTCTCGA CGAGCCAACC
ACCGGCCTGC ACTTTGCCGA TATTCAGCAG TTGCTTGACG TTCTGCATCA GTTGCGCGAT
CAGGGCAACA CCATCGTGGT GATCGAACAC AACCTGGACG TCATTAAAAC GGCGGACTGG
ATTGTCGACC TCGGCCCGGA AGGCGGCAGC GGCGGCGGCG AAATTCTCGT CGCCGGTACG
CCGGAAACCG TGGCGGAGTG CGAGGCGTCG CATACTGCCC GCTTCCTTAA ACCTATGCTC
AAATAA
 
Protein sequence
MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC
PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID
GEVCDLSDPP KLELQKKHTI EVVIDRFKVR NDLSQRLAES FETALELSGG TAVVADMDDE
KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPDL
SLAGGAIRGW DRRNFYYFQM LKSLAEHYKF DVDAPWASLS ANVHKVVLYG SGKENIEFKY
MNDRGDTSVR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLNREARH
VFVENTPLPA ISDMSIGHAM DFFTNLKLSG QRAKIAEKVL KEIGDRLKFL VNVGLNYLTL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRNLGNT
VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV
PKQRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR
QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE
SRSRGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY
KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL
ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHQLRD QGNTIVVIEH NLDVIKTADW
IVDLGPEGGS GGGEILVAGT PETVAECEAS HTARFLKPML K