Gene SeD_A4649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4649 
SymboluvrA 
ID6872793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4488672 
End bp4491497 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content57% 
IMG OID642787551 
Productexcinuclease ABC subunit A 
Protein accessionYP_002218149 
Protein GI198245071 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.768489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAATAT TAACCTCGTC 
ATCCCCCGCG ACAAACTGAT TGTCGTGACC GGGCTTTCGG GTTCAGGCAA ATCCTCACTG
GCTTTCGACA CTCTGTATGC CGAAGGGCAG CGTCGTTACG TTGAATCGCT CTCCGCTTAC
GCGCGGCAGT TTTTGTCGCT CATGGAAAAA CCGGATGTCG ACCATATTGA GGGGCTATCG
CCCGCGATCT CAATTGAACA GAAATCGACA TCGCACAACC CGCGCTCTAC GGTGGGTACT
ATTACCGAGA TCCACGACTA CCTGCGCCTG CTGTTTGCCC GCGTGGGCGA GCCGCGTTGT
CCGGATCATG ACGTGCCGCT GGCGGCGCAA ACCGTTAGCC AGATGGTCGA TAACGTGCTG
TCACAGCCGG AAGGCAAACG TCTGATGCTG CTCGCGCCGA TTATTAAAGA GCGTAAAGGC
GAACACACCA AAACGCTGGA AAATCTGGCA AGCCAGGGTT ACATTCGCGC CCGTATTGAC
GGCGAAGTCT GCGATCTCTC CGATCCGCCG AAGCTGGAGC TACAAAAGAA ACATACCATT
GAGGTGGTGA TCGATCGCTT CAAAGTTCGC AACGATCTTT CCCAACGCCT GGCGGAGTCG
TTCGAAACGG CGCTGGAATT ATCCGGCGGC ACGGCGGTTG TTGCCGATAT GGACGATGAG
AAAGCGGAGG AGCTTCTGTT CTCCGCCAAT TTTGCCTGTC CGATTTGCGG CTACAGTATG
CGCGAACTGG AACCGCGTCT GTTCTCGTTC AACAACCCGG CAGGCGCCTG CCCGACCTGT
GACGGGCTCG GCGTTCAGCA ATATTTCGAT CCGGACCGCG TGATCCAGAA TCCCGACCTG
TCGCTGGCAG GCGGCGCGAT TCGTGGTTGG GATCGTCGCA ATTTTTATTA CTTTCAAATG
CTCAAGTCGC TGGCGGAACA CTATAAGTTC GACGTGGATG CGCCGTGGGC AAGCCTCAGC
GCCAACGTAC ATAAAGTCGT GCTGTACGGT TCCGGCAAAG AGAACATTGA ATTTAAATAT
ATGAACGATC GCGGCGATAC TTCCGTGCGC CGCCATCCGT TCGAAGGCGT GCTGCATAAT
ATGGAGCGCC GTTATAAAGA GACGGAATCC AGCGCGGTGC GCGAAGAGCT GGCGAAGTTC
ATCAGTAATC GCCCCTGCGC CAGCTGTGAA GGAACGCGAC TGAATCGCGA AGCGCGCCAT
GTATTTGTGG AAAATACGCC GCTGCCTGCT ATTTCCGATA TGAGCATTGG CCATGCGATG
GATTTTTTCA CTAATCTCAA GCTTTCCGGG CAACGAGCGA AAATCGCCGA AAAAGTGCTA
AAAGAGATCG GCGATCGCCT CAAGTTTCTG GTGAACGTCG GCCTGAACTA TCTCACGCTC
TCCCGCTCGG CAGAGACGCT TTCCGGCGGC GAAGCCCAGC GTATTCGTCT GGCGAGCCAG
ATAGGCGCCG GGTTAGTCGG CGTGATGTAT GTGCTGGATG AGCCGTCCAT CGGTCTGCAC
CAGCGCGATA ACGAACGGCT GCTGGGTACG CTGATTCATC TGCGCAATCT TGGCAATACC
GTGATTGTGG TGGAACATGA TGAAGACGCC ATTCGCGCCG CCGACCATGT GATCGATATT
GGCCCCGGCG CGGGCGTTCA CGGCGGCGAG GTGGTGGCGG AAGGCCCGCT GGAAGCCATT
ATGGCGGTAC CGGAATCGCT GACCGGCCAG TACATGAGCG GTAAACGCAA AATTGAAGTG
CCGAAACAAC GCGTGCCGGC AAATCCAGAA AAAGTGCTCA AACTCACCGG CGCGCGCGGC
AACAACCTGA AAGATGTGAC CCTTACGCTA CCGGTAGGGC TGTTTACCTG TATCACCGGC
GTCTCGGGTT CCGGTAAATC GACGCTGATT AACGACACGC TGTTCCCCAT CGCCCAGCGT
CAGTTAAACG GGGCGACTAT CGCCGAACCG GCGCCGTATC GGGATATTCA GGGGCTGGAA
CATTTCGATA AAGTGATCGA TATCGACCAG AGCCCAATCG GGCGCACCCC GCGTTCCAAC
CCGGCGACCT ATACGGGCGT CTTTACCCCG GTTCGCGAGC TTTTTGCTGG CGTGCCGGAG
TCTCGCTCGC GCGGCTACAC GCCAGGGCGA TTCAGCTTCA ACGTGCGCGG CGGTCGCTGC
GAAGCGTGCC AGGGCGATGG CGTCATTAAA GTCGAAATGC ACTTTCTGCC GGATATTTAC
GTGCCGTGCG ACCAGTGCAA AGGCAAGCGC TATAACCGGG AAACGCTGGA GATTAAGTAC
AAAGGCAAGA CCATCCACGA AGTGCTGGAT ATGACCATTG AAGAAGCGCG TGAGTTCTTT
GATGCGGTTC CGGCGCTGGC GCGTAAGCTG CAGACGCTGA TGGACGTGGG GCTGACCTAT
ATCCGTCTTG GTCAGTCGGC GACGACGCTT TCCGGCGGCG AGGCCCAGCG CGTGAAGCTG
GCGCGCGAAC TGTCGAAGCG CGGCACCGGG CAGACGCTGT ATATTCTCGA CGAGCCGACC
ACCGGCCTGC ACTTTGCCGA TATTCAGCAG TTGCTTGACG TTCTGCATCA GTTGCGCGAT
CAGGGCAACA CCATCGTGGT GATCGAACAC AACCTGGACG TCATTAAAAC GGCGGACTGG
ATTGTCGACC TCGGCCCGGA AGGCGGCAGC GGCGGCGGCG AAATTCTCGT CTCCGGTACG
CCGGAAACCG TGGCGGAGTG CGAGGCGTCG CATACCGCCC GCTTCCTTAA ACCTATGCTC
AAATAA
 
Protein sequence
MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC
PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID
GEVCDLSDPP KLELQKKHTI EVVIDRFKVR NDLSQRLAES FETALELSGG TAVVADMDDE
KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPDL
SLAGGAIRGW DRRNFYYFQM LKSLAEHYKF DVDAPWASLS ANVHKVVLYG SGKENIEFKY
MNDRGDTSVR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLNREARH
VFVENTPLPA ISDMSIGHAM DFFTNLKLSG QRAKIAEKVL KEIGDRLKFL VNVGLNYLTL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRNLGNT
VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV
PKQRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR
QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE
SRSRGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY
KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL
ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHQLRD QGNTIVVIEH NLDVIKTADW
IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML K