Gene SeHA_C4597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4597 
SymboluvrA 
ID6491674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4469164 
End bp4471989 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content57% 
IMG OID642744668 
Productexcinuclease ABC subunit A 
Protein accessionYP_002048245 
Protein GI194448954 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAGA TCGAAGTTCG GGGCGCCCGC ACCCATAATC TCAAAAATAT TAACCTCGTC 
ATCCCCCGCG ACAAACTGAT TGTCGTGACC GGGCTTTCGG GTTCAGGCAA ATCCTCACTG
GCTTTCGACA CTCTGTATGC CGAAGGGCAG CGTCGTTACG TTGAATCGCT CTCCGCTTAC
GCGCGGCAGT TTTTGTCGCT CATGGAAAAA CCGGATGTCG ACCATATTGA GGGGCTATCG
CCCGCGATCT CAATTGAACA GAAATCGACA TCGCACAACC CGCGCTCTAC GGTGGGTACT
ATTACCGAGA TCCACGACTA CCTGCGCCTG CTGTTTGCCC GCGTGGGCGA GCCGCGTTGT
CCGGACCATG ACGTGCCGCT GGCGGCGCAA ACCGTTAGCC AGATGGTCGA TAACGTACTG
TCACAGCCGG AAGGCAAACG TCTGATGCTG CTCGCGCCGA TTATTAAAGA GCGTAAAGGC
GAACACACCA AAACGCTGGA AAATCTGGCA AGCCAGGGTT ACATTCGCGC CCGTATTGAC
GGCGAAGTCT GCGATCTCTC CGATCCGCCG AAGCTGGAGC TGCAAAAGAA ACATACCATT
GAGGTGGTGA TCGATCGCTT CAAAGTTCGC AACGATCTTT CCCAACGCCT GGCGGAGTCG
TTCGAAACGG CGCTGGAATT ATCCGGCGGC ACGGCGGTTG TTGCCGATAT GGACGATGAG
AAAGCGGAGG AGCTTCTGTT CTCCGCTAAT TTTGCTTGTC CGATTTGCGG CTACAGTATG
CGCGAGCTGG AACCGCGTCT GTTCTCGTTC AACAACCCGG CAGGTGCCTG CCCGACCTGT
GACGGCCTCG GCGTTCAGCA ATATTTCGAT CCGGACCGCG TGATCCAGAA TCCCGACCTG
TCGCTGGCAG GCGGCGCGAT TCGTGGTTGG GATCGTCGCA ATTTTTACTA CTTTCAAATG
CTCAAGTCGC TGGCGGAACA CTATAAGTTC GACGTGGATG CGCCGTGGGC AAGCCTCAGC
GCCAACGTAC ATAAAGTCGT GCTATACGGT TCCGGCAAAG AGAATATTGA ATTTAAATAT
ATGAACGATC GCGGCGATAC TTCCGTGCGC CGCCATCCGT TCGAAGGCGT GCTGCATAAT
ATGGAGCGCC GTTATAAAGA GACGGAATCC AGCGCGGTGC GCGAAGAGCT GGCGAAGTTC
ATCAGTAATC GCCCCTGCGC CAGCTGTGAA GGAACGCGAC TGAATCGCGA AGCGCGCCAT
GTATTTGTGG AAAATACGCC GCTGCCTGCT ATTTCCGATA TGAGCATTGG CCATGCGATG
GATTTTTTCA CTAATCTCAA GCTTTCCGGG CAACGGGCGA AAATCGCCGA AAAAGTGCTA
AAAGAGATCG GCGATCGCCT CAAGTTTCTG GTGAACGTCG GCCTGAACTA TCTCACGCTC
TCCCGCTCGG CAGAGACGCT TTCCGGCGGC GAAGCCCAGC GTATTCGTCT GGCGAGCCAG
ATAGGCGCCG GGTTAGTCGG CGTGATGTAT GTGCTGGATG AGCCGTCCAT CGGTCTGCAC
CAGCGCGATA ACGAACGGCT GCTGGGTACG CTGATTCATC TGCGCAATCT TGGCAATACC
GTGATTGTGG TGGAACACGA TGAAGACGCC ATCCGCGCCG CCGACCATGT GATTGATATT
GGCCCCGGCG CGGGCGTTCA CGGCGGCGAG GTGGTGGCGG AAGGCCCGCT GGAAGCCATT
ATGGCGGTAC CGGAATCGCT GACCGGCCAG TACATGAGCG GTAAACGCAA AATTGAAGTG
CCGAAACAAC GCGTGCCGGC AAATCCAGAA AAAGTGCTCA AACTCACCGG CGCGCGCGGC
AACAACCTGA AAGATGTGAC CCTTACGCTA CCGGTAGGGC TGTTTACCTG TATCACCGGC
GTCTCGGGTT CCGGTAAATC GACGCTGATT AACGACACGC TGTTCCCCAT CGCCCAGCGT
CAGTTAAACG GGGCGACTAT CGCCGAACCG GCGCCGTATC GGGATATTCA GGGGCTGGAA
CATTTCGATA AAGTGATCGA TATCGACCAG AGCCCGATCG GGCGCACCCC GCGTTCCAAC
CCGGCGACCT ATACGGGTGT CTTTACCCCG GTTCGCGAGC TTTTTGCTGG CGTGCCGGAG
TCTCGCTCGC GCGGCTATAC GCCAGGGCGA TTCAGCTTCA ACGTGCGCGG CGGTCGCTGC
GAAGCGTGCC AGGGCGATGG CGTCATTAAA GTCGAAATGC ACTTTCTGCC GGATATTTAC
GTGCCGTGCG ACCAGTGCAA AGGCAAGCGC TATAACCGGG AAACGCTGGA GATTAAGTAC
AAAGGCAAGA CCATCCACGA AGTACTGGAT ATGACCATTG AAGAAGCGCG TGAGTTCTTT
GATGCGGTTC CGGCGCTGGC GCGTAAGCTG CAGACGCTGA TGGACGTGGG GCTGACCTAT
ATCCGTCTTG GTCAGTCGGC GACAACGCTT TCCGGCGGCG AGGCCCAGCG CGTGAAGCTG
GCGCGCGAAC TGTCGAAGCG CGGCACCGGG CAGACGCTGT ATATTCTCGA CGAGCCAACC
ACCGGCCTGC ACTTTGCCGA TATTCAGCAG TTGCTTGACG TTCTGCATCA GTTGCGCGAT
CAGGGCAACA CCATCGTGGT GATCGAACAC AACCTGGACG TCATTAAAAC GGCGGACTGG
ATTGTCGACC TCGGCCCGGA AGGCGGCAGC GGCGGCGGCG AAATTCTCGT CTCCGGTACG
CCGGAAACCG TGGCGGAGTG CGAGGCGTCG CATACCGCCC GCTTCCTTAA ACCTATGCTC
AAATAA
 
Protein sequence
MDKIEVRGAR THNLKNINLV IPRDKLIVVT GLSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSLMEK PDVDHIEGLS PAISIEQKST SHNPRSTVGT ITEIHDYLRL LFARVGEPRC
PDHDVPLAAQ TVSQMVDNVL SQPEGKRLML LAPIIKERKG EHTKTLENLA SQGYIRARID
GEVCDLSDPP KLELQKKHTI EVVIDRFKVR NDLSQRLAES FETALELSGG TAVVADMDDE
KAEELLFSAN FACPICGYSM RELEPRLFSF NNPAGACPTC DGLGVQQYFD PDRVIQNPDL
SLAGGAIRGW DRRNFYYFQM LKSLAEHYKF DVDAPWASLS ANVHKVVLYG SGKENIEFKY
MNDRGDTSVR RHPFEGVLHN MERRYKETES SAVREELAKF ISNRPCASCE GTRLNREARH
VFVENTPLPA ISDMSIGHAM DFFTNLKLSG QRAKIAEKVL KEIGDRLKFL VNVGLNYLTL
SRSAETLSGG EAQRIRLASQ IGAGLVGVMY VLDEPSIGLH QRDNERLLGT LIHLRNLGNT
VIVVEHDEDA IRAADHVIDI GPGAGVHGGE VVAEGPLEAI MAVPESLTGQ YMSGKRKIEV
PKQRVPANPE KVLKLTGARG NNLKDVTLTL PVGLFTCITG VSGSGKSTLI NDTLFPIAQR
QLNGATIAEP APYRDIQGLE HFDKVIDIDQ SPIGRTPRSN PATYTGVFTP VRELFAGVPE
SRSRGYTPGR FSFNVRGGRC EACQGDGVIK VEMHFLPDIY VPCDQCKGKR YNRETLEIKY
KGKTIHEVLD MTIEEAREFF DAVPALARKL QTLMDVGLTY IRLGQSATTL SGGEAQRVKL
ARELSKRGTG QTLYILDEPT TGLHFADIQQ LLDVLHQLRD QGNTIVVIEH NLDVIKTADW
IVDLGPEGGS GGGEILVSGT PETVAECEAS HTARFLKPML K