Gene Sare_3339 details

Gene Information

Locus tag: Sare_3339
Symbol: uvrA
ID: 5708294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3852436
End bp: 3855399
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 69%
IMG OID: 641272766
Product: excinuclease ABC subunit A
Protein accession: YP_001538133
Protein GI: 159038880
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
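
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the gene record above.
START_BP = 3852436
END_BP = 3855399


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))  # 2964
print(protein_length(2964))           # 987
```

Both values agree with the Gene Length and Protein Length fields of the record, which supports the inclusive-coordinate assumption.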


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.535422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000450974
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGCTGACC GACTGATCAT CCGTGGCGCC CGCGAGCACA ACTTGCGTGA CGTCAGTCTC 
GACCTGCCGC GGGACGCGCT CATCGTGTTC ACCGGGCTGT CCGGGTCCGG CAAGTCGAGC
CTGGCCTTCG ACACGATCTT CGCCGAGGGG CAGCGGCGCT ATGTCGAGTC GCTGTCGTCC
TATGCCCGGC AGTTCCTGGG CCAGATGGAC AAGCCGGACG TGGACTTCAT CGAGGGTCTC
AGTCCGGCCG TCTCCATCGA CCAGAAGTCG ACCTCGCGCA ACCCTCGGTC GACGGTGGGC
ACGATCACCG AGGTCTACGA CTACCTGCGT CTGCTCTTCG CCCGCATCGG TCAGCCGCAC
TGCCCGGTCT GCGGTGAGCG GATCTCCCGC CAGACCCCGC AGCAGATCGT CGACCGGGTC
CTCGCCATGG CCGAGGGCAC GCGGTTCATG GTTCTCGCGC CGGTCGTTCG TGGTCGTAAG
GGCGAATATG TGGACCTTTT CGCCGAGCTC CAGGCGAAGG GCTACGCGCG CGCCCGCGTG
GACGGGGTGG TGCACCCGCT GACCGAGCCA CCGAAGCTCA AGAAGCAGGA GAAGCACACC
ATCGAGGTGG TCATCGACCG GCTCACCGTG AAGCCGTCGG CCAAGCAGCG GCTGACGGAT
TCGGTCGAGG CGGCGCTCGG GCTCTCCGCC GGTCTGGTCC TGCTCGACTT CGTCGACCTG
CCGGAGGACG ACCCGGATCG GGAGCGCCGC TACTCGGAGC ACCTGGCCTG TCCCAACGAT
CACCAGCTCG CGATCGAGGA CCTGGAGCCC CGGGTCTTCT CCTTCAACGC GCCGTACGGT
GCGTGCCCGG AGTGCACCGG CCTGGGTACG AAGAAGGAGG TCGACCCGGA GCTGGTGATC
CCCGACCCGG AGCGCACCCT GCGGGAGGGG GCGATCCAGC CCTGGTCCGG CGGGCACAGC
CTGGAATACT TCCTGCGCCT GCTGGAGGCG CTGGGCGAGG CGGAGCACTT CGACATCGAC
ACGCCGTGGC GGGCGTTGCC GTCCCGGGCG CAGAAGACGA TCCTGCATGG CGCCGAGGAC
CAGGTGCATG TGCGGTACCG GAACAAGTAC GGCCGGGAGC GCTCGTATTA CACCGGGTTC
GAGGGCGTGA TGCAGTGGAT CGAGCGCCGG CACTCCGACA CCGAGTCGGA GTGGTCCCGG
GAGAAGTACG AGGGTTACAT GCGGGACGTG CCCTGCGCGG CCTGCGGCGG TGCCCGGCTC
AAGCCGGAGG TGCTCGCGGT GACCGTCGCC GGTCGGAGTA TCGCCGAGGT GTGCGCGATG
TCCGTCGGTG AGTGCGCCGA GCTGCTCGCC GGCGTCGAAC TGACCGATCG GCAGCGGTTG
ATCGCCGAGC GGGTCCTCAA GGAGATCAAC GCCAGACTGC GGTTCCTGCT GGACGTCGGC
CTCGACTATC TCTCCCTGGA CCGTCCCGCC GGCACCCTCT CCGGCGGCGA GGCGCAGCGC
ATCCGGTTGG CCACCCAGAT CGGTTCCGGC CTGGTCGGGG TGCTCTACGT GCTGGACGAG
CCCTCGATCG GTCTGCACCA GCGGGACAAC CACCGGTTGA TCGAGACGTT GCTGCGGCTG
CGGGGGCTCG GCAACACGTT GATCGTGGTC GAGCACGACG AGGACACCAT CCGCACCGCG
GACTGGATCG TCGACATCGG CCCGGGGGCG GGCGAGCACG GGGGCCGGAT CGTGCACAGC
GGGTCGGTCC CGGCGCTCCT GGACAACCCG GAGTCGATGA CCGGGGCGTA CCTGTCCGGC
CGGAAGGAGA TCCCGACGCC GGGGCAGCGC CGTCCGCAGA CGCCGGGACG GGAGTTGACG
GTGCAGGGGG CCCGCGAGCA CAACCTGCGG AACCTGACCG TGACGTTCCC GCTCGGTCAG
CTGATCGCCG TCACCGGGGT CAGCGGTTCC GGTAAGTCGA CCCTGGTCAA CGACATCCTG
TACGCGGTCC TGGCCAACCA GATCAACGGG GCGCGGTTGG TGCCCGGCCG GCACACCCGG
GTCGCCGGCC TGGAGCATGT GGACAAGGTC GTCGGGGTGG ACCAGTCGCC GATCGGTCGC
ACCCCACGTT CCAATCCGGC CACCTACACC GGCGTCTGGG ACCACGTTCG TAAGCTGTTC
GCCGAGACCG TCGAGGCCAA GGTCCGGGGG TACGGGCCGG GCCGGTTCTC GTTCAACGTC
AAGGGCGGCC GGTGTGAGGC GTGCTCCGGT GACGGCACCA TCAAGATCGA GATGAACTTC
CTGCCCGACG TGTACGTGCC GTGCGAGGTC TGCAAGGGCG CCCGCTACAA CCGGGAGACC
CTGGAGGTGC ACTACAAGGG CAAGACCGTC TCGGATGTGC TGGAGATGCC GATCGAGGAG
GCGGCGGAGT TCTTCTCCGC CATCCCGGCC ATCCACCGGC ACCTCAGCAC GCTGGTTGAC
GTGGGCCTTG GCTACGTCCG GCTGGGCCAG CCCGCGCCGA CCCTCTCCGG CGGGGAGGCG
CAGCGGGTGA AGCTCGCCTC CGAGCTGCAG AAGCGCTCCA CCGGGCGGAC GGTCTACGTG
CTCGACGAGC CGACCACCGG ACTGCACTTC GAGGACATCC GTAAGCTGCT GATGGTGCTG
GAGGGGCTGG TCGACAAGGG CAACACGGTG ATCACGATCG AACACAACCT CGACGTGATC
AAGACCGCTG ACTGGATCAT CGACATGGGG CCGGAGGGCG GCCACCGCGG CGGCACGGTG
CTCGCCACCG GCACCCCGGA GGAGGTCGCG GAGGTGCCCG ACAGCCACAC CGGCCAGTTC
GTGCGCCAGG TGCTCAAGCT CGACGGTGAG GCCAAGGGCG CCGCGGCAGC CACCTCTCGC
GCGGCCAGGG CCAACGGCGT GAAGGCCCGG GCGAACGGTG CCAAAACCCG CGCGGCTCGG
AAGGCGCCCG CCAAGGCCCG GTGA
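
The GC content field of the record can be recomputed from a sequence laid out in space-separated blocks of ten bases, as above. This is a rough sketch, not the database's own method; `gc_content` is a hypothetical helper, and the example input is only the first row of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "GTGGCTGACC GACTGATCAT CCGTGGCGCC CGCGAGCACA ACTTGCGTGA CGTCAGTCTC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

Passing the full 2964 bp sequence to the same function would give the value to compare with the GC content field.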
 
Protein sequence
MADRLIIRGA REHNLRDVSL DLPRDALIVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSS 
YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLFARIGQPH
CPVCGERISR QTPQQIVDRV LAMAEGTRFM VLAPVVRGRK GEYVDLFAEL QAKGYARARV
DGVVHPLTEP PKLKKQEKHT IEVVIDRLTV KPSAKQRLTD SVEAALGLSA GLVLLDFVDL
PEDDPDRERR YSEHLACPND HQLAIEDLEP RVFSFNAPYG ACPECTGLGT KKEVDPELVI
PDPERTLREG AIQPWSGGHS LEYFLRLLEA LGEAEHFDID TPWRALPSRA QKTILHGAED
QVHVRYRNKY GRERSYYTGF EGVMQWIERR HSDTESEWSR EKYEGYMRDV PCAACGGARL
KPEVLAVTVA GRSIAEVCAM SVGECAELLA GVELTDRQRL IAERVLKEIN ARLRFLLDVG
LDYLSLDRPA GTLSGGEAQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN HRLIETLLRL
RGLGNTLIVV EHDEDTIRTA DWIVDIGPGA GEHGGRIVHS GSVPALLDNP ESMTGAYLSG
RKEIPTPGQR RPQTPGRELT VQGAREHNLR NLTVTFPLGQ LIAVTGVSGS GKSTLVNDIL
YAVLANQING ARLVPGRHTR VAGLEHVDKV VGVDQSPIGR TPRSNPATYT GVWDHVRKLF
AETVEAKVRG YGPGRFSFNV KGGRCEACSG DGTIKIEMNF LPDVYVPCEV CKGARYNRET
LEVHYKGKTV SDVLEMPIEE AAEFFSAIPA IHRHLSTLVD VGLGYVRLGQ PAPTLSGGEA
QRVKLASELQ KRSTGRTVYV LDEPTTGLHF EDIRKLLMVL EGLVDKGNTV ITIEHNLDVI
KTADWIIDMG PEGGHRGGTV LATGTPEEVA EVPDSHTGQF VRQVLKLDGE AKGAAAATSR
AARANGVKAR ANGAKTRAAR KAPAKAR
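
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below builds the codon table by hand rather than using a library, and assumes the usual convention that an alternative start codon such as GTG is read as methionine at the first position. `translate` and the constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 11, in TCAG codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First and last rows of the gene sequence above; each row starts in frame.
print(translate("GTGGCTGACC GACTGATCAT CCGTGGCGCC CGCGAGCACA ACTTGCGTGA CGTCAGTCTC"))
print(translate("AAGGCGCCCG CCAAGGCCCG GTGA"))
```

The first call reproduces the opening residues of the protein sequence (MADRLIIRGA REHNLRDVSL) and the second its final residues (KAPAKAR), with the terminal TGA read as a stop.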