Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3339 |
Symbol | uvrA |
ID | 5708294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3852436 |
End bp | 3855399 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272766 |
Product | excinuclease ABC subunit A |
Protein accession | YP_001538133 |
Protein GI | 159038880 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.535422 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000450974 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCTGACC GACTGATCAT CCGTGGCGCC CGCGAGCACA ACTTGCGTGA CGTCAGTCTC GACCTGCCGC GGGACGCGCT CATCGTGTTC ACCGGGCTGT CCGGGTCCGG CAAGTCGAGC CTGGCCTTCG ACACGATCTT CGCCGAGGGG CAGCGGCGCT ATGTCGAGTC GCTGTCGTCC TATGCCCGGC AGTTCCTGGG CCAGATGGAC AAGCCGGACG TGGACTTCAT CGAGGGTCTC AGTCCGGCCG TCTCCATCGA CCAGAAGTCG ACCTCGCGCA ACCCTCGGTC GACGGTGGGC ACGATCACCG AGGTCTACGA CTACCTGCGT CTGCTCTTCG CCCGCATCGG TCAGCCGCAC TGCCCGGTCT GCGGTGAGCG GATCTCCCGC CAGACCCCGC AGCAGATCGT CGACCGGGTC CTCGCCATGG CCGAGGGCAC GCGGTTCATG GTTCTCGCGC CGGTCGTTCG TGGTCGTAAG GGCGAATATG TGGACCTTTT CGCCGAGCTC CAGGCGAAGG GCTACGCGCG CGCCCGCGTG GACGGGGTGG TGCACCCGCT GACCGAGCCA CCGAAGCTCA AGAAGCAGGA GAAGCACACC ATCGAGGTGG TCATCGACCG GCTCACCGTG AAGCCGTCGG CCAAGCAGCG GCTGACGGAT TCGGTCGAGG CGGCGCTCGG GCTCTCCGCC GGTCTGGTCC TGCTCGACTT CGTCGACCTG CCGGAGGACG ACCCGGATCG GGAGCGCCGC TACTCGGAGC ACCTGGCCTG TCCCAACGAT CACCAGCTCG CGATCGAGGA CCTGGAGCCC CGGGTCTTCT CCTTCAACGC GCCGTACGGT GCGTGCCCGG AGTGCACCGG CCTGGGTACG AAGAAGGAGG TCGACCCGGA GCTGGTGATC CCCGACCCGG AGCGCACCCT GCGGGAGGGG GCGATCCAGC CCTGGTCCGG CGGGCACAGC CTGGAATACT TCCTGCGCCT GCTGGAGGCG CTGGGCGAGG CGGAGCACTT CGACATCGAC ACGCCGTGGC GGGCGTTGCC GTCCCGGGCG CAGAAGACGA TCCTGCATGG CGCCGAGGAC CAGGTGCATG TGCGGTACCG GAACAAGTAC GGCCGGGAGC GCTCGTATTA CACCGGGTTC GAGGGCGTGA TGCAGTGGAT CGAGCGCCGG CACTCCGACA CCGAGTCGGA GTGGTCCCGG GAGAAGTACG AGGGTTACAT GCGGGACGTG CCCTGCGCGG CCTGCGGCGG TGCCCGGCTC AAGCCGGAGG TGCTCGCGGT GACCGTCGCC GGTCGGAGTA TCGCCGAGGT GTGCGCGATG TCCGTCGGTG AGTGCGCCGA GCTGCTCGCC GGCGTCGAAC TGACCGATCG GCAGCGGTTG ATCGCCGAGC GGGTCCTCAA GGAGATCAAC GCCAGACTGC GGTTCCTGCT GGACGTCGGC CTCGACTATC TCTCCCTGGA CCGTCCCGCC GGCACCCTCT CCGGCGGCGA GGCGCAGCGC ATCCGGTTGG CCACCCAGAT CGGTTCCGGC CTGGTCGGGG TGCTCTACGT GCTGGACGAG CCCTCGATCG GTCTGCACCA GCGGGACAAC CACCGGTTGA TCGAGACGTT GCTGCGGCTG CGGGGGCTCG GCAACACGTT GATCGTGGTC GAGCACGACG AGGACACCAT CCGCACCGCG GACTGGATCG TCGACATCGG CCCGGGGGCG GGCGAGCACG GGGGCCGGAT CGTGCACAGC GGGTCGGTCC CGGCGCTCCT GGACAACCCG GAGTCGATGA CCGGGGCGTA CCTGTCCGGC CGGAAGGAGA TCCCGACGCC GGGGCAGCGC CGTCCGCAGA CGCCGGGACG GGAGTTGACG GTGCAGGGGG CCCGCGAGCA CAACCTGCGG AACCTGACCG TGACGTTCCC GCTCGGTCAG CTGATCGCCG TCACCGGGGT CAGCGGTTCC GGTAAGTCGA CCCTGGTCAA CGACATCCTG TACGCGGTCC TGGCCAACCA GATCAACGGG GCGCGGTTGG TGCCCGGCCG GCACACCCGG GTCGCCGGCC TGGAGCATGT GGACAAGGTC GTCGGGGTGG ACCAGTCGCC GATCGGTCGC ACCCCACGTT CCAATCCGGC CACCTACACC GGCGTCTGGG ACCACGTTCG TAAGCTGTTC GCCGAGACCG TCGAGGCCAA GGTCCGGGGG TACGGGCCGG GCCGGTTCTC GTTCAACGTC AAGGGCGGCC GGTGTGAGGC GTGCTCCGGT GACGGCACCA TCAAGATCGA GATGAACTTC CTGCCCGACG TGTACGTGCC GTGCGAGGTC TGCAAGGGCG CCCGCTACAA CCGGGAGACC CTGGAGGTGC ACTACAAGGG CAAGACCGTC TCGGATGTGC TGGAGATGCC GATCGAGGAG GCGGCGGAGT TCTTCTCCGC CATCCCGGCC ATCCACCGGC ACCTCAGCAC GCTGGTTGAC GTGGGCCTTG GCTACGTCCG GCTGGGCCAG CCCGCGCCGA CCCTCTCCGG CGGGGAGGCG CAGCGGGTGA AGCTCGCCTC CGAGCTGCAG AAGCGCTCCA CCGGGCGGAC GGTCTACGTG CTCGACGAGC CGACCACCGG ACTGCACTTC GAGGACATCC GTAAGCTGCT GATGGTGCTG GAGGGGCTGG TCGACAAGGG CAACACGGTG ATCACGATCG AACACAACCT CGACGTGATC AAGACCGCTG ACTGGATCAT CGACATGGGG CCGGAGGGCG GCCACCGCGG CGGCACGGTG CTCGCCACCG GCACCCCGGA GGAGGTCGCG GAGGTGCCCG ACAGCCACAC CGGCCAGTTC GTGCGCCAGG TGCTCAAGCT CGACGGTGAG GCCAAGGGCG CCGCGGCAGC CACCTCTCGC GCGGCCAGGG CCAACGGCGT GAAGGCCCGG GCGAACGGTG CCAAAACCCG CGCGGCTCGG AAGGCGCCCG CCAAGGCCCG GTGA
|
Protein sequence | MADRLIIRGA REHNLRDVSL DLPRDALIVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSS YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLFARIGQPH CPVCGERISR QTPQQIVDRV LAMAEGTRFM VLAPVVRGRK GEYVDLFAEL QAKGYARARV DGVVHPLTEP PKLKKQEKHT IEVVIDRLTV KPSAKQRLTD SVEAALGLSA GLVLLDFVDL PEDDPDRERR YSEHLACPND HQLAIEDLEP RVFSFNAPYG ACPECTGLGT KKEVDPELVI PDPERTLREG AIQPWSGGHS LEYFLRLLEA LGEAEHFDID TPWRALPSRA QKTILHGAED QVHVRYRNKY GRERSYYTGF EGVMQWIERR HSDTESEWSR EKYEGYMRDV PCAACGGARL KPEVLAVTVA GRSIAEVCAM SVGECAELLA GVELTDRQRL IAERVLKEIN ARLRFLLDVG LDYLSLDRPA GTLSGGEAQR IRLATQIGSG LVGVLYVLDE PSIGLHQRDN HRLIETLLRL RGLGNTLIVV EHDEDTIRTA DWIVDIGPGA GEHGGRIVHS GSVPALLDNP ESMTGAYLSG RKEIPTPGQR RPQTPGRELT VQGAREHNLR NLTVTFPLGQ LIAVTGVSGS GKSTLVNDIL YAVLANQING ARLVPGRHTR VAGLEHVDKV VGVDQSPIGR TPRSNPATYT GVWDHVRKLF AETVEAKVRG YGPGRFSFNV KGGRCEACSG DGTIKIEMNF LPDVYVPCEV CKGARYNRET LEVHYKGKTV SDVLEMPIEE AAEFFSAIPA IHRHLSTLVD VGLGYVRLGQ PAPTLSGGEA QRVKLASELQ KRSTGRTVYV LDEPTTGLHF EDIRKLLMVL EGLVDKGNTV ITIEHNLDVI KTADWIIDMG PEGGHRGGTV LATGTPEEVA EVPDSHTGQF VRQVLKLDGE AKGAAAATSR AARANGVKAR ANGAKTRAAR KAPAKAR
|
| |