Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_1109 |
Symbol | uvrA |
ID | 4485772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 1228829 |
End bp | 1231672 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639729884 |
Product | excinuclease ABC subunit A |
Protein accession | YP_872867 |
Protein GI | 117928316 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.57709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.229428 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATC GGCTTGTCAT CCGGGGCGCT CGCGAGCACA ACCTCAAGGA CGTCTCTCTC GATCTCCCGC GGAACGCATT GATCGTCTTC ACCGGCCTCT CCGGATCGGG AAAATCAAGC CTCGCATTTG ACACGATCTT CGCGGAGGGC CAGCGGCGGT ACGTGGAGTC CCTCTCGGCG TACGCCCGGC AATTTCTCGG TCAGATGGAC AAGCCGGACG TTGATTTCAT CGAGGGGCTC TCGCCGGCGG TGTCCATCGA CCAAAAATCG ACGTCGCGAA ATCCGCGCTC CACGGTGGGT ACGATCACCG AGGTTTACGA CTACCTGCGT CTGCTGTACG CGCGGATCGG GCATCCGCAC TGCCCGGTCT GCGGACGGGC GATTTCCCGC CAGACGCCGC AGCAGATCGT CGACCGCATC CTGGAATTTC CCGCCGGTAC GCGATTCCAG GTGCTTGCGC CGGTCGTCCG CGGCCGCAAA GGCGAGTACG CCGAGATGTT CGCCGAGCTG CAGAGCAAAG GTTTCGTGCG GGTACGGGTC GACGGCGTCG TCCACCCGCT GGACAATCCG CCTCGGTTGA AAAAGCAGGA GAAGCACACC ATCGACGTCG TGGTCGACCG GCTTGCCGTC AAGGAGAACG TCGCCCATCG GCTCGTTGAC TCGATTGAGA CGGCGCTGCG GCTCTCCGGA GGTCTGGTCA CGCTCGAATT CGTCGATCTG CCGGAGAGCG ACCCGCACCG GGAGCGAATG TTCTCCGAAC ATCTCGCCTG TCCCGACGAC GACCTGGATT TTGAAGAATT GGAGCCGCGT TCTTTCTCCT TCAATTCGCC GTACGGCGCC TGTCCGGAAT GCACCGGCCT CGGCACCCGG CTCGTCGCCG ATCCCGATCT CATCGTTCCG GATCCGAGCA AATCCATCGC GGACGGCGCC ATCGCGCCGT GGGCGACCGG GCACGTGTCG GACTACTTTG TCCGCCTCCT CGAGGCATTG GCCGACGCGG CGGGATTCTC CACCCAGACC CGCTGGGACC GGCTGCCCGC CAAAGCCCGC AAGTACATCC TGTACGGGTA CCCCGAACCG TTGTACGTCC AATACCGCAA TCGGTACGGC CGGCACCGGT CATACCACAC GACCTACGAA GGTGTCGTGC CGTACATCGA ACGCCGGCAC GGCGAAGTCG ACAGCGACTA CAGCCGGGAG CGGTTCGAGA GCTACATGCG GGAAGTGCCG TGCCCGAAGT GCCAGGGCAA ACGGCTCAAG CCGCTCGCGC TCGCCGTGAC GGTCGGCGGG AAGTCCATCG CCGACGTGTC CGCCATGCCC ATCAGCGAAT GTGCGCGATT CCTGCGCGCC CTTGACCTAA CCGCACGGGA ACGGCAGATC GCCGAGCGCG TGCTCAAAGA AGTGAACACC CGCCTGGGTT TCCTGCTGGA CGTCGGTCTC GACTACCTCA CGCTTGACCG TCCCGCCGCT ACGTTGGCGG GTGGCGAGGC GCAACGCATT CGGCTGGCCA CCCAGATCGG CTCCGGTCTT GTCGGCGTTC TCTATGTGCT GGACGAGCCG AGTATCGGCC TGCACCAGCG CGACAACCAC CGTCTCATCG AGACCCTGCT CCGCCTCCGC GATCTCGGCA ATACGCTGAT CGTGGTCGAG CACGATGAGG ACACCATCCG CGCGGCGGAT TGGGTGGTCG ACATCGGTCC CGGCGCCGGT GAGCACGGTG GACAGATCGT CGTCTCCGGG CCGGTGCCTG AACTGCTGGC GTGCAAAGAG TCGCTCACCG GCGCGTACCT TTCCGGACGG CGAACCATCG AGGTACCGGC GATCCGGCGG CCCGCCGATC CCAAGCGCCG GCTGATCGTC AAGGGTGCCC GAGAGCACAA CCTCCGCAAC ATCGACGTCG CGTTTCCGCT GGGCTGCTTC GTCGTCGTGA CCGGCGTCTC GGGCTCGGGA AAATCAACGC TGGTCAACGA CATTCTCTAT GCCGCGTTGG CCAAGGAACT GAACGGGGCG AAAACCGTGC CCGGCCGGCA CGACCGCATC CTCGGTCTCG ACCTCGTCGA CAAGGCGATT CACGTCGACC AGAGCCCGAT CGGCCGGACG CCGCGGTCGA ACCCCGCGAC CTACACCGGT GTGTTTGATG ACATCCGCCG GCTCTTCGCG GAGACCACGG AAGCCAAGGT GCGCGGATAC ACTCCCGGGC GTTTCTCCTT CAACGTGAAG GGCGGCCGCT GCGAAGCGTG CGCCGGCGAC GGCACCATCA AAATCGAAAT GAATTTCCTG CCCGACGTTT ACGTGCCGTG CGAAGTGTGC CATGGCGCGC GGTACAACCG GGAGACCCTC GAGGTGCATT ACAAGGGCAA GACAATTGCC GACGTCCTGG ACATGCCGAT CGAAGAAGCA GCCGACTTCT TCGGCCCGCT GCCGGGCATC GCCCGCCGGC TCACCACTCT CGTCGAGGTC GGTCTGGGTT ATGTCCGACT CGGCCAACCG GCCCCGACCC TCTCCGGCGG GGAAGCGCAG CGGGTGAAAC TCGCGTCCGA GCTGCAGAAG CGGGCGACCG GTCGCACCGT GTACATCCTC GACGAGCCGA CGACCGGCCT GCATTTCGAA GACGTCCGCA AGCTGCTTGC CGTTCTGCAA TCCCTTGTCG AGCGCGGCAA TACCGTGATC GTCATTGAGC ATAATCTTGA CGTCATCAAG ACGGCCGACT GGGTGATCGA CCTCGGTCCC GAGGGCGGCG CCGCCGGCGG ACAGGTGGTC GCCGAGGGTC CACCCGAACA GGTGGCGGCG ACGGAAGGCA GCTACACCGG CGCCTTTCTG CGCAAGGTGC TCCGCCTCTC CTAA
|
Protein sequence | MADRLVIRGA REHNLKDVSL DLPRNALIVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSA YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLYARIGHPH CPVCGRAISR QTPQQIVDRI LEFPAGTRFQ VLAPVVRGRK GEYAEMFAEL QSKGFVRVRV DGVVHPLDNP PRLKKQEKHT IDVVVDRLAV KENVAHRLVD SIETALRLSG GLVTLEFVDL PESDPHRERM FSEHLACPDD DLDFEELEPR SFSFNSPYGA CPECTGLGTR LVADPDLIVP DPSKSIADGA IAPWATGHVS DYFVRLLEAL ADAAGFSTQT RWDRLPAKAR KYILYGYPEP LYVQYRNRYG RHRSYHTTYE GVVPYIERRH GEVDSDYSRE RFESYMREVP CPKCQGKRLK PLALAVTVGG KSIADVSAMP ISECARFLRA LDLTARERQI AERVLKEVNT RLGFLLDVGL DYLTLDRPAA TLAGGEAQRI RLATQIGSGL VGVLYVLDEP SIGLHQRDNH RLIETLLRLR DLGNTLIVVE HDEDTIRAAD WVVDIGPGAG EHGGQIVVSG PVPELLACKE SLTGAYLSGR RTIEVPAIRR PADPKRRLIV KGAREHNLRN IDVAFPLGCF VVVTGVSGSG KSTLVNDILY AALAKELNGA KTVPGRHDRI LGLDLVDKAI HVDQSPIGRT PRSNPATYTG VFDDIRRLFA ETTEAKVRGY TPGRFSFNVK GGRCEACAGD GTIKIEMNFL PDVYVPCEVC HGARYNRETL EVHYKGKTIA DVLDMPIEEA ADFFGPLPGI ARRLTTLVEV GLGYVRLGQP APTLSGGEAQ RVKLASELQK RATGRTVYIL DEPTTGLHFE DVRKLLAVLQ SLVERGNTVI VIEHNLDVIK TADWVIDLGP EGGAAGGQVV AEGPPEQVAA TEGSYTGAFL RKVLRLS
|
| |