Gene Acel_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1109 
SymboluvrA 
ID4485772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1228829 
End bp1231672 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content65% 
IMG OID639729884 
Productexcinuclease ABC subunit A 
Protein accessionYP_872867 
Protein GI117928316 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.57709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.229428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATC GGCTTGTCAT CCGGGGCGCT CGCGAGCACA ACCTCAAGGA CGTCTCTCTC 
GATCTCCCGC GGAACGCATT GATCGTCTTC ACCGGCCTCT CCGGATCGGG AAAATCAAGC
CTCGCATTTG ACACGATCTT CGCGGAGGGC CAGCGGCGGT ACGTGGAGTC CCTCTCGGCG
TACGCCCGGC AATTTCTCGG TCAGATGGAC AAGCCGGACG TTGATTTCAT CGAGGGGCTC
TCGCCGGCGG TGTCCATCGA CCAAAAATCG ACGTCGCGAA ATCCGCGCTC CACGGTGGGT
ACGATCACCG AGGTTTACGA CTACCTGCGT CTGCTGTACG CGCGGATCGG GCATCCGCAC
TGCCCGGTCT GCGGACGGGC GATTTCCCGC CAGACGCCGC AGCAGATCGT CGACCGCATC
CTGGAATTTC CCGCCGGTAC GCGATTCCAG GTGCTTGCGC CGGTCGTCCG CGGCCGCAAA
GGCGAGTACG CCGAGATGTT CGCCGAGCTG CAGAGCAAAG GTTTCGTGCG GGTACGGGTC
GACGGCGTCG TCCACCCGCT GGACAATCCG CCTCGGTTGA AAAAGCAGGA GAAGCACACC
ATCGACGTCG TGGTCGACCG GCTTGCCGTC AAGGAGAACG TCGCCCATCG GCTCGTTGAC
TCGATTGAGA CGGCGCTGCG GCTCTCCGGA GGTCTGGTCA CGCTCGAATT CGTCGATCTG
CCGGAGAGCG ACCCGCACCG GGAGCGAATG TTCTCCGAAC ATCTCGCCTG TCCCGACGAC
GACCTGGATT TTGAAGAATT GGAGCCGCGT TCTTTCTCCT TCAATTCGCC GTACGGCGCC
TGTCCGGAAT GCACCGGCCT CGGCACCCGG CTCGTCGCCG ATCCCGATCT CATCGTTCCG
GATCCGAGCA AATCCATCGC GGACGGCGCC ATCGCGCCGT GGGCGACCGG GCACGTGTCG
GACTACTTTG TCCGCCTCCT CGAGGCATTG GCCGACGCGG CGGGATTCTC CACCCAGACC
CGCTGGGACC GGCTGCCCGC CAAAGCCCGC AAGTACATCC TGTACGGGTA CCCCGAACCG
TTGTACGTCC AATACCGCAA TCGGTACGGC CGGCACCGGT CATACCACAC GACCTACGAA
GGTGTCGTGC CGTACATCGA ACGCCGGCAC GGCGAAGTCG ACAGCGACTA CAGCCGGGAG
CGGTTCGAGA GCTACATGCG GGAAGTGCCG TGCCCGAAGT GCCAGGGCAA ACGGCTCAAG
CCGCTCGCGC TCGCCGTGAC GGTCGGCGGG AAGTCCATCG CCGACGTGTC CGCCATGCCC
ATCAGCGAAT GTGCGCGATT CCTGCGCGCC CTTGACCTAA CCGCACGGGA ACGGCAGATC
GCCGAGCGCG TGCTCAAAGA AGTGAACACC CGCCTGGGTT TCCTGCTGGA CGTCGGTCTC
GACTACCTCA CGCTTGACCG TCCCGCCGCT ACGTTGGCGG GTGGCGAGGC GCAACGCATT
CGGCTGGCCA CCCAGATCGG CTCCGGTCTT GTCGGCGTTC TCTATGTGCT GGACGAGCCG
AGTATCGGCC TGCACCAGCG CGACAACCAC CGTCTCATCG AGACCCTGCT CCGCCTCCGC
GATCTCGGCA ATACGCTGAT CGTGGTCGAG CACGATGAGG ACACCATCCG CGCGGCGGAT
TGGGTGGTCG ACATCGGTCC CGGCGCCGGT GAGCACGGTG GACAGATCGT CGTCTCCGGG
CCGGTGCCTG AACTGCTGGC GTGCAAAGAG TCGCTCACCG GCGCGTACCT TTCCGGACGG
CGAACCATCG AGGTACCGGC GATCCGGCGG CCCGCCGATC CCAAGCGCCG GCTGATCGTC
AAGGGTGCCC GAGAGCACAA CCTCCGCAAC ATCGACGTCG CGTTTCCGCT GGGCTGCTTC
GTCGTCGTGA CCGGCGTCTC GGGCTCGGGA AAATCAACGC TGGTCAACGA CATTCTCTAT
GCCGCGTTGG CCAAGGAACT GAACGGGGCG AAAACCGTGC CCGGCCGGCA CGACCGCATC
CTCGGTCTCG ACCTCGTCGA CAAGGCGATT CACGTCGACC AGAGCCCGAT CGGCCGGACG
CCGCGGTCGA ACCCCGCGAC CTACACCGGT GTGTTTGATG ACATCCGCCG GCTCTTCGCG
GAGACCACGG AAGCCAAGGT GCGCGGATAC ACTCCCGGGC GTTTCTCCTT CAACGTGAAG
GGCGGCCGCT GCGAAGCGTG CGCCGGCGAC GGCACCATCA AAATCGAAAT GAATTTCCTG
CCCGACGTTT ACGTGCCGTG CGAAGTGTGC CATGGCGCGC GGTACAACCG GGAGACCCTC
GAGGTGCATT ACAAGGGCAA GACAATTGCC GACGTCCTGG ACATGCCGAT CGAAGAAGCA
GCCGACTTCT TCGGCCCGCT GCCGGGCATC GCCCGCCGGC TCACCACTCT CGTCGAGGTC
GGTCTGGGTT ATGTCCGACT CGGCCAACCG GCCCCGACCC TCTCCGGCGG GGAAGCGCAG
CGGGTGAAAC TCGCGTCCGA GCTGCAGAAG CGGGCGACCG GTCGCACCGT GTACATCCTC
GACGAGCCGA CGACCGGCCT GCATTTCGAA GACGTCCGCA AGCTGCTTGC CGTTCTGCAA
TCCCTTGTCG AGCGCGGCAA TACCGTGATC GTCATTGAGC ATAATCTTGA CGTCATCAAG
ACGGCCGACT GGGTGATCGA CCTCGGTCCC GAGGGCGGCG CCGCCGGCGG ACAGGTGGTC
GCCGAGGGTC CACCCGAACA GGTGGCGGCG ACGGAAGGCA GCTACACCGG CGCCTTTCTG
CGCAAGGTGC TCCGCCTCTC CTAA
 
Protein sequence
MADRLVIRGA REHNLKDVSL DLPRNALIVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSA 
YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TSRNPRSTVG TITEVYDYLR LLYARIGHPH
CPVCGRAISR QTPQQIVDRI LEFPAGTRFQ VLAPVVRGRK GEYAEMFAEL QSKGFVRVRV
DGVVHPLDNP PRLKKQEKHT IDVVVDRLAV KENVAHRLVD SIETALRLSG GLVTLEFVDL
PESDPHRERM FSEHLACPDD DLDFEELEPR SFSFNSPYGA CPECTGLGTR LVADPDLIVP
DPSKSIADGA IAPWATGHVS DYFVRLLEAL ADAAGFSTQT RWDRLPAKAR KYILYGYPEP
LYVQYRNRYG RHRSYHTTYE GVVPYIERRH GEVDSDYSRE RFESYMREVP CPKCQGKRLK
PLALAVTVGG KSIADVSAMP ISECARFLRA LDLTARERQI AERVLKEVNT RLGFLLDVGL
DYLTLDRPAA TLAGGEAQRI RLATQIGSGL VGVLYVLDEP SIGLHQRDNH RLIETLLRLR
DLGNTLIVVE HDEDTIRAAD WVVDIGPGAG EHGGQIVVSG PVPELLACKE SLTGAYLSGR
RTIEVPAIRR PADPKRRLIV KGAREHNLRN IDVAFPLGCF VVVTGVSGSG KSTLVNDILY
AALAKELNGA KTVPGRHDRI LGLDLVDKAI HVDQSPIGRT PRSNPATYTG VFDDIRRLFA
ETTEAKVRGY TPGRFSFNVK GGRCEACAGD GTIKIEMNFL PDVYVPCEVC HGARYNRETL
EVHYKGKTIA DVLDMPIEEA ADFFGPLPGI ARRLTTLVEV GLGYVRLGQP APTLSGGEAQ
RVKLASELQK RATGRTVYIL DEPTTGLHFE DVRKLLAVLQ SLVERGNTVI VIEHNLDVIK
TADWVIDLGP EGGAAGGQVV AEGPPEQVAA TEGSYTGAFL RKVLRLS