Gene Acel_1110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1110 
SymboluvrC 
ID4485773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1231717 
End bp1233642 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content66% 
IMG OID639729885 
Productexcinuclease ABC subunit C 
Protein accessionYP_872868 
Protein GI117928317 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.537728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.126972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACGCAC CGCAGCTGCG CAAGCCGGCT GCCGGGGAGA TTCCGGAGTC ACCCGGCGTC 
TACCGGTTCT GGGACGCGCA TGATCGGGTC ATCTACGTCG GAAAGGCGAA GAACCTGCGC
GCCCGACTGA CCAGTTACTT CGCGGATCCG GCGCTCCTTC ATCCGCGGAC CCGGGCCATG
GTGACAGCGG CCGCCCGCCT CGACTGGGTC ATCGTCCGCA CCGAGGTTGA GGCGCTGCAA
CTGGAATACA ACTGGATCAA GCAGTACGAG CCTCGCTTCA ACATCAAGTA TCGGGACGAC
AAGAGTTACC CTTACCTCGC CGTCACGCTC GCCGAACCCG TGCCGCGCCT CATGGTTTAT
CGCGGCAAGC GAAAGAAAGG CAACCGGTAT TTCGGGCCCT TTGCGCACGC CTGGGCCATT
CGTGACACTC TGGATTTGCT GCTCCGGGTG TTTCCCGCCC GCACCTGCTC CGCCGGCGTC
TACCGCCGAG CTCAGCGCAT TGGACGGCCG TGCCTGCTGG GATACATCGG AAAATGCAGC
GCACCGTGTG TCGGCTGGGT GTCGGAGGAG CAGCACCGGC AAATCGTCCT CGACTTCTGT
GACGTGATGG GCGGAAGAGC CACGGAGTAC CTCCGCCGGC TGGAAAAGGA CATGCGCGCC
GCAGCCGCCG CGGAGGACTT CGAACGGGCG GCGCGGCTGC GCGACGATGC CGCCGCGCTG
CGGCTGGCCA TCGAAAAACA GACGGTGGTC CTCCCCGAGA ATACCGATGC AGACGTTATC
GCCTTGGCCG ACGACGACTT GGAGGCGGCG GTCCATGTGT TCTTCGTCCG CGATGGACGG
GTCCGCGGTC AGCGCGGCTG GGTGGTGGAG AAGGTCGAAG CGCTGGACAC CGCCGACCTC
GTTCAGCATT TTCTCGCCCA ACTGTACGGC GAAACGGGTG CGGAATCCGC CGACGTGCCG
CGGGAAATCT TGGTGCCCGT CGCACCAAGC GACACCGAGA CGCTCGAACG CTGGCTCAGC
TCACGCCGCG GCGGACGCGT CACCATCCGC GTTCCCCAGC GCGGCGACAA GAAGGCCCTG
CTGGAGACGG TTGCCCAGAA TGCGGCCCAG GCGCTCCACC TGCACAAGGT CCGCCGGGCC
GGCGACCTCA CCGCCCGCGG CCGGGCGCTG CGGGAAATCC AGGAAGCCCT CAATCTTCCC
GACGCACCGC TGCGCATTGA GTGCTACGAC GTGTCCACAT TGCAGGGCAC CGACGTCGTC
GCGTCGATGG TCGTGTTTGA AGACGGCCTG CCGCGGAAGA GCGAATATCG TCGCTTTGCC
CTGCGCGGCG TCGGTGGGGG GGACGTCGGG GCAATTCACG AGGTGATCAG TCGCCGTTTC
CGCCGGTATC TGGACGAGCG GATGCAAACG GACTCGCCCA TCGACGACGG AACCGGGCCG
GACCAGCCGC GGGTCGACGC GGCCGCGCAT CACCGAAAGT TCAGCTACCC GCCGAGCCTG
GTCATTGTTG ACGGCGGTGC GCCGCAGGTG GCGGCGGCAA AGAAGGCCCT CGACGAACTG
GGCATCGATG ACGTTGCCCT GGCGGGACTC GCGAAACGCC TGGAAGAAAT CTGGCTCCCC
GATCGCGAGG AACCCGTCAT CCTGCCGCGG GCCAGCGAAG GACTCTACCT CTTGCAGCGG
TTGCGCGACG AGGCGCACCG TTTCGCCATC TCCTACCACC GTGCGAAACG GTCCACGTCG
ATGACCCGCA GTGTGCTGGA GGGAATCCCG GGAATCGGGG AGACCCGCCG CAAGGCCTTC
CTGCGGCATT TCGGATCGGT TCAGCGAATG CGGCAAGCAA CCGTAGCGGA ATTGGCCGCC
GTGCCGGGCG TGGGACGGCG TACCGCTGAG GTTGTTTTCG CGGCCCTGCA TGGAGCGGAC
CAATGA
 
Protein sequence
MHAPQLRKPA AGEIPESPGV YRFWDAHDRV IYVGKAKNLR ARLTSYFADP ALLHPRTRAM 
VTAAARLDWV IVRTEVEALQ LEYNWIKQYE PRFNIKYRDD KSYPYLAVTL AEPVPRLMVY
RGKRKKGNRY FGPFAHAWAI RDTLDLLLRV FPARTCSAGV YRRAQRIGRP CLLGYIGKCS
APCVGWVSEE QHRQIVLDFC DVMGGRATEY LRRLEKDMRA AAAAEDFERA ARLRDDAAAL
RLAIEKQTVV LPENTDADVI ALADDDLEAA VHVFFVRDGR VRGQRGWVVE KVEALDTADL
VQHFLAQLYG ETGAESADVP REILVPVAPS DTETLERWLS SRRGGRVTIR VPQRGDKKAL
LETVAQNAAQ ALHLHKVRRA GDLTARGRAL REIQEALNLP DAPLRIECYD VSTLQGTDVV
ASMVVFEDGL PRKSEYRRFA LRGVGGGDVG AIHEVISRRF RRYLDERMQT DSPIDDGTGP
DQPRVDAAAH HRKFSYPPSL VIVDGGAPQV AAAKKALDEL GIDDVALAGL AKRLEEIWLP
DREEPVILPR ASEGLYLLQR LRDEAHRFAI SYHRAKRSTS MTRSVLEGIP GIGETRRKAF
LRHFGSVQRM RQATVAELAA VPGVGRRTAE VVFAALHGAD Q