Gene Acel_0968 details


Gene Information

Locus tag: Acel_0968
Symbol:
ID: 4485383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1063532
End bp: 1065232
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 69%
IMG OID: 639729743
Product: hypothetical protein
Protein accession: YP_872727
Protein GI: 117928176
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family
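
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(1063532, 1065232)
print(gene_len, protein_length_aa(gene_len))  # 1701 566
```

Both values agree with the Gene Length and Protein Length fields of this record.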


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.277294
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGAGCG GTCCTGGGCC GGGGTTCCGG CAAACCACGA TCGACGATCT CGACACACCG 
CTGCGGGACG TCACGTTCGT CGTCGTCGAC CTGGAGACGA CAGGTGGTGC GCCCGGAACC
GGAGCCATCA CGGAGCTCGC CGCCGTGAAG GTCCGCGGCG GTGAGGTGCT CGGTGAGTTA
CAGACGCTGG TCAACCCGGG CCGGCCGATT CCGCCGTTCA TCACCGTGCT CACCGGCATC
ACCGACGCGA TGGTCGCCCC CGCACCACCG ATTGACGCCG TCCTGCCGAC ATTCCTGGAA
TTCGCGACCG GCAGCGTTCT GGTTGCCCAT AACGCGCCGT TTGACGTCGG TTTCCTCCGC
GCCGCCTGTG CCGCCGCCGG TTATGACTGG CCGGAGTTCG AGGTCCTCGA CACGGCACGA
CTGGCCCGCA GAGTCCTCAC CCGGGAGGAA GCGCCGAATT GCACCCTCGC CACGCTGGCG
CGGGTGTTCA GGGCGAAAAC CACGCCGAAT CACCGGGCGC TCGCCGACGT ATACGCGACC
ATCGACGTCC TGCACGGCCT GCTCGAACGA TTGGGCTCCT TCGGCGTCGG GTCCCTGGCC
GACGTGCGAA CCTTCAGCGC CAAGGTCACC GCGACGCAAC GCCAGAAACG CCACCTCGCC
GAGCGGCTGC CGCACGCGCC CGGCGTCTAC CTGTTCCGGG ACCGTCAGGG CCGGGTGCTG
TATGTCGGAA AAGCCAAGGA CCTTCGACAG CGGGTGCGCA GCTATTTCAC GGCAGCGGAG
ACCCGCGGGC GGATCGTCGA CATGCTCCGC GTCGCCGAGG ACGTCACACC CATCGTCTGC
GCCACCGAGC TCGAAGCAGA AATTCGCGAA GTGCGGCTCA TCGCCGAACA CAAGCCTGCG
TACAACCGGC GATCGAAATT CCCGGAGCGG ACCTGGTGGG TGAAGCTCAC CGATGAACGC
TTCCCACGGC TTGCCGTCGT CCATACGGTT CGCAACGACG CAGCCACCTA CCTCGGACCG
TTCAATGTCC GGGAGAGCGC GCTCCTGGCG GTGGAGGCGC TGCAGGACGC CTTCCCCATC
CGCCGATGCC CCGACCGGCT CGGACCCCGA ACCCGCCGAC CGCCCTGTGC ATGGTTCGAA
CTCGGCCGGT GCGGCGCGCC GTGCACCGGT GCGCAATCGC CGGACGCGTA CCGGTCGGTG
GTCGCCGCGG TCCACCGGGC GATGACGAGC GACCCCTCAG AGGTTGTCGC CGCCGCGCTG
CGCCGCATCA CCCCGCTGGC CGCGGCGCAA CGATACGAAG AAGCTGTCCC GATCCGCGAT
CGGCTCGTCG CCTATCTGCA TGCTGTCGGG CGCGCACAGC GGCTCGCCGC ACTCGCCGGT
TGCCGACAGA TCGTCGCCGC CCGACCTGGT CCGGACGGCG CATGGGACGT CGCCGTCATT
CGCCACGGCC GCCTCGCCGG GGCCGGTCTG ATACCAGCCG GCGAGCCGGA GGTTGACCGC
CAGCTCACCG CGATCGTCGC GACCGCGAGC ACCTCCCTGC CACGGGGTGT CGGGATCACG
GCCTATGCCG ACGCGGAAGA GATGGAGCTG CTCCTGCGCT GGCTCGACCA GCCCGGCGTC
CGCCTGCTCG ACGTCGAGGG AACATGGGCG ATCCCCATCA ACGGCGGCCT TGCGTCGAAC
CACGTCCGGA TAGCCGCCTA G
 
Protein sequence
MTSGPGPGFR QTTIDDLDTP LRDVTFVVVD LETTGGAPGT GAITELAAVK VRGGEVLGEL 
QTLVNPGRPI PPFITVLTGI TDAMVAPAPP IDAVLPTFLE FATGSVLVAH NAPFDVGFLR
AACAAAGYDW PEFEVLDTAR LARRVLTREE APNCTLATLA RVFRAKTTPN HRALADVYAT
IDVLHGLLER LGSFGVGSLA DVRTFSAKVT ATQRQKRHLA ERLPHAPGVY LFRDRQGRVL
YVGKAKDLRQ RVRSYFTAAE TRGRIVDMLR VAEDVTPIVC ATELEAEIRE VRLIAEHKPA
YNRRSKFPER TWWVKLTDER FPRLAVVHTV RNDAATYLGP FNVRESALLA VEALQDAFPI
RRCPDRLGPR TRRPPCAWFE LGRCGAPCTG AQSPDAYRSV VAAVHRAMTS DPSEVVAAAL
RRITPLAAAQ RYEEAVPIRD RLVAYLHAVG RAQRLAALAG CRQIVAARPG PDGAWDVAVI
RHGRLAGAGL IPAGEPEVDR QLTAIVATAS TSLPRGVGIT AYADAEEMEL LLRWLDQPGV
RLLDVEGTWA IPINGGLASN HVRIAA
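
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given on the coding strand, in frame, in 10-base groups as shown in this record. The amino acid assignments are those of the standard code, which translation table 11 shares. The start codon set here is a simplification (only ATG, GTG and TTG), and all names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a reduced set of start codons, enough for this illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An alternative start codon such as GTG is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "GTGACGAGCG GTCCTGGGCC GGGGTTCCGG CAAACCACGA TCGACGATCT CGACACACCG"
print(translate_cds(first_line))  # MTSGPGPGFRQTTIDDLDTP
```

The first 60 bases of the gene give the first 20 residues of the protein shown above. The leading GTG is read as M because it is the start codon.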