Gene Acel_0735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0735 
Symbol 
ID4486483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp807464 
End bp808954 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content65% 
IMG OID639729505 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_872494 
Protein GI117927943 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.19078 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGTCT TTCTCATGCA CCCGGAGCAG GATTTCGATC CGCAGGCTCA GTTGCCGGAG 
CATGCCGACG ACCTGATCCG TGACCTGGAA CTGGAGACCC TCCTGGCGGC TATGGCGGGC
GGTGACGATT ATCTGCTGAC TGTCGCCCGC GCGGGTGTCC TGCATCCTCT CGCCGATCCC
GCCGTGATCC GCTATCGTCA GGAAATTCTC GCCGACTGCC TGCGCCATCC CGACGCCCTC
CGGGCCTTGT ATCACCTTGC CGCCGAGGCC GTCGCCGCGG AAAAGAAGAT CTGGCGGACC
TTCTGGAAGT CGCCGGACAT CATCCTGCAT CGAGCCCTCG AGGTGATGGG TGTTTTCCTC
GGCTATATTC GGCGGTTGCG CACACTCACC GATACGTACG CCGAGATGTT CTCCGCCCCG
GGTCTTCAGC GGTTCATGCG TATGGTCGGC GAGGAACTCT CCGACGAGTA CCTCCAGCGG
GTCGAGAATC ATCTGGCGGA GCTCTCTTTC CGGCGCGGCG TCCTGCTCAG CGCCCGTCTC
GGCCGGGGAA ATAAAGGGGA GGACTACACG CTGCATCGGT ACACGGAGCG CAGCCTTCTG
CAGAGGATTG CTTCGGGCGA GCGCCGCGCT GCGGGTTACA CCTTCCGCAT TCCGGACCGC
GACGAAGCGG GTCATCGGGC GCTGTCGGAA TTGCGCGGCC GGGGACTGAA CGTGGTTGCG
GACGCGCTTG CCCGGTCCGC CGACCATGTG CTCTCGTTCT TTGCGGCTCT CCGCGCCGAG
CTGGGTTTTT ACCTCGGCTG CGTGCAATTG GCGGAACGGC TTGCGTCGCT CGGCTGCCGG
TGGTCGTTTC CCGATGTGCA CCAGCCGTCG GCCCGCCGTT TCCACGCGCG ACATCTGTAC
GACGTCTGCC TCGCCTTGAC GATCGGCGGG CCGGTGGTCG GCAATGACGT CGACGCGGAC
GGGAAACTTC TCGTTCTGGT GACGGGACCC AATCAGGGCG GGAAGTCGAC GTTCCTGCGC
AGTGTCGGAC TGGCCCAGTT GCTCATGCAG GCTGGGATGT TCGTTCCCGC CGACGCGCTG
GACGCGAGCG TCGCCGGGGG AATTTTCACG CATTTCAAAC GCGAGGAGGA TCCGGCTCTG
CGTGGCGGAA AGCTCGAGGA GGAGCTCGCC AGGATGAGTG CGATCGCCGA CCGCGTGCAC
ACCGGCTCGG TCGTCTTGTG CAATGAGTCG TTCAGTGCCA CCAATGAACG CGAAGGGTCG
GAGATCGCCC AACAGGTCAT CGATGCATTC ATGGACTGCG GCGTCCGCGT CTTCTTCGTG
ACGCATTTGT ACGACCTGGC GCGCCGGTAC AGCGAGCGGC AGGACCGGCG GGTGCTGTTC
TTGCGGGCGG AGCGACTGCC CGACGGACGG CGCACCTTCC GGATGATCGT CGGTGCACCG
GAGCCGACGA GTCACGCGGC GGACTCCTAC CGCCGGATTT TCGGCCTCTG A
 
Protein sequence
MKVFLMHPEQ DFDPQAQLPE HADDLIRDLE LETLLAAMAG GDDYLLTVAR AGVLHPLADP 
AVIRYRQEIL ADCLRHPDAL RALYHLAAEA VAAEKKIWRT FWKSPDIILH RALEVMGVFL
GYIRRLRTLT DTYAEMFSAP GLQRFMRMVG EELSDEYLQR VENHLAELSF RRGVLLSARL
GRGNKGEDYT LHRYTERSLL QRIASGERRA AGYTFRIPDR DEAGHRALSE LRGRGLNVVA
DALARSADHV LSFFAALRAE LGFYLGCVQL AERLASLGCR WSFPDVHQPS ARRFHARHLY
DVCLALTIGG PVVGNDVDAD GKLLVLVTGP NQGGKSTFLR SVGLAQLLMQ AGMFVPADAL
DASVAGGIFT HFKREEDPAL RGGKLEEELA RMSAIADRVH TGSVVLCNES FSATNEREGS
EIAQQVIDAF MDCGVRVFFV THLYDLARRY SERQDRRVLF LRAERLPDGR RTFRMIVGAP
EPTSHAADSY RRIFGL