Gene Acel_0734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0734 
Symbol 
ID4486482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp805917 
End bp807467 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content64% 
IMG OID639729504 
ProductDNA mismatch repair protein MutS domain-containing protein 
Protein accessionYP_872493 
Protein GI117927942 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.013522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTGC TCAAGGAGCG GGCCGCAAAA CCGAGTGTTC TCTTCGTCGA CCCGACGGCG 
TTGGCGGCGG TCGACGAGCA GCCCGAGTTC TTCAAGGACC TCCATCTCGA CGAGATCCTG
GACGTGATTC TCGCTGGATA CGAAGAATAT GGGCTGCGGC CGTTGTTCTA CCAACCGTTA
GGTGACGTCG ACGCCGTCCG GTATCGGCAC GAGGTCTTCC GGGATCTTCA GGATGCGGGC
ATCCGCGAAG CGGTTGACGC GTTCACCGCT GCTCTGCGCG ACATGCGCAA CGCCTTCGCG
ATGAGCGAAA AACTCTCTTA CCCGCAGCAG AAACAACGGT GGTTCCTCGA CGCTGCTTTG
ATCTACTGTA ACGCCGTTGC CTCATTGGCG CGTCGATTCT CCCGGATGCC TCTCGGCTCA
CGAGGACTTC AGGCACTTGC CGATTACCTT CAGGGTTACG TCGCATCGCC GGCGTTTACC
CGTCTTGCCC GCGACGCCCG GTCCGTGAGC GATGCGCTGG CCGGCGTTCG GTATGCGGTG
CATATCAAGG GGAGCCAGGT GCGCGTGCAG CGCTATGCCG GGGAGCCGGA GTACAGCGCC
GAGGTGACCG AGACGTTCGC CCGATTCCAG CAGCGGACCG GCGCCGATTA TCGGGTCACC
TTCAGCGAAT CCGCATACAT GGATCACGTC GAGGCGCGCA TCGCGGATCT CGTGGCTGCG
CTGTACCCGG CTGAGTTCAC CGCCCTTGTG ACGTTCACCG AGGTGCACGC GGATTTCCTG
GATCCGGTGG TGGCGGCCTT TGACCGCGAC ATTCACTTTT ACCTGGCGTA TCTGCGGCAT
GTGGAGCGGC TAACGCGCGG GGGACTGCCG TTCTGCTATC CAGAACTGTC GACGACGTCG
AAGGAAACCG TGGTCCGCGG CGGTTACGAT CTCGCTCTTG CCATGACGCG TGACGGAGAG
CCCGGCGCCA TCGTGGGCAA CGATTTCTCG CTTGACGGCG GGGAGCGCGT CATCGTGGTG
ACCGGACCCA ACAGCGGAGG GAAGACGACG TTCGCCCGCA TGTTCGGACA GGTGCACTAC
CTTGCGAGTC TCGGTCTGCC GGTGCCGGCG CGCGCGGCTC GGCTGTTCCT TCCCGATCGG
GTCTTTACGC ACTTCGAACG GGAGGAGCAA CTTGCGACTC TGCGCGGCAA GCTGGACGAC
GAACTTGTCC GGCTGCGGGA CATCCTGGCG CATGCCACCG CCAGGAGCGT CATCGTGATG
AATGAGAGCT TCTCTTCGAC GGCGTTGCGG GATGCGCGTT ACCTCGGCGC CGAAATTCTG
CGCCGTGTCC TCGGGCTGGA CGCCATCGGC GTTTACGTGA CCTTCGTCGA CGAGCTTGCC
TCGATCGACG GGAGGATCGT GAGCATGGTC GGGATCGTCG ATCCCCGCGA CCCGGCCCGG
CGAACGTACC GGTTCGAGCG CCGTCCCGCA GACGGCCGGG CGTACGCCCT CGCTGTTGCG
GCGAAATACG GTCTGACATA CGAGCAGCTT CGCCGGCGGG TGGCGTCGTG A
 
Protein sequence
MTVLKERAAK PSVLFVDPTA LAAVDEQPEF FKDLHLDEIL DVILAGYEEY GLRPLFYQPL 
GDVDAVRYRH EVFRDLQDAG IREAVDAFTA ALRDMRNAFA MSEKLSYPQQ KQRWFLDAAL
IYCNAVASLA RRFSRMPLGS RGLQALADYL QGYVASPAFT RLARDARSVS DALAGVRYAV
HIKGSQVRVQ RYAGEPEYSA EVTETFARFQ QRTGADYRVT FSESAYMDHV EARIADLVAA
LYPAEFTALV TFTEVHADFL DPVVAAFDRD IHFYLAYLRH VERLTRGGLP FCYPELSTTS
KETVVRGGYD LALAMTRDGE PGAIVGNDFS LDGGERVIVV TGPNSGGKTT FARMFGQVHY
LASLGLPVPA RAARLFLPDR VFTHFEREEQ LATLRGKLDD ELVRLRDILA HATARSVIVM
NESFSSTALR DARYLGAEIL RRVLGLDAIG VYVTFVDELA SIDGRIVSMV GIVDPRDPAR
RTYRFERRPA DGRAYALAVA AKYGLTYEQL RRRVAS