Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0734 |
Symbol | |
ID | 4486482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 805917 |
End bp | 807467 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639729504 |
Product | DNA mismatch repair protein MutS domain-containing protein |
Protein accession | YP_872493 |
Protein GI | 117927942 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.013522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTGC TCAAGGAGCG GGCCGCAAAA CCGAGTGTTC TCTTCGTCGA CCCGACGGCG TTGGCGGCGG TCGACGAGCA GCCCGAGTTC TTCAAGGACC TCCATCTCGA CGAGATCCTG GACGTGATTC TCGCTGGATA CGAAGAATAT GGGCTGCGGC CGTTGTTCTA CCAACCGTTA GGTGACGTCG ACGCCGTCCG GTATCGGCAC GAGGTCTTCC GGGATCTTCA GGATGCGGGC ATCCGCGAAG CGGTTGACGC GTTCACCGCT GCTCTGCGCG ACATGCGCAA CGCCTTCGCG ATGAGCGAAA AACTCTCTTA CCCGCAGCAG AAACAACGGT GGTTCCTCGA CGCTGCTTTG ATCTACTGTA ACGCCGTTGC CTCATTGGCG CGTCGATTCT CCCGGATGCC TCTCGGCTCA CGAGGACTTC AGGCACTTGC CGATTACCTT CAGGGTTACG TCGCATCGCC GGCGTTTACC CGTCTTGCCC GCGACGCCCG GTCCGTGAGC GATGCGCTGG CCGGCGTTCG GTATGCGGTG CATATCAAGG GGAGCCAGGT GCGCGTGCAG CGCTATGCCG GGGAGCCGGA GTACAGCGCC GAGGTGACCG AGACGTTCGC CCGATTCCAG CAGCGGACCG GCGCCGATTA TCGGGTCACC TTCAGCGAAT CCGCATACAT GGATCACGTC GAGGCGCGCA TCGCGGATCT CGTGGCTGCG CTGTACCCGG CTGAGTTCAC CGCCCTTGTG ACGTTCACCG AGGTGCACGC GGATTTCCTG GATCCGGTGG TGGCGGCCTT TGACCGCGAC ATTCACTTTT ACCTGGCGTA TCTGCGGCAT GTGGAGCGGC TAACGCGCGG GGGACTGCCG TTCTGCTATC CAGAACTGTC GACGACGTCG AAGGAAACCG TGGTCCGCGG CGGTTACGAT CTCGCTCTTG CCATGACGCG TGACGGAGAG CCCGGCGCCA TCGTGGGCAA CGATTTCTCG CTTGACGGCG GGGAGCGCGT CATCGTGGTG ACCGGACCCA ACAGCGGAGG GAAGACGACG TTCGCCCGCA TGTTCGGACA GGTGCACTAC CTTGCGAGTC TCGGTCTGCC GGTGCCGGCG CGCGCGGCTC GGCTGTTCCT TCCCGATCGG GTCTTTACGC ACTTCGAACG GGAGGAGCAA CTTGCGACTC TGCGCGGCAA GCTGGACGAC GAACTTGTCC GGCTGCGGGA CATCCTGGCG CATGCCACCG CCAGGAGCGT CATCGTGATG AATGAGAGCT TCTCTTCGAC GGCGTTGCGG GATGCGCGTT ACCTCGGCGC CGAAATTCTG CGCCGTGTCC TCGGGCTGGA CGCCATCGGC GTTTACGTGA CCTTCGTCGA CGAGCTTGCC TCGATCGACG GGAGGATCGT GAGCATGGTC GGGATCGTCG ATCCCCGCGA CCCGGCCCGG CGAACGTACC GGTTCGAGCG CCGTCCCGCA GACGGCCGGG CGTACGCCCT CGCTGTTGCG GCGAAATACG GTCTGACATA CGAGCAGCTT CGCCGGCGGG TGGCGTCGTG A
|
Protein sequence | MTVLKERAAK PSVLFVDPTA LAAVDEQPEF FKDLHLDEIL DVILAGYEEY GLRPLFYQPL GDVDAVRYRH EVFRDLQDAG IREAVDAFTA ALRDMRNAFA MSEKLSYPQQ KQRWFLDAAL IYCNAVASLA RRFSRMPLGS RGLQALADYL QGYVASPAFT RLARDARSVS DALAGVRYAV HIKGSQVRVQ RYAGEPEYSA EVTETFARFQ QRTGADYRVT FSESAYMDHV EARIADLVAA LYPAEFTALV TFTEVHADFL DPVVAAFDRD IHFYLAYLRH VERLTRGGLP FCYPELSTTS KETVVRGGYD LALAMTRDGE PGAIVGNDFS LDGGERVIVV TGPNSGGKTT FARMFGQVHY LASLGLPVPA RAARLFLPDR VFTHFEREEQ LATLRGKLDD ELVRLRDILA HATARSVIVM NESFSSTALR DARYLGAEIL RRVLGLDAIG VYVTFVDELA SIDGRIVSMV GIVDPRDPAR RTYRFERRPA DGRAYALAVA AKYGLTYEQL RRRVAS
|
| |