Gene Athe_1500 details


Gene Information

Locus tag: Athe_1500
Symbol:
ID: 7408159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1583184
End bp: 1585775
Gene Length: 2592 bp
Protein Length: 863 aa
Translation table: 11
GC content: 35%
IMG OID: 643715863
Product: DNA mismatch repair protein MutS
Protein accession: YP_002573371
Protein GI: 222529489
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
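
The start and end coordinates, gene length, and protein length in this record agree with one another, and a short sketch can confirm that. It assumes that the coordinates are 1-based and inclusive and that the gene length counts a single terminal stop codon. Both are assumptions about this listing's conventions, not something the record states. The constant and function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1583184
END_BP = 1585775
GENE_LENGTH_BP = 2592
PROTEIN_LENGTH_AA = 863


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2592
print(expected_protein_length(GENE_LENGTH_BP))  # 863
```

Under these assumptions the span length matches the listed gene length (2592 bp), and 2592 / 3 = 864 codons less one stop codon gives the listed 863 aa.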


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAGT TAACTCCTAT GATGCAGCAA TATATGGAAA TAAAACAGAA GGTGAAAGAT 
TGTATTTTAT TTTTTAGGCT TGGCGATTTT TATGAGATGT TTTTTGAAGA TGCGATTGTG
GCGTCTAAAG AGCTTGAGAT AGCTTTAACC AGCAGAGATT GTGGAAACAA TGAAAAAGCT
CCTATGTGCG GTGTGCCGTA CCATTCTGCG ACCAGTTACA TTGCAAAGCT TATCGAAAAA
GGTTACAAGG TTGCTATCTG CGAACAGGTG GAAGACCCAA AGCTTGCAAA GGGAATTGTT
AAAAGGGAAA TTACAAGAAT AATAACTCCA GGCACATTTA TTGACGATAA TATTTCGACA
GCCAATAATT TCATATGTTG TATATCAAAA GACAGATCTG AATTTGCCCT GACATTTGTA
GATGTTTCAA CTGGTGAGAT GTACTCTTGC CTTCTTGAAG AGGACCTTCA GAAACTGGTA
AATGAAATTG GCAAATACAG TCCCAGTGAA ATTTTAATTT CAAATATAGA AGATGAGCTT
TATGAATTTT TGAAGAAAAA CTGCACTTCT TTTGTGCAGA TGATAGAGTT TGTGGATTTA
CAAAAGTGCC ATGAGATTAT TGAAAATCAG ATAAATGTTG GCAAAATAGA TGAGAAGCTG
ATTTTGAGCG TAGGAAATTT GCTAAAATAC TTAACAGAGA CCCAAAAAAT TTCCTTTGAT
TATATAAGGA GGTTTGAATT TTACAGAGTC CAAAACTATC TTCAAATTGA CATAAACACA
AAACGAAATT TGGAGCTCAC AGAGAGTATT ATTCAGCGCT CCAGAAAGAA TAGCCTTCTT
GGTATTTTGG ATCAAACGAA GACTTCAATG GGTTCAAGGC TATTGAAAAA ATGGATTGAA
AGACCTCTTA TTGACATTAT TGAGATTAAT AAAAGGCTTG ATAGTGTTGA GGAGCTCAAA
TCAAACTATT CCACTTTAGT GCAGGTAGAA GAACTTTTGA GCAGAATGTA TGATATAGAA
AGACTTTCCT CAAAGTTTGC ATATAAGAAT GTGAATGCTA AAGATTTGCT GAGTCTAAAA
AAGTCTATTG AAGTGTTGCC AACTTTAAAA CAATTTCTTT CTTCATTTGA TTCAGAATTA
TTAAAAGAGA TTTATGAAGG TCTTGATACA TTAGAAGATA TATATGCGCT TATTGACAGT
TCTATAAATG AGGATGCACC TGTGTCCCTA AAAGAGGGTG GAATAATTAA AGAGGGTTTT
AACGAAGAAG TAGATAGGCT GAGAAATATA TCTAAAAATA GCAAAGAGCT TTTAGTTGAG
TATGAAGAAA AAGAGAGGAA CCTCACAGGT ATAAAAAATC TCAGAATTGG TTATAACAAG
GTTTTTGGAT ACTATATTGA GGTGACCAAG TCAAACTACT CTCTTGTTCC TGACAGATAC
ATTCGGAAAC AAACTCTTGC AAATGCAGAG AGGTATATAA CAGAGGAACT CAAAAAATTG
GAAGATGAAA TATTGGGTGC TGATCAAAAA CTTATCGAAC TTGAATACCA GCTTTTTTGC
GAAATAAGGG ATAGGATTGA GGCTCAGATT GAAAGAATTC AAAAGACAGC AAGCAATATT
GCCAACTTGG ATGTTCTGTG TTCATTTGCC CGTATTGCAA TTGACAATGA GTATGTCAGG
CCAAATGTTT ACTTAGGGGA TAGAATATAT ATTAAGAACG GTAGACACCC AGTTGTTGAA
AAGATGATAG GCAGGGGCAA TTTCATTCCA AATGACACTG AACTTGATCA GGCAGAAAAC
AGAGTTTTGA TTATAACTGG TCCAAATATG GCTGGTAAGT CTACATACAT GAGACAGGTA
GCCTTAATTG TCATAATGGC ACAGATGGGA TGTTTTGTAC CTGCTGATGA GGCACACATT
GGTGTAGTTG ATAAAATCTT TTCACGGATA GGGGCATCTG ATGATATTTC ATCTGGGCAG
AGTACCTTCA TGGTGGAGAT GTCAGAAGTT GCGAACATAT TGAAAAATGC AACGCCAAAA
AGCCTTATAA TTTTTGACGA GGTTGGAAGA GGAACAAGCA CATATGATGG ACTTTCCATA
GCATGGGCAG TTTTAGAGTA TGTTGCTGAT AAATCTAAAA TTGGTGCAAA AACCCTTTTT
GCAACTCATT ACCATGAGTT AACAGAGCTT GAAGAGAGGA TTCCAGGTGT AAAAAACTAT
AGGGTTGATG TCAAGGAGGA GGGCAAAAAC ATTATATTTT TAAGGAAAAT TGTTAGAGGT
GGATGTGACT CAAGTTATGG GATTCATGTT GCACGGCTTG CTGGAATTCC AGAAGAGGTA
TTAAAAAGGG CTGAGGAAAT TCTAAAACAG CTTGAAGAAG CTGATATAAA TAGGAAAAAT
ATCAGAAAAC TCAGAAGAGA AATCAAAAAG GAGTTTACTG AGCAGATAGA TTTTTTTTCC
TATAAAAAAG AAGAGATAAT AGACAAAATT GAGAAACTTG ATATTTTAAA TATAACTCCT
ATCCAGGCTT TAAACATTTT AAGTGAGCTC AAACATGAAA TAATTAAAGC CAAAGAGAGG
CAATTGATAT GA
 
Protein sequence
MQELTPMMQQ YMEIKQKVKD CILFFRLGDF YEMFFEDAIV ASKELEIALT SRDCGNNEKA 
PMCGVPYHSA TSYIAKLIEK GYKVAICEQV EDPKLAKGIV KREITRIITP GTFIDDNIST
ANNFICCISK DRSEFALTFV DVSTGEMYSC LLEEDLQKLV NEIGKYSPSE ILISNIEDEL
YEFLKKNCTS FVQMIEFVDL QKCHEIIENQ INVGKIDEKL ILSVGNLLKY LTETQKISFD
YIRRFEFYRV QNYLQIDINT KRNLELTESI IQRSRKNSLL GILDQTKTSM GSRLLKKWIE
RPLIDIIEIN KRLDSVEELK SNYSTLVQVE ELLSRMYDIE RLSSKFAYKN VNAKDLLSLK
KSIEVLPTLK QFLSSFDSEL LKEIYEGLDT LEDIYALIDS SINEDAPVSL KEGGIIKEGF
NEEVDRLRNI SKNSKELLVE YEEKERNLTG IKNLRIGYNK VFGYYIEVTK SNYSLVPDRY
IRKQTLANAE RYITEELKKL EDEILGADQK LIELEYQLFC EIRDRIEAQI ERIQKTASNI
ANLDVLCSFA RIAIDNEYVR PNVYLGDRIY IKNGRHPVVE KMIGRGNFIP NDTELDQAEN
RVLIITGPNM AGKSTYMRQV ALIVIMAQMG CFVPADEAHI GVVDKIFSRI GASDDISSGQ
STFMVEMSEV ANILKNATPK SLIIFDEVGR GTSTYDGLSI AWAVLEYVAD KSKIGAKTLF
ATHYHELTEL EERIPGVKNY RVDVKEEGKN IIFLRKIVRG GCDSSYGIHV ARLAGIPEEV
LKRAEEILKQ LEEADINRKN IRKLRREIKK EFTEQIDFFS YKKEEIIDKI EKLDILNITP
IQALNILSEL KHEIIKAKER QLI
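
The sequences in this record are printed in space-separated blocks of ten characters. As a minimal sketch, the code below joins those blocks, translates a nucleotide string with the amino-acid assignments of translation table 11, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and come from no particular library. The translation assumes an unambiguous A/C/G/T sequence read in frame from its first base, and it ignores table 11's alternative start codons.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, in the conventional TCAG codon order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence in this record.
print(translate(clean("ATGCAAGAGT TAACTCCTAT GATGCAGCAA")))  # MQELTPMMQQ
print(translate(clean("CAATTGATAT GA")))                     # QLI
```

The two fragments reproduce the first ten and the last three residues of the protein sequence shown here. Passing the full gene sequence to `gc_fraction` would be one way to compare against the listed GC content.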