Gene Athe_1642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1642 
Symbol 
ID7409472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1739173 
End bp1741536 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content36% 
IMG OID643716011 
ProductMutS2 family protein 
Protein accessionYP_002573509 
Protein GI222529627 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000590377 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACCAAA AGACACTCAA AGCGTTGGAG TATGACAAGA TAGTTGAGAT TTTAAAAAAC 
ATGGCAAAGT CAACTCCAGC AAAGGAATAT TTTGAAAACC TTATTCCATC AACCAATGTT
GCAGATATAG AAAATGAACT TAATAAGGTC GATGAAAGCT ACAGGTACGT TCTAAAGTAT
GGAAATCTAC CAACTTTGGA GTTTGAAAAT ATACTGCCAA GCCTTAAGAA ATCAAAGCTT
GGGGCAACTT TAAATCCACA TGAGATTTTG CAAATTGGAA AAGTGTTGAA ACTTTCTTAT
GAGATGCGAA CTTATCTTTC TTTCACACAA GACTTTAGTT TTCTTGAAAG TATGAAAAAA
AGACTTGTAA ATTTAAAAGA AGTAATCTCA AGAATTGACC AGACATTTTT AACACCAGAT
GAAATCTTAG ATACAGCATC GTCAAAGCTC AAGGAGATAA GAGATAAAAT CAGAAAGCTT
GAAAATAAAA TAAGAGACGA GTTAAATAGC ATGATTCGCG ACCCAAAAAT CCAAAGGTTT
TTACAAGAAC CAATTATAAC AATCAGAGGT GAAAAGCTCT TACTTCCTGT CAAGGCAGAG
TTTAGAAATG AAGTAAAAGG AATTGTCCAT GACCAGTCAG CAACAGGTGC AACTTTATTT
GTTGAGCCTT TTGTTTGTGT TGAGATATCA AATCAAATCA AAATTTTAAA AAATCAAGAA
AAAGAGGAAA TAGAGAGAAT TTTGCAAGAG ATATCCTCTT TGATAGCAAG CTATTGTGAG
GTAATTGAAA CATCTTTTTA TGCACTTGTT GAGCTTGACA TAGTATTTAC GAAAGCTATT
TGGGCAAAAG AAATGAACGC AAGCAAACCA ATTATTAACG CAAGTGGCAT TATAAATCTC
AAAAAAGCTC GGCATCCATT AATACAAAAA GACAAGGTTG TTCCTATTGA TATTCATTTA
GGCAAAGACT TCGACGTTCT TATAATAACA GGTCCAAACA CAGGTGGAAA GACTGTAACA
CTAAAGACGG TTGGACTTTT TTGTCTTCTT TGCCAGAGCG GAATATTCAT TCCAGCAGAT
GAGGGTTCTG AGCTTTGCAT ATTTCAAAAG ATTTTTGCTG ATATTGGAGA TGATCAAAGT
ATAGTCCAGA GTCTTTCCAC ATTTTCAGCA CATATGAAAA ACATCATTGA AATAACAAAA
AATGCAGATG ATAAAACTCT TGTGCTATTA GATGAGATTG GAGCAGGTAC AGACCCTGAA
GAAGGTGCAG CTTTGGCGAA GGCAATCTTG AAATATCTTT CTGAGAAGGG CAGCAAGGTG
ATAGCTACAA CACACTATGG TGAGCTAAAA ATATTTGCTC AGCAAGAAGA TCGGTTTGAA
AACGCTTCTT GTGAGTTTGA TGTAAAAACC CTAAAGCCTA CGTACAGGCT TTTGATAGGA
ATTCCAGGAA GGAGCAATGC ACTTGTAATT TCATCCAATC TTGGGCTTGA CAAAGGTATT
GTTGAGATGG CAAGAGGGTA TCTGTCTCAA AAGACAATTG ATCTTGACAG AATAATAAAC
GAAATGGAAC AAAAGAGAAA AGAAGCTGAA GAAAACCTTG AACTTGCTCA AAAATTGAAG
CATGAAGCAC AAGCTTTAAA AGCGGCGTAT GAAGAGGAGA AGAAAAGGTT TGAGACTGAA
AGAGAGAGAA TTCGCAAAAA GGCCATAAAT GAGGCAAAAG AGATTGTTGA AAGCTCACAG
TATGAAATAG AAAATCTTTT TAAAGACCTT CGAAAACTTG CTGAAAACTT AAAGGAAAAA
GAAGTTTTAA AGGAGTTAGA AGAGAAAAAA AGAGAATATG AAAGGCTGAT TCAAAGTATT
TCACAGCAGG TAAAACAAGA AGCTGAGTCC AAAACCAAAA AAACAATACA GAATCTTCGC
TTAGGTCAAA AGGTATATGT CAGAAGCTTT GATGCTGAGG GGTTTGTCGA AAGCCTGCCA
GACTCAAAGG GAAATCTTAC TGTCCAGATA GGTATTATGA AGATAAATGT CAATCTTTCT
GATATTGAAG AGGTGGAAGG ACAAGATAGC AAAATATATC AAATAGCATC TAGAAATGTA
ATAATCAAAG AAAAGAACAT TGACATGTCC ATTGATGTAA GGGGCAAAAC AAGCGATGAT
GCTATTTTAG AGGTTGACAA GTACTTAGAT GACGCATACA CAGCGGGACT AAAACAGGTT
ACAATAATCC ATGGGAAAGG CACGGGGGTT TTGCGCCAGG CGATAAGGAA TTTTTTGAGA
CGGCATCCGC ATGTAAAATC ATTCAGAGAT GGAACATATG GTGAAGGTGA ACAGGGTGTT
ACGGTGGTTG AGCTGAAAGA CTAA
 
Protein sequence
MNQKTLKALE YDKIVEILKN MAKSTPAKEY FENLIPSTNV ADIENELNKV DESYRYVLKY 
GNLPTLEFEN ILPSLKKSKL GATLNPHEIL QIGKVLKLSY EMRTYLSFTQ DFSFLESMKK
RLVNLKEVIS RIDQTFLTPD EILDTASSKL KEIRDKIRKL ENKIRDELNS MIRDPKIQRF
LQEPIITIRG EKLLLPVKAE FRNEVKGIVH DQSATGATLF VEPFVCVEIS NQIKILKNQE
KEEIERILQE ISSLIASYCE VIETSFYALV ELDIVFTKAI WAKEMNASKP IINASGIINL
KKARHPLIQK DKVVPIDIHL GKDFDVLIIT GPNTGGKTVT LKTVGLFCLL CQSGIFIPAD
EGSELCIFQK IFADIGDDQS IVQSLSTFSA HMKNIIEITK NADDKTLVLL DEIGAGTDPE
EGAALAKAIL KYLSEKGSKV IATTHYGELK IFAQQEDRFE NASCEFDVKT LKPTYRLLIG
IPGRSNALVI SSNLGLDKGI VEMARGYLSQ KTIDLDRIIN EMEQKRKEAE ENLELAQKLK
HEAQALKAAY EEEKKRFETE RERIRKKAIN EAKEIVESSQ YEIENLFKDL RKLAENLKEK
EVLKELEEKK REYERLIQSI SQQVKQEAES KTKKTIQNLR LGQKVYVRSF DAEGFVESLP
DSKGNLTVQI GIMKINVNLS DIEEVEGQDS KIYQIASRNV IIKEKNIDMS IDVRGKTSDD
AILEVDKYLD DAYTAGLKQV TIIHGKGTGV LRQAIRNFLR RHPHVKSFRD GTYGEGEQGV
TVVELKD