Gene Athe_1499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1499 
Symbol 
ID7408158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1581418 
End bp1583187 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content35% 
IMG OID643715862 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_002573370 
Protein GI222529488 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAGC TTTACAAACT TCCTGAACAG TTAACTCACA TCTTGGCAGC GGGTGAGGTT 
GTAGAAAGAC CGGCATCCTG CCTTAAAGAA CTTTTGGAAA ATTCAATAGA TGCAGGAGCA
AATTTAATTG ATGTTAAAAT AGAAAAAGGT GGTATGAAGA GAATTGAGGT ATATGATAAT
GGGAAGGGAA TTCACCCTGA TGACATTGAA TATGTGTTTG AAAGACATAC AACCAGCAAG
ATAAAATCTT TTGAGGATAT ATTTAGCATC AAAACAATGG GATTTAGAGG GGAAGCGCTC
TGTGCAATAT CAAGCGTATC AAAGGTGACA CTTGTTTCTA AGCATTTAGA AGAAGAACAA
GGGTGCATGG TAAAAGTAGA AGGTGGGAAA GTCCTTTCTA AAAGTTTTTG TCCTTTTAAA
GAGGGGACAA GAATTGTTGT TGAAGATATT TTTTACAATA CTCCTGCAAG GTTAAAATTT
TTAAAATCTC CAACAACTGA ACAAAAGTAT TGTCTTGAGG TGGTAGAAAA GATTGCAATT
GCATGGCCGG AGATTTCATT TCGGGCAGAG GCAGATGGCA AAAGACAAAT TTTTACACCA
GGAGATAATA AGATTGAATC TGCCATTGGT TCTATATTTG GGATAGAAAT AGTAAAAAAT
CTTGTTGAAT TTTCTCTTGA GAAAGAATCT TTAAAAGTTT GGGGTTATTT TGTAAACCCC
ACTGTGAGCA GAGCTACACG CTCAGGTTAT CATTTTTATG TCAACCGAAG ATATATCAAA
AGCAAACTTC TTTCATCGTG CATTGATGAG GCATTTAAGA ATTCGGTCAT CACAGGTAGA
TTTCCAATAG TTTTTCTTTT TATACAAATT CCGCCTTCTG AGATTGATGT CAATGTGCAT
CCATCAAAAC TCGAAATAAA GTTCAGAGAT GAAAGATTTG TTTACAATAC CATTTATAAA
GCTATAACAG ATTCGTTGAA ATCGGAAAAA ATGATTCCTA AAGCTGATTT AAGTAAAGCT
AATGTTGGAA ATGATGCTGT TGCTGAGCGA AAACAAACTG GAGTTTTATC TGATAACTTA
AAAAATGATA TATCTTTAGT TATCTCAGAG CAGCCAAATT TCTTTGGAAT GTTTTCAAGG
AGTGAAGAGA TTGTAATTGA GCAACAGGGC TTTGAAAACT TTGATGCAGG AAACTACAAG
ATTGTTGGTT ACGCTTTTGA TACCTATATA ATTGTGCAAG GCGATGACAG CTTATACCTT
ATTGACCAGC ACGCGGTGCA CGAAAGAAGA TTATTTGAAG ATTTTAAAAG CCAAATTTAT
TCTTCAAATG TTCAAAGCCA AGTGTTGGTT TCTCCTGTTA TTGTTCAGAT TCCATCTTCA
CGAAAAGAGT TTGTGATTTC AAACCGAGCT ATCTTTCAAA AGATGGGTTT TGAAATTGAG
GATTTTGGGA AAAATGAAAT ATTAGTGAGG ACATGGCCTG CTATACTGAC TGAGAACATC
GAAAAAATGT TTTTAATTGA CATAATAGAG ATGATATACG AACAAATGGT TGAAGATAAG
AGTCTTGTAG GAATTTCTGA GGACCTGCTA AAAAGAATTG CTTGCAGAGC AGCAGTAAAA
GGAAATAGTA AAATTTCAGA CTTAGAAAAA AAAGAAATAG TTGAACTTGT GCTAATCAAA
AAAGAAATTT TTCACTGTCC GCATGGAAGA CCAGTGGTAG TAGAGATTTC TAAGAGAGAA
ATTGAAAAAA TGTTCAAAAG AATTGTATAA
 
Protein sequence
MRELYKLPEQ LTHILAAGEV VERPASCLKE LLENSIDAGA NLIDVKIEKG GMKRIEVYDN 
GKGIHPDDIE YVFERHTTSK IKSFEDIFSI KTMGFRGEAL CAISSVSKVT LVSKHLEEEQ
GCMVKVEGGK VLSKSFCPFK EGTRIVVEDI FYNTPARLKF LKSPTTEQKY CLEVVEKIAI
AWPEISFRAE ADGKRQIFTP GDNKIESAIG SIFGIEIVKN LVEFSLEKES LKVWGYFVNP
TVSRATRSGY HFYVNRRYIK SKLLSSCIDE AFKNSVITGR FPIVFLFIQI PPSEIDVNVH
PSKLEIKFRD ERFVYNTIYK AITDSLKSEK MIPKADLSKA NVGNDAVAER KQTGVLSDNL
KNDISLVISE QPNFFGMFSR SEEIVIEQQG FENFDAGNYK IVGYAFDTYI IVQGDDSLYL
IDQHAVHERR LFEDFKSQIY SSNVQSQVLV SPVIVQIPSS RKEFVISNRA IFQKMGFEIE
DFGKNEILVR TWPAILTENI EKMFLIDIIE MIYEQMVEDK SLVGISEDLL KRIACRAAVK
GNSKISDLEK KEIVELVLIK KEIFHCPHGR PVVVEISKRE IEKMFKRIV