Gene Anae109_2088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2088 
SymbolmutL 
ID5374889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2366777 
End bp2368582 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content76% 
IMG OID640843601 
ProductDNA mismatch repair protein 
Protein accessionYP_001379275 
Protein GI153004950 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.02775 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCGCA TCCAGGTCCT GCCCCCCGGG CTCGTGAACC AGATCGCCGC GGGCGAGGTG 
GTCGAGCGCC CCGCCTCCGT CGTGAAGGAG CTCGTCGAGA ACGCGCTCGA CGCGGGCGCC
ACGTCCGTGT CGATCGACGT GGAGGAGGGC GGGCTCGCGC TCGTGCGGGT CGCGGACGAC
GGCTGCGGGA TGAGCGCCGA CGACGCCCAG CTCGCCCTGG AGCGCCACGC GACGTCGAAG
CTGCGCGACG CCGAGGGGCT CGCGGCCATC GCCACAATGG GCTTCCGGGG GGAGGCGCTC
CCCGCGATCG CCTCGGTGGC GCGCTTCCGG CTCGACACGG CGCCGGCCGA GGACGGGGCG
GGCACGCGGG TGGAGGTCGA GGGCGGCGGG CGGCCGTCGT CCGGGCCCGT GGCGCGCCCG
CGCGGTACGA CCATCGAGGT CCGCGACCTG TTCTTCAACA CGCCGGCCAG GCGCAAGTTC
ATGCGCGCCG CGGCGACGGA GTCGGGGCAC GTGACGGAGG CGGTCGTCCG GCTCGCGCTC
GCGCGGCCCG ACGTGGGGTT CACGCTGCGG TCCGCGGGGC GCCTCGTCCT CGGTTCGCGG
GCCGGCGCCG CGGCCGCAGA CCGCGCGGCG CAGGCGCTCG GGCGCGACGC GCACCGGCAC
CTCGTCCCGG TCGACGCCGG GCGCGGGAAC GTCCGCGTGC GCGGCCTCGT CTGCTCTCCC
GATCATTCGG AGGCGACGGG GCGTGCCCTC TACCTGTTCG TGAACGGCCG CTACGTTCGG
GACCGCGGGG CCGCGCACGC GGTGCTCCGC GCGTTCGCCG GGACGCTCCC GCCCGGACGG
CACCCGGCGG GCGTGCTGTT CGTGGAGCTG CCGCTCGATC GGGTGGACGT GAACGTCCAC
CCGCAGAAGC TGGAGGTGCG CTTCGCCGAG GCCCGCGAGG TGTACGACGC GCTCTTCCAC
GCGATCGCGG GGACGCTCCG CACGGCGCCC TGGCTGGCGC ACGGCCGCGC CGGGGGGAGT
GCCCCGCCGG GTCCGGTCGC CCTGACGCCG CCGGGCGGCG CGGGGAGCGA CGAGACCGCC
GCAGTGCTCG CCTGGGCGCG CGAGGCCCAC GCCCCCGAGG GCAGCGGCGC GCTCGTTCCG
CCTCCGTCGC CCGTGCCCGG CGCGAGCGGC ACGTTCGCGT TCGCGATCCC GGACGAGGCG
GGGCTGGCGC GGCCCGCCGG GTACTTCGCC TCCCTGCGCT ACGTCGGGCA GCACGCGCGC
ACGTATCTGC TGTGCGAGGC GCAGGGAGGG ACGCTCGTCG TGATCGACCA GCACGCGAGC
CATGAGCGGC TGCTGTTCCA GCGCCTGCGC GAGGTTTTCC GTACGCGCAA GCTCCCCGTG
CAGCCGTTCC TCCTCCCGCA GGTCGTGACC CTGCCGCCGG CCGTGGCGCG CGCGCTGGAG
GGCGGCCTGC CGGAGCTCGC CCGCCTCGGG TTCGACGTGG AGCCGTTCGG CGGCGACAGC
TTCGCCGTGA AGGGCGCGCC CGCGGCGCTC GCCGGAGTGG ATCTCGAGGC GCTGCTGCTC
GACCTGTCAG CTCAGCTCGA GCTGGTGGGC AGCGGCACGG CGGTCGACGA GGCGCTGCAC
GATCTGCTCG CCACGATGGC CTGTCACGCG GCGGTGCGGG CGAACCAGGA GGTCGCGCCA
GAGGAGGCCC GCGCGCTGCT CGACGGCCTC GACGCGATCG ACTTCAAGGC GCGCTGCCCG
CACGGCCGCC CCGTCGTCTT CGAGCTGCCG CTGGCGGAGC TCGAGCGGCG GGTAGGGCGT
CGATGA
 
Protein sequence
MPRIQVLPPG LVNQIAAGEV VERPASVVKE LVENALDAGA TSVSIDVEEG GLALVRVADD 
GCGMSADDAQ LALERHATSK LRDAEGLAAI ATMGFRGEAL PAIASVARFR LDTAPAEDGA
GTRVEVEGGG RPSSGPVARP RGTTIEVRDL FFNTPARRKF MRAAATESGH VTEAVVRLAL
ARPDVGFTLR SAGRLVLGSR AGAAAADRAA QALGRDAHRH LVPVDAGRGN VRVRGLVCSP
DHSEATGRAL YLFVNGRYVR DRGAAHAVLR AFAGTLPPGR HPAGVLFVEL PLDRVDVNVH
PQKLEVRFAE AREVYDALFH AIAGTLRTAP WLAHGRAGGS APPGPVALTP PGGAGSDETA
AVLAWAREAH APEGSGALVP PPSPVPGASG TFAFAIPDEA GLARPAGYFA SLRYVGQHAR
TYLLCEAQGG TLVVIDQHAS HERLLFQRLR EVFRTRKLPV QPFLLPQVVT LPPAVARALE
GGLPELARLG FDVEPFGGDS FAVKGAPAAL AGVDLEALLL DLSAQLELVG SGTAVDEALH
DLLATMACHA AVRANQEVAP EEARALLDGL DAIDFKARCP HGRPVVFELP LAELERRVGR
R