Gene Anae109_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_2107 
Symbol 
ID5376312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp2389603 
End bp2392251 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content75% 
IMG OID640843620 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001379294 
Protein GI153004969 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0317076 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAGA TCGCCCAGCC CACGCCGATG ATGCGGCAGT ACCTCGAGAC GAAGGCGAGG 
TACCCCGACG CGCTGCTGTT CTTCCGGCTG GGCGACTTCT ACGAGCTCTT CTTCGAGGAC
GCGCTCACCG CGTCCGAGGC GCTGCAGATC ACGCTGACGG CGCGCGCCAA GGGCGACGAC
AAGGTGCCGA TGTGCGGCGT GCCCCACCAC GCCGCCCGCG GCTACGTCGC CCGCCTGCTC
GAGAAGGGGT TCAAGGTCGC CATCTGCGAT CAGGTCGAGG AGCCCGGGAA GTCGGCGATC
GTGAAGCGCG AGGTCACGCG CGTCGTGACC CCGGGCATGG TGTTCGACGA CCAAGTGCTC
GATCCGCGGG AGGCGAGCTA CCTCGGCGTG GTCGCGCTGG CGGAGGGCCG CGCGGGGCTC
GCGCTCCTCG ACGCCTCCAC CGGCCAGCTC CAGTGCGGCG AGGTGCCCGA CGACGCCCGC
GCCGTCGACG AGCTGCGGCG CGCCGGGGTG CGCGAGCTCG TGCTGCCGCT CGGCGCGGAC
GCCGCGCGCG CGGAGCGGAT CGAGCGGGCG GTGGGCGTCC CCGCCGCGCG GCGGCCGGCC
GCGGACTACG AGCGCGCCGA CGATCGGCTC CGCCGCCACC TGGGCGTGGC GAGCCTCGAC
GGCTTCGGGG TGGGCGGGGA GCCGCTCGGG CTCGCCGCCG CGGCCGCCGC GCTCGCCTAC
CTCGCGGACA CCCAGCGCGC GACGCCGCGC CACGTGGATC GCGTCTCGCG CCTGCGGACG
GAGGACGTGC TCCTCCTCGA CGAGGCGACC CGTACGAACC TCGAGCTCGA GCGGACGCTC
AACGGGGGCC GCAAGAAGGG TTCGCTCCTC GCGCTCCTCG ATCGGAGCGT CACCGCGCCC
GGCGGGCGGC GCCTGGCGGA GTGGCTGCGC TACCCGCTCA CGGAGCTCGC CCCGATCCAC
GCCCGGCTCG ACGCGGTCGA GGAGCTCGCC GGCGCCTCCG TCGCGCGCGA GGACCTCGCG
GCGGCGCTTC GACCGGTCGC CGACGCGGAG CGGCTCCTGT CGCGCCTCGT GCTCGGCCAG
GGGAACGCGC GCGACCTGCG CGCGCTCGCG GGCGCGCTGC TGGCGCTCCC CGCGCTCGCC
GAGCTGCTCG GCGGCCGCGC CGCCGCGCTC CTGCGCGACG CGGGCGAGGG GACGCGCGGC
CTGGAGGAGC TCGCCGCGCA CCTCGATCGC GCCGTCGCGG AGGAGCCGCC GGCGACGCTC
CGGGAGGGCG GGATCATCCG GCGTGGGTTC TCGCCAGAGC TCGACGAGAT CGTCGCCGTC
GCCGAGGACG GCAAGGGGTT CATCGCCCGG CTCGAGGCGC GCGAGAAGGA GCGGACGGGG
ATCGGCTCGC TCAAGGTCCG CTTCAACAAG GTGTTCGGGT ACTACCTCGA GGTCACGAAG
GCGAACCTGC ACGCGGTCCC CTCCGACTAC GAGCGGCGCC AGACCACCGT CGGCGGCGAG
CGGTTCGTCA CCCCGGAGCT GAAGCGCTTC GAGGAGACCG TGCTCACGGC CGAGGAGCGG
CGCATCGCCG TCGAGGGCCG GCTGTTCGAG GAGCTCCGCC AGCGGGTCGC CGAGGCGGCC
CCGCGGATCC GCACCGCCGC CGACGCGGTC GCCACCGCGG ACGCGCTGCT CGCGCTCGCG
CGCGTCGCGG CGGAGCGGGG CTACTGCCGG CCGGAGGTGG ACGGCTCGGA GGTGCTCGAG
ATCGTGGACG GGCGCCACCC GGTGGTGGAG GCGGTGCTGC CCGAGGGGCC CGCCGGGTTC
GTGCCGAACG ACGTGCTCGT CGCCTCCCGG GGGGCGCTCG AGTGCGAGCG GCTCGGGGCG
CTGCACGTGA TCACCGGGCC CAACATGGCC GGCAAGAGCA CCGTCATGCG GCAGGCGGCG
CTCGTGACGT TGCTCGCGCA GATGGGGGCC TTCGTGCCGG CGCGGAAGGC GCGGGTGGGG
ATCGTCGACC GGATCTTCAC GCGCGTGGGC GCGTCCGACG ACCTCGCGCG AGGTCGCTCC
ACCTTCATGG TCGAGATGAC CGAGACCGCG GCCATCCTCC ACAACGCGAC GCGCCGCTCG
CTGGTGGTGC TCGACGAGAT CGGGCGCGGC ACCTCGACCT TCGACGGCGT CTCGATCGCG
TGGGCGGTGG CGGAGCACCT CCACGACCAG GTGGGCTGCC GGACGCTGTT CGCCACCCAC
TACCACGAGC TGCAGGACCT CGCGCGCGAG CGGCCGGCGG TGAGGAACCT CACCGTGGCG
GTGCGCGAGG TCGGCGATCG GGTGGTGTTC CTGCGCAAGC TCGTGCAGGG CGGCGCCTCG
CGGAGCTACG GCATCGAGGT CGCGAAGCTC GCCGGCCTCC CGGCGGAGGT GCTCGCGCGG
GCGCGCGAGA TCCTGAAGAA CCTGGAGGCG CTCGAGGTCG ACGAGGGCGG CCACGCCGCG
CTCGCGCGCG GGCGGAAGAC CCGGCGGGCG GACCCGCAGA GCCAGCTCGG CCTGTTCGCG
CCCGCGCCGG CCCCGGCGGA CCCGGCGCTC GAGGAGATCG CGAGCGCCCT GCGCGCGACC
GAGATCGACG CGCTCCGCCC GCTCGACGCG CTGAACCTCC TCGCCGCCTG GCGCGCGAAG
CTGCGCTGA
 
Protein sequence
MPEIAQPTPM MRQYLETKAR YPDALLFFRL GDFYELFFED ALTASEALQI TLTARAKGDD 
KVPMCGVPHH AARGYVARLL EKGFKVAICD QVEEPGKSAI VKREVTRVVT PGMVFDDQVL
DPREASYLGV VALAEGRAGL ALLDASTGQL QCGEVPDDAR AVDELRRAGV RELVLPLGAD
AARAERIERA VGVPAARRPA ADYERADDRL RRHLGVASLD GFGVGGEPLG LAAAAAALAY
LADTQRATPR HVDRVSRLRT EDVLLLDEAT RTNLELERTL NGGRKKGSLL ALLDRSVTAP
GGRRLAEWLR YPLTELAPIH ARLDAVEELA GASVAREDLA AALRPVADAE RLLSRLVLGQ
GNARDLRALA GALLALPALA ELLGGRAAAL LRDAGEGTRG LEELAAHLDR AVAEEPPATL
REGGIIRRGF SPELDEIVAV AEDGKGFIAR LEAREKERTG IGSLKVRFNK VFGYYLEVTK
ANLHAVPSDY ERRQTTVGGE RFVTPELKRF EETVLTAEER RIAVEGRLFE ELRQRVAEAA
PRIRTAADAV ATADALLALA RVAAERGYCR PEVDGSEVLE IVDGRHPVVE AVLPEGPAGF
VPNDVLVASR GALECERLGA LHVITGPNMA GKSTVMRQAA LVTLLAQMGA FVPARKARVG
IVDRIFTRVG ASDDLARGRS TFMVEMTETA AILHNATRRS LVVLDEIGRG TSTFDGVSIA
WAVAEHLHDQ VGCRTLFATH YHELQDLARE RPAVRNLTVA VREVGDRVVF LRKLVQGGAS
RSYGIEVAKL AGLPAEVLAR AREILKNLEA LEVDEGGHAA LARGRKTRRA DPQSQLGLFA
PAPAPADPAL EEIASALRAT EIDALRPLDA LNLLAAWRAK LR