Gene Anae109_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1457 
Symbol 
ID5378015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1653127 
End bp1655544 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content77% 
IMG OID640842968 
ProductSmr protein/MutS2 
Protein accessionYP_001378648 
Protein GI153004323 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC GCACGCAACG AGAGCTCGGC TGGCCCGAGA TCCTCAACGC CCTCGCGGCC 
CGCTGCCGGC TGCCGGCGGG CCGCAACCGG GCGCTCGCCC TCCCCTTCCA GCCGACCGCG
GAGGCCGCGC GCGAGGCGCT CGCCTTGGTG GGAGAGGCGC GGCGACTGTC GGAGCTCGCC
CTCGCGCTGC CGCTCGGGGG CGTCGGCGAC GTGGAGGGGC ACCTCGAGCG GGCGAGCAAG
GGGGGCGTCC TCGAGCCGCT CGCGCTGCGC GAGTGCGCCG CGCTCGCACG CGCCGCCGCC
CGGACCCGGG GCCTGCTCGA GGCCCGGGCG TCGGAGACGC CGCGGCTGTG GGCGCTCGCG
GAGCCGCTGT CGCCCTCGGC GGCGCTCGCC GATCGGATCG AGCGCGCCAT CGAGCCCTCG
GGCGCGATCT CGGACCGGGC GAGCGCGGAG CTCGCCCAGG CCCGCGAGCG ATCCCGCGGC
CTCCACCGCG CGCTCAAGGC CCAGGTCGAG ACGCTCCTCG CGGACGCGGA CATGCAGCGC
CACCTGCGCG ACACCTACTT CACCATCCGC AACGAGCGCT ACGTCCTCCC GGTGCTCGCG
AGCGCGCGCC GCGCCGTCCC CGGCATCGTC CACAACGCCT CGCAGTCCGG CCAGACGCTC
TTCGTCGAGC CGGATTCGAT GGTCGAGCTC GGCAACGAGC TCTCGATCGC GAACGCCGTC
GCCGCCGAGG AGGAGCAGCG CATCCTGCGC GAGCTCACCG GCGCGCTCAT GGCCGACTCG
GGCGCGCTCG CCCGCGACCT CGGGATCCTG GCCGCGCTCG ACGTGCTCGA GGGCTCGGCG
CTGCTCGCCT CCGATCTCGA CGCGCACGCC CCCGAGGTGC TCTCCCCCTT CGACGGGCTC
AGGGTGGGCG GCGCTGGCGC CGGCTTCGAG CTGCTGTCGC TCCGCCATCC CCTGCTCGTC
CTCCAGGGCA AGAAGGTGGT CCCGAGCCAC GTCCGGCTCG ACGCGCCCGC GCGCGCGCTC
ATCGTGTCGG GGCCGAACGG CGGCGGGAAG ACGGTCGCCA TCACCGCGGT CGGGCTGTCC
GCGCTCATGC TCCGCGCCGG CCTGCCGGTC GCGGCCGCCG AGGGGTCGCG GCTGCCGTTC
TTCCTGGAGG TGAAGGCGGC GGTGGACGAG CGGGGCGATC TCGCGAAGGA CCTCTCCACC
TTCACCGCCC ACCTCGCGGC CGTGAAGGAG ATGCTCGCGG GCGCGGTGCC CGGCTCCCTC
ATCCTCGTGG ACGAGATCGC CGCCGACACC GATCCGCGCG AGGGGGCGGC CCTCGCCGCG
GCGATCCTGG AGTCCCTGGT CGAGCGCGGC GCGGCGGTGC TCGTGACCAC GCACCTCGAC
GAGCTGAAGG CGCTCGCTCT CACGGACCCG CGCTACGCGA ACGCCCGCGT CGGCTTCGAC
GCCGAGCGGC TCGCGCCCAC CTACCAGCTG CACCTCGGGA GCCCGGGCAG CTCGTCGGCG
ATCGAGGTCG CCGCGCGGGT GGGCCTGCCG GCGCCGCTCG TCGAGCGGGC GCGGGCGGCG
CTCACCGGGC ACGGCGGCGC GCTCGGGCAG GCGCTCCGCG CCCTCGACGA CGAGCGCGCG
CGGCTCGCGG AGGAGCGGCG CGCCGCGGAG AGCGCGCGGG ACGCGGCGCG GAAGGCCGAG
GAGCGCGCCC GGGCCGCCGA GGAGGTCGCG CGGCGCGCCC AGCGCGAGGC GGCGGCGCGC
ATGGGCGAGG CGCTCGCGGA CGAGCTCGAG GCCGCGCGCG CGGAGGTGGC GGAGCTGCTC
GCCGGCCTGC AGGCGCGGCC CACCGTGAAG GCGGCGACCG ACGCCGCCCG CCAGCTCGAC
GCCTGGCGCG CGACGGTCGC CCAGGCGGCG AAGGCGACGC AGGCGCGCGC CGACGCCGGC
GCGGAGGCGC TCCCGGGCGG CGAGGTGAGG CCGGGCGTGC GCGTGCGGAT CGTCTCGCTC
GGCCAGGAGG GCGAGGTGGT CGAGGTGGAC GGCAAGGACG CGCTCGTGCG GGCCGGGCCG
CTGAAGGTCC GCCGGCCGGT CGCGGATCTC GTGCCGCTCC TCGGCAAGGC GAAGGACGCG
GCGAAGCTCG GGCGCTCGCG CTCGGAGAAG CTCCAGGCGG CCTCGGAGGC CCGCCCGAGC
GCGCCGCCCG GCCTCGAGCG GCGGCTCGAC GTGCGCGGCC TGCGCGTCGA GGAGCTGCTC
CGGGAGGTGG AGCGATTCCT CGATCGCCTC TACTCGGACG GCGAGGCCGA CTGCCTCATC
CTCCACGGCC ACGGCACCGG CGCGCTGAAG CAGGCGCTGC GCGATCACCT CTCCGCCTCG
CCCTACGTCG GCGCCTTCCG CGCCGGCGAC CGGCACGAGG GCGGCGACGC GGTGACGGTC
GTCAGCCTCC GGCGCTAG
 
Protein sequence
MTDRTQRELG WPEILNALAA RCRLPAGRNR ALALPFQPTA EAAREALALV GEARRLSELA 
LALPLGGVGD VEGHLERASK GGVLEPLALR ECAALARAAA RTRGLLEARA SETPRLWALA
EPLSPSAALA DRIERAIEPS GAISDRASAE LAQARERSRG LHRALKAQVE TLLADADMQR
HLRDTYFTIR NERYVLPVLA SARRAVPGIV HNASQSGQTL FVEPDSMVEL GNELSIANAV
AAEEEQRILR ELTGALMADS GALARDLGIL AALDVLEGSA LLASDLDAHA PEVLSPFDGL
RVGGAGAGFE LLSLRHPLLV LQGKKVVPSH VRLDAPARAL IVSGPNGGGK TVAITAVGLS
ALMLRAGLPV AAAEGSRLPF FLEVKAAVDE RGDLAKDLST FTAHLAAVKE MLAGAVPGSL
ILVDEIAADT DPREGAALAA AILESLVERG AAVLVTTHLD ELKALALTDP RYANARVGFD
AERLAPTYQL HLGSPGSSSA IEVAARVGLP APLVERARAA LTGHGGALGQ ALRALDDERA
RLAEERRAAE SARDAARKAE ERARAAEEVA RRAQREAAAR MGEALADELE AARAEVAELL
AGLQARPTVK AATDAARQLD AWRATVAQAA KATQARADAG AEALPGGEVR PGVRVRIVSL
GQEGEVVEVD GKDALVRAGP LKVRRPVADL VPLLGKAKDA AKLGRSRSEK LQAASEARPS
APPGLERRLD VRGLRVEELL REVERFLDRL YSDGEADCLI LHGHGTGALK QALRDHLSAS
PYVGAFRAGD RHEGGDAVTV VSLRR