Gene Information |
Locus tag | Cagg_3768 |
Symbol | |
ID | 7267841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 4592846 |
End bp | 4595752 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643568575 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002465040 |
Protein GI | 219850607 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
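The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon (both assumptions are consistent with the values listed, but are not stated in the record). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4592846, 4595752)
print(length)                     # 2907
print(protein_length_aa(length))  # 968
```

Both results match the Gene Length and Protein Length rows of the record.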
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCGA TTGAATTGCA TGCGTGGTAT CGCCAGTATC GCAAACTAAA AGAAGAGGCT GCCGATGCGA TTTTGCTCTT TCGCTTCGGT GATTTCTACG AGACGTTTGA TGATGATGCA AAGTTGATTG CAGAGTTACT TGACGTAACT TTAACACGTA AAGAATACGC CGTTGATAAG CGGGCACCGA AGGATCAGCA GAAGTTGTAT GCGCCGATGG CGGGAATGCC TTACCACGCC GTCGATCGCT ATGTGAGCGA ATTAGTTGCC CGTGGCTATC GGGTGGCAAT TGCCGAGCAG TTGAGCGAGA CGGAGGCAAT GCGCAATGAT ACGCGGCCTC GTTCGGTCTA CGCTGCCGGT TTGACCCCGC TCGAAAGCAG CGGCAAGATG GTACAGCGGG CCATTGTACG GATTATCACG CCGGGTACCG TGATCGATCC GGCGATGTTG CCCGATCGCA CGAACAACTA TTTGGCAGCA GTGCTGGTTG AACAAGGGAA GGTAGGGTTG GCGTATGCCG ACCTTTCAAC CGGTGAATTT GCCGCTGCCG AGTTTGTTGA TGCACGTGCA TTGACCCAGT TGCAAGCCGA GTTGGCGCGT CTTCGTCCCG CTGAAGTGCT CGTCCCTGAC GATGAGGCGC TCCGTTTGCC AAACCTGGCC CCGGTACAGG CACGTTTGAG CCAAGACCTG GCCCCGCTCA CCAAGGAGGA GCGTGAGGTA TTGTTGCCCC ACGAGCGGGT GGCCCGTCGT CTTGATGCCC CTGGTGCTGC CAGTTGGACG CAAGGTCACG TGACCGAGTG GCCGACGTGG CGCTGGGAGT TGGCGACGGC TGCCTCCGCG TTGTGTGAGC AGTTGGCAGT TGCGACGTTG GCGGTGTGTG GTCTTGAAGA CCGCCCGTTG GCGACACGCG CTGCCGGTGC ATTGATTCAA TACGCCCAGA CCACGCAACG CCAGCGGGTC AACCAGTTGC GGTATCTGCG GGTGTACCAG ACCGGTGCAT ATATGCTGCT CGATCCGCAG ACGCGGCGCA ATCTTGAATT GCTGGAAAGT AGTGGGCGGC AAGGGGCAAA AGCTTCGTTG ATCGGCGTGC TCGACCGCAC GTGTACGGCA ATGGGGGCGC GTTTGCTGCG GCGTTGGATT GCCCAACCGC TGATCGTTTT AGAACCATTG CAAGTGCGTC AGCATGCTGT AGCACGCCTG GTCGCCGAGA CGATGACTCG GCTTGAGCTT CGTGAGGCGC TGGCCGAGTT GCCCGATATG GAGCGGGCGC TCAATCGGAT CGCACAAGGT ATTGCAGTGG CAACGCCGCG TGATATGGTT CAGTTACGGG CCGCGTTGCG CAAACTACCC GGCATCGCGC AAGCTATCGC ACCGTTGTTA CCCGACTTGC TCGCCCCTGA AATGGACGGC GAGCCGCTGC TCACGTTTGA CCCGTGCAGT GATGTGCTCG ATCTGCTAGA ACGGGCACTC GACGATGATC CACCGGCGTT GCTCGGTTCG TCGAACTATC TACGGGCTGC CGAAGAGGGT GGCGAGCGAC CGCGCCGTGT GATCCGCCCC GGTTTCGATC AGCGTCTCGA TGCGTTGATT AAGGCTAGTC GCCATGCCCA AGAATTCATC GACCGTCTCG AAACGAAAGA ACGTGAGCGT ACCGGGATTC GTTCGCTCAA AGTGGGTTAC AATCAAGTGT TTGGTTACTA TATCGAAATA TCGCGTGCCG TTGATCCGAA ACTGATCCCA TCACATTACG AACGCAAGCA AACGCTGGTG AATGCTGAGC GTTATGTGAC TGAAGAGCTG 
AAGTACTACG AAGGGTTGCT CAGCGATGCA CGGTTAAAGC TGGTTGATCT TGAACGAGAC ATCTTTCAAC GGTTGTGCGA TGACATTCAG CAACACCTCG ACCGGCTGCG GATAACGGTG GCCGCAGTGG CCCGCCTCGA TGCGTTAGCC GCCCTGGCCG AGGTGGCAGT GCGTGGCCGT TATGTCCAAC CGACCTTGCG AACCGATCGG GTATTGCGGA TCAAGCAGGG CCGTCATCCG GTTGTTGAGC GGACGCTGGG TGAGCCGTTC ATCGGCAACG ATGTCGATCT TGATGGTGAT AATGTCCAGA TTTTGATCAT TACCGGCCCG AATATGGCCG GTAAAAGCAC TTTCTTGCGC CAGGTGGCCT TGATTACCCT GATGGCGCAG ATCGGCTCGT TTGTCCCCGC CGATGAAGCC GAAATTGGCT TGGTGGATCG CATTTTTACC CGGATCGGTG CTCAAGACGA CATCGCTACC GGTCAGAGCA CCTTTATGGT TGAGATGACC GAAACTGCTG CATTGCTTAT GCAGAGTACA CCCCGTTCGC TGATCATTCT CGATGAGGTG GGGCGTGGGA CGAGTACGTA TGACGGTATG GCAATTGCCC GGGCCGTAGT TGAGTACATC CATAACGAAC CTCGGTTGGG GTGTCGGACG TTGTTTGCGA CCCACTATCA CGAGTTGACG GCACTTGATA CCGAACTACC TCGTGTACGC AACTTTCATA TGGCGGCTGT CGAGCGTGAC GGCCGAGTCG TCTTTTTGCA TGAGCTGCGT CCCGGTGGTG CCGATCGCTC GTATGGTATA CACGTCGCCG AGCTGGCCGG TATTCCGGCG AGTGTGATCA GGCGGGCCAA TGATTTGCTG GCCGAACTCG AGGGTCACAC GGCACGACCG ACGGATCGGC ACGCCAAACC ACGTTCGGAT GGGGAGCGCG CATTGCCATC GGCACCCTCA ACCGTAGGTA GTATGCAGTT ATCGCTATTT GATCTCGTAC CGCACCCGGT AGTGGAGTAT CTCCGCCGGC TGCGGATTGA GGAACTTACC CCGTTGGAAG CGTTGAACCG GCTGGCCGAG TTACAACGGC TAGCTCGCGA AGGATAG
|
Protein sequence | MAAIELHAWY RQYRKLKEEA ADAILLFRFG DFYETFDDDA KLIAELLDVT LTRKEYAVDK RAPKDQQKLY APMAGMPYHA VDRYVSELVA RGYRVAIAEQ LSETEAMRND TRPRSVYAAG LTPLESSGKM VQRAIVRIIT PGTVIDPAML PDRTNNYLAA VLVEQGKVGL AYADLSTGEF AAAEFVDARA LTQLQAELAR LRPAEVLVPD DEALRLPNLA PVQARLSQDL APLTKEEREV LLPHERVARR LDAPGAASWT QGHVTEWPTW RWELATAASA LCEQLAVATL AVCGLEDRPL ATRAAGALIQ YAQTTQRQRV NQLRYLRVYQ TGAYMLLDPQ TRRNLELLES SGRQGAKASL IGVLDRTCTA MGARLLRRWI AQPLIVLEPL QVRQHAVARL VAETMTRLEL REALAELPDM ERALNRIAQG IAVATPRDMV QLRAALRKLP GIAQAIAPLL PDLLAPEMDG EPLLTFDPCS DVLDLLERAL DDDPPALLGS SNYLRAAEEG GERPRRVIRP GFDQRLDALI KASRHAQEFI DRLETKERER TGIRSLKVGY NQVFGYYIEI SRAVDPKLIP SHYERKQTLV NAERYVTEEL KYYEGLLSDA RLKLVDLERD IFQRLCDDIQ QHLDRLRITV AAVARLDALA ALAEVAVRGR YVQPTLRTDR VLRIKQGRHP VVERTLGEPF IGNDVDLDGD NVQILIITGP NMAGKSTFLR QVALITLMAQ IGSFVPADEA EIGLVDRIFT RIGAQDDIAT GQSTFMVEMT ETAALLMQST PRSLIILDEV GRGTSTYDGM AIARAVVEYI HNEPRLGCRT LFATHYHELT ALDTELPRVR NFHMAAVERD GRVVFLHELR PGGADRSYGI HVAELAGIPA SVIRRANDLL AELEGHTARP TDRHAKPRSD GERALPSAPS TVGSMQLSLF DLVPHPVVEY LRRLRIEELT PLEALNRLAE LQRLAREG
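The sequences above are displayed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The sketch below is a minimal illustration of stripping that display whitespace and computing a GC percentage like the one in the GC content row. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown above.
example = "ATGGCTGCGA TTGAATTGCA"
cleaned = clean_sequence(example)
print(len(cleaned))         # 20
print(gc_percent(cleaned))  # 45.0
```

Applied to the full gene sequence, the same two steps would give a value to compare against the reported GC content.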
|
| |