Gene Information |
Locus tag | BBta_7642 |
Symbol | mutS |
ID | 5150854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 8036256 |
End bp | 8038988 |
Gene Length | 2733 bp |
Protein Length | 910 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640562285 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001243393 |
Protein GI | 148258808 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
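The coordinate and length fields above can be cross-checked against one another. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, though both are consistent with the values it reports. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for BBta_7642 (mutS)
length = gene_length_bp(8036256, 8038988)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the coordinates listed for this gene, the two functions give 2733 bp and 910 aa, matching the Gene Length and Protein Length fields.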
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.532158 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCC AGCAGCCCAT CACCGCCCCG GTTCCACAGG AAACCGCGCC CGCCGAGGCG CCGGCCAAGC TCACGCCGAT GATGGAGCAA TATCTGGACA TCAAGGCCGC CCATCCGGGG CTGATGCTGT TCTACCGGAT GGGCGACTTC TACGAGCTGT TTTTCGAGGA CGCGGAGGTC GCGTCGAAGG CGCTCGGCAT CGTGCTGACC AAGCGCGGCA AGCATCAGGG CCAGGACATC CCGATGTGCG GCGTGCCGGT GGAGCGCTCC GAGGACTATC TGCACCGGCT GATCGCGCAA GGCATCCGTG TCGCCGTCTG CGAGCAGATG GAGGATCCGG CGGCGGCGCG CGCGCGCGGC AACAAGAGCG TGGTCAAGCG CGGCGTGGTG CGCGTGGTGA CGCCGGGCAC GCTCACCGAA GACAATCTGC TCGATGCCCG CGCCAATAAT TATCTGCTGG CGATCGCGCG CAGCCGCGGC TCCTCCGGCG GCGACCGGCT CGGGCTCGCC TGGATCGACA TTTCGACCTC GGACTTCATC GTCACCGAAT GCGCCTTCGC CGAGCTGACC GCGACGCTCG CCCGCATCAA TCCGAACGAA GTGATCATCT CGGATGCGCT GTATTCGGAT GAAGTCTTCG AGCCGGTGCT GCGCGAGCTC GCCGCGGTGA CGCCGCTGAC GCGCGACGTA TTCGACGGCG CCACCGCCGA GCGCCGGCTG TGCGATTATT TCGCGGTGGC CACGATGGAC GGCCTCGCCG TGCTGTCGCG GCTGGAAACC ACCGCTGCCG CCGCCTGCGT CACTTATGTC GAGCGGACAC AAGTGGGGCA GCGCCCGCCG CTGGCGCCGC CGGCGCGCGA GGCGACCGGC AGCACCATGG CGATCGACCC GGCCACCCGG GCCAATCTCG AGCTGACGCG CACGCTCGCC GGCGAGCGCC GCGGCTCGCT GCTCGACGCC ATCGACTGCA CGGTGACCTC GGCCGGCTCA CGCCTGCTGG CGCAGCGGCT TGCGGCGCCC TTGACGGAGC CGGCGCAGAT CGGCCGCCGG CTCGATGCCG TCAACGTGTT CGTCGCCGAC AGCGCCGCGC GCGAGGACAT CCGCGCGATC CTGCGCGGCG CGCCGGACAT GACGCGCGCG ATGGCACGGC TGTCGGTCGG CCGCGGCGGC CCGCGCGATC TCGCGGCCTT ACGCGACGGC ATCCTCGCCG CCGACCAGGC GCTCGGGCGG CTGTCGGCGC TGGATCAGCC GCCGCAGGAG ATCGCGGCCG CGATGGCGGC GCTGGCGCGG CCGGCGCGCG CGCTCGCCGA AGAATTGAGC CGCGCGCTCG ACGAGCAGTT GCCGCTGATC AAGCGTGACG GCGGCTTTGT CCGCTCAGGC TATGATTCGA CCCTCGACGA GACGCGCAAT CTGCGCGACG CGTCCCGCCT CGTCGTCGCC TCGATGCAAG CGCGCTATGC CGACCAGACC GGCGTCAAGG CGCTCAAGAT CCGCCACAAC AACGTGCTCG GCTATTTCGT CGAGGTCACC GCGCAGCACG GCGATAAATT GATGTCGGCG CCGCTGAACG CGACCTTCAT CCATCGCCAG ACGCTCGCCG GACAGGTGCG CTTCACGACA TCGGAGCTCG GCGAGATCGA GGCCAAGATC GCGAACGCCG GCGAACGCGC GCTCAATCTC GAGCTCGAGA TCTTCGACCG GCTGTGCGGA CAAGCGCTGG CGATCGGCGA CGATCTGCGC GCTGCGGCCC ATGGCTTTGC GATGCTCGAT GTCGCGACTG CGCTGGCCAA GCTCGCGGTC 
GATGACAACT ACATCAGGCC GGAGGTCGAC GGCTCGCTCG GCTTCGCCAT CGAGGGCGGC CGCCATCCGG TGGTCGAGCA GGCCTTGAAG CGCGAAGGCC AGCCCTTCAT CGCCAATTCC TGCGATCTGT CGCCGACGCC GGGGCACAAA AGCGGCCAGT TGTGGCTGCT CACCGGCCCG AACATGGCGG GTAAATCGAC CTTCCTGCGC CAGAACGCCT TGATCGCCCT GCTCGCGCAG ATCGGCTCGT TCGTTCCGGC GACGCGGGCC CGCATCGGCA TCGTCGACCG GCTGTTCTCG CGCGTCGGCG CCGCCGACGA TCTGGCGCGC GGACGCTCGA CCTTCATGGT CGAGATGGTC GAGACGGCGG CGATCCTCAA CCAGGCCGGC GAGCGCGCGC TGGTGATCCT GGACGAGATC GGCAGGGGCA CCGCGACCTT CGACGGCCTG TCGATCGCCT GGGCCGCGAT CGAGCATCTG CACGAGAGCA ACCGCTGCCG CACGCTGTTC GCCACGCATT ATCACGAGCT GACGGCGCTC GCCGCCAAGC TGCCGCGGCT GTTCAACGCC ACGGTGCGGG TCAAGGAATG GCACGGCGAC GTGGTGTTCC TGCACGAGGT GCTGCCCGGC TCCGCCGACC GCTCCTATGG AATTCAGGTG GCGAAGCTCG CCGGCCTGCC GCCGGCCGTG ATCAGCCGCG CCAAATCGGT GTTGGCGAAG CTGGAGGCCG CCGACCGCGG CCAGAACGCG CGGGCGCTGG TCGACGATCT TCCGCTGTTC GCCGTGCCGT CGCGCGCGGC GCCCGAGCCC GCGATGTCGA AAGAAGCGGA GGAGCTGATT GCCGCGGTCA AGGCGCTGCA CCCCGACGAG ATGACGCCGC GCGAAGCGAT GGACGCGCTG TATGCGCTGA AGGCGAAGCT GCCGAAGGGC TAA
|
Protein sequence | MTIQQPITAP VPQETAPAEA PAKLTPMMEQ YLDIKAAHPG LMLFYRMGDF YELFFEDAEV ASKALGIVLT KRGKHQGQDI PMCGVPVERS EDYLHRLIAQ GIRVAVCEQM EDPAAARARG NKSVVKRGVV RVVTPGTLTE DNLLDARANN YLLAIARSRG SSGGDRLGLA WIDISTSDFI VTECAFAELT ATLARINPNE VIISDALYSD EVFEPVLREL AAVTPLTRDV FDGATAERRL CDYFAVATMD GLAVLSRLET TAAAACVTYV ERTQVGQRPP LAPPAREATG STMAIDPATR ANLELTRTLA GERRGSLLDA IDCTVTSAGS RLLAQRLAAP LTEPAQIGRR LDAVNVFVAD SAAREDIRAI LRGAPDMTRA MARLSVGRGG PRDLAALRDG ILAADQALGR LSALDQPPQE IAAAMAALAR PARALAEELS RALDEQLPLI KRDGGFVRSG YDSTLDETRN LRDASRLVVA SMQARYADQT GVKALKIRHN NVLGYFVEVT AQHGDKLMSA PLNATFIHRQ TLAGQVRFTT SELGEIEAKI ANAGERALNL ELEIFDRLCG QALAIGDDLR AAAHGFAMLD VATALAKLAV DDNYIRPEVD GSLGFAIEGG RHPVVEQALK REGQPFIANS CDLSPTPGHK SGQLWLLTGP NMAGKSTFLR QNALIALLAQ IGSFVPATRA RIGIVDRLFS RVGAADDLAR GRSTFMVEMV ETAAILNQAG ERALVILDEI GRGTATFDGL SIAWAAIEHL HESNRCRTLF ATHYHELTAL AAKLPRLFNA TVRVKEWHGD VVFLHEVLPG SADRSYGIQV AKLAGLPPAV ISRAKSVLAK LEAADRGQNA RALVDDLPLF AVPSRAAPEP AMSKEAEELI AAVKALHPDE MTPREAMDAL YALKAKLPKG
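The sequences above are printed in whitespace-separated blocks of ten characters. The following is a minimal sketch of how one might strip that spacing, compute GC content, and translate the gene sequence for comparison with the listed protein. It assumes the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in their permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record
print(translate("ATGACGATCC AGCAGCCCAT CACCGCCCCG"))  # MTIQQPITAP
```

Running `translate` on the full gene sequence and comparing the result with `clean` applied to the protein sequence checks that the two fields agree. `gc_fraction` on the full gene sequence can likewise be compared with the reported GC content.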