Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_2520 |
Symbol | |
ID | 6282032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | + |
Start bp | 2843553 |
End bp | 2846237 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642622079 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001896139 |
Protein GI | 187924497 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00821283 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000000391787 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCATTC AAACCGCAGC GGCCAACGAC GTCGCACAGC ACACGCCGAT GATGCAGCAG TATCTGCGCA TTAAAGCGGA CCATCCGGGC ACGCTGGTGT TCTATCGGAT GGGCGACTTC TACGAACTCT TTTTCGAAGA CGCGGAAAAA GCCGCACGTC TGCTCGATCT GACACTCACG CAGCGGGGCG CGTCGGCGGG CAATCCGATC AAGATGGCCG GCGTGCCGCA TCACGCGGTC GAACAGTATC TCGCCAAGCT CGTGAAGCTC GGCGAATCCG TCGCAATCTG CGAGCAGATC GGCGACCCGG CTACGTCGAA GGGCCCGGTC GAGCGCAAGG TGGTGCGCGT GGTCACGCCG GGCACGCTCA CGGACGCAGC GCTCCTGTCC GACAAGAGCG ATGTCTATCT GCTGGCCATG TGCGTCGCGC ATAACCGGCG CGGCGTGGCG ACCAGCGTCG GCCTTGCATG GCTGAATCTG GCGAGCGGCG CGCTGCGCCT CGCGGAGGTC GCGCCCGACC AGGTGGCGGC TGCGCTTGAG CGCATTCGTC CTGCGGAAAT TCTGGTTGCC GACACGCCTA GCGACTCGGC GAGCTGGACG CCGCCGGTCA ATGCGGGCGC GCTCACCCGC GTGCCGGTCT GGCACTTCGA CGTCACCTCG GGCACGCAAC GCTTGTGCGA TCAGCTGGAA GTGGCGGGCC TCGACGGCTT CGGCGCGCAT TCGCTGACCT GCGCATGCGG CGCGGCCGGC GCGCTGCTGC TGTATGCCGC GGCCACACAA GGTCAGCAGT TGCGTCATGT GCGCAGCCTG AAGGTCGAAT ACGAATCCGA ATATATCGGC CTCGACCCCG CCACGCGCCG CAACCTCGAA CTCACGGAAA CACTGCGCGG CACTGAATCG CCCACGCTGT GTTCCCTGCT CGACACCTGC TGCACGACCA TGGGCAGCCG GCTGCTGCGT CACTGGCTGC ATCATCCGCC GCGCGAATCG GCGGTGGCGC AGGCGCGTCA GCAGGCGATC GGCGCCTTGC TCGACGCGCC GCCGGGCGCG AGCATCGATT CGTTGCGCGG CGCGTTGCGG CAGATCTCGG ACATCGAGCG CATCACCGGG CGTCTTGCGC TGCTGTCCGC GCGGCCTCGC GATCTGTCGA GCCTGCGCGA TACCTTCATT GCATTGCCCG AGTTGCGCAC GCAAGTCGCA GCCGTTGCGC CGAATGCGGA TTCGCTGGCC CGCATCGACG CATCCCTTGA ACCGCCGCAA GCCTGCGTCG AACTGCTCAA GCGCGCGGTT GCGCAGGAAC CGTCGGCCAT GGTGCGCGAC GGCGGCGTGA TCGCGCGCGG CTACGATGCG GAACTGGACG AACTGCGCGA TATTTCGGAG AACTGCGGGC AGTTCCTGAT CGACCTCGAG ACACGCGAGC GCGCACGCAC GGGCATCGGC AATCTGCGCG TCGAATACAA CAAGGTGCAT GGCTTTTATA TCGAAGTCAC GCGCGGCCAG ACCGACAAGG TGCCCGACGA CTATCGCCGC CGGCAGACAC TGAAAAATGC CGAACGCTAC ATCACGCCGG AACTGAAAAC GTTCGAGGAC AAAGCCCTGT CCGCGCAGGA ACGCGCCCTC GCCCGCGAAC GCTCGCTCTA CGACGCGCTG CTGCAGGCGC TGCTGCCCTT CATCCCGGAT TGTCAGCGGG TCGCCTCGGC GCTCGCCGAA CTCGATCTAC TGGCGGCCTT CGGCGAGCGC GCCCGCGCGC TCGATTGGGT CGCGCCCACG TTTTCGGCAA ATGCCGGCAT CGAAATCGAA CAGGGGCGGC ATCCGGTTGT CGAGGCGCAG GTCGAGCAGT TCATCGCCAA CGACTGCTCG CTCACGCCCG AACGCAAACT GCTGCTGATC ACCGGCCCGA ACATGGGCGG TAAATCGACC TTCATGCGCC AGACCGCGCT GATCGCGCTG CTCGCTTATG TGGGCAGTTA TGTGCCCGCG CGACGCGCCG CCTTCGGGCC GATCGACCGC ATCTTCACGC GCATCGGCGC AGCGGACGAT CTCGCCGGCG GCCGCTCGAC CTTCATGGTC GAGATGACCG AAGCCGCCGC GATCCTGAAC GACGCCACGC CGCAAAGTCT CGTGCTGATG GACGAGATCG GCCGCGGCAC GTCCACGTTC GACGGGCTCG CGCTGGCGTG GGCCATCGCG CGGCATCTGC TCGCGCACAA CGGTTGCCAT ACGCTATTTG CCACGCACTA CTTCGAATTG ACGCAGTTGC CCGCGGAATT TCCACAGGCG GCCAACGTGC ATTTGTCGGC GGTCGAGCAT GGGCACGGCA TCGTGTTCCT GCACGCGGTC AGCGAAGGCC CGGCGAATCA GAGCTACGGC CTGCAGGTTG CGCAACTCGC CGGCGTGCCG AACGCGGTGA TTCGCGCGGC GCGCAAGCAT CTTGCGCATC TGGAACAGCA GTCGGCCGCG CAACCCGCGC CGCAACTCGA TCTGTTCGCC ACGCCCATGC CGATGCTGCT CGAAGACGCA GACGACGAGC GCGATGCGAA GGCAGAGCCC GCGGTGCCAC CTGCCATGCA GGAGCTCGTC GAGCGTCTGC GCGGCATCGA TCCGAACGAT CTGCGACCGC GCGAAGCACT CGATCTGCTG TACGAATTGC ACGAACTGGC CGCCGCGCCG GATGCGGATC ATTGA
|
Protein sequence | MGIQTAAAND VAQHTPMMQQ YLRIKADHPG TLVFYRMGDF YELFFEDAEK AARLLDLTLT QRGASAGNPI KMAGVPHHAV EQYLAKLVKL GESVAICEQI GDPATSKGPV ERKVVRVVTP GTLTDAALLS DKSDVYLLAM CVAHNRRGVA TSVGLAWLNL ASGALRLAEV APDQVAAALE RIRPAEILVA DTPSDSASWT PPVNAGALTR VPVWHFDVTS GTQRLCDQLE VAGLDGFGAH SLTCACGAAG ALLLYAAATQ GQQLRHVRSL KVEYESEYIG LDPATRRNLE LTETLRGTES PTLCSLLDTC CTTMGSRLLR HWLHHPPRES AVAQARQQAI GALLDAPPGA SIDSLRGALR QISDIERITG RLALLSARPR DLSSLRDTFI ALPELRTQVA AVAPNADSLA RIDASLEPPQ ACVELLKRAV AQEPSAMVRD GGVIARGYDA ELDELRDISE NCGQFLIDLE TRERARTGIG NLRVEYNKVH GFYIEVTRGQ TDKVPDDYRR RQTLKNAERY ITPELKTFED KALSAQERAL ARERSLYDAL LQALLPFIPD CQRVASALAE LDLLAAFGER ARALDWVAPT FSANAGIEIE QGRHPVVEAQ VEQFIANDCS LTPERKLLLI TGPNMGGKST FMRQTALIAL LAYVGSYVPA RRAAFGPIDR IFTRIGAADD LAGGRSTFMV EMTEAAAILN DATPQSLVLM DEIGRGTSTF DGLALAWAIA RHLLAHNGCH TLFATHYFEL TQLPAEFPQA ANVHLSAVEH GHGIVFLHAV SEGPANQSYG LQVAQLAGVP NAVIRAARKH LAHLEQQSAA QPAPQLDLFA TPMPMLLEDA DDERDAKAEP AVPPAMQELV ERLRGIDPND LRPREALDLL YELHELAAAP DADH
|
| |