Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_1594 |
Symbol | |
ID | 3760854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 1744784 |
End bp | 1747399 |
Gene Length | 2616 bp |
Protein Length | 871 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637786331 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_391860 |
Protein GI | 78485935 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0845017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGACA CAATCAAGCA CACCCCCATG ATGCAGCAAT ACCTTGCTGT TAAAGCAGAC TATCCAAATC AATTATTGTT TTATCGAATG GGGGATTTCT ACGAGCTCTT TTATGAAGAT GCGGTCAAAG CGTCTGAATT ATTGGAAATT ACGTTGACGG CTCGCGGGAA ATCTGGCGGC AATCCTATTC CGATGGCGGG TATTCCGCAT CATTCGGCCG AAGGGTATTT GGCAAAGTTG GTAAAACTGG GACAATCGGT CGCGATTTGT GAACAAATTG GTGATCCGTC AATATCAAAA GGCCCTGTAG AACGTAAAGT GGTGCGCGTC ATCACACCGG GAACCTTGGT AGAGGATGCT TTATTAGAAG ACAAGTCTGA AAATTTACTT GCCGCCATTT TCCAGCAAGC AGACGAATAC GGACTGGCCA CTCTGGATGT GGCCAGTGGC CGCTTTGAAG CCACATTGCT GCCAGACTCG ACACAATTAA GTGCCGAGGT AGAACGACTC AAGCCAGCTG AAATCATCTT GCCGGACGAT CCACTCTTTA AACAGAACCT GCCTGAAAGC ATCCAAAATC GACCAGGCCT GGTCGATTAC CCAAGCTGGC ACTTTGAAAA AGACAGCTGT CGAAAACGCT TGATAGACCA TTTTGGCACA CAAGATTTAG TGGCCTTCGG TTGTGACCAA CTCCCGGCCG TCATCAGCGC GGCCGGCGTG ATTCTGCATT ACGCACAATC CATGCTTCAA AATACGCTGG CTCATGTATT CAGCCTGCAA ACCTATCAAG CCGATGATGC GCTGGCATTG GATGCCATGA GTCGACGTAA TCTGGAACTC GACACTAACC TGACAGGCGG TAAAAATCAC ACCTTGTTTG CCATTTTAGA CAATGCCACA ACGGCGATGG GTAGCCGCTT GATGAACCGC TGGCTCAACC AGCCCTTGCG AAACCGAGAC ATTATCAATG ATCGCTTCAA TGCGATTGAA GACATCATTG AACAACACAG CCAAGAAGAA TTTCGCAGCG CGTTAAAACC CATTGGCGAC TTGGAACGCA TTTTAAGCCG CGTGTCTTTA TATTCTGCAC GGCCTCGCGA CATCTTACAT TTAGGGCGGT CGCTAAACCA GCTCCCGGAA CTTCAAGCCT TATTGAAGCA ACAAACGGCC AATAAATGGC AACAGTTGTC CAAACAACTT GGTCTTTATC CGGAACTGGC AAGTCAACTA GAAACCGCGT TGGTTGAATC GCCCCCCATG TTGATGCGAG ATGGCGGCGT TTTTGCCGAA GGTTATGACA GCGAACTGGA TGAACTCCGT AACCTTAAAA ACCAAGCCGG CGATTATTTA TTGGCGCTGG AAGCACGTGA AAAAGAACGC ACAGGCATCA CCACCTTAAA GGTGGGCTAT AACCGAGTGC ATGGCTATTA CATTGAAGTC AGTAAACTGC AATCGGATAA TGTGCCGGCA GATTATGTCC GCCGCCAAAC TTTAAAAGCA CAAGAACGTT ATATCACCCC CGAATTGAAA GAATTTGAAG ACAAAGTTCT CAGTGCGAAT GAAAAAGCAC TGGCTCGCGA AAAGTGGTTA TATCAACAAT TATTGGAACG CTTAAACCAA GATTTGCAAG CGCTACAACG AACCGCAGCG GCACTGGCCG AAACGGATGT TTTGGTGTCT TTAGCCCGTC AGGCCATCAA CCTAAATTTA ACACGACCGA CCTTAAGTTC GGAACCTGGC ATTGACATCA AACAGGGGCG TCACTTAACC GTGGAAGCGC TATCGAACCA ACCGTTTATT CCGAACGATA CCTGTTTTGA TGAACAGCGC CGATTACAAA TTATCACCGG GCCCAACATG GGCGGTAAAT CAACCTTCAT GCGCCAAACC GCTTTGATTG CGATTATGGC GTACATGGGA AGCTTTGTGC CTGCTGAATC GGCCACTCTA GGACCGATTG ATCGTATTTT CACTCGCATC GGGGCTTCGG ACGATCTGAC CTCCGGTCGC TCCACTTTTA TGGTGGAAAT GACGGAAACA GCCAACATTC TTCATCATGC CTCACCGGAA TCTTTAATTC TGATGGACGA AGTCGGACGA GGTACCTCAA CCTTTGACGG ACTGGCACTT GCCTGGGCCA TTGCCGAACA AATGGCGCAA AGCATCCAAG GCTATTGCCT GTTTGCCACC CACTACTTTG AGCTCACCAC ACTGGTAGAG CAGTTCAATA ATACGGTCAA CATTCATCTC AGCGCCATAG AACACCAGGA TAAAATTGTC TTCATGCATC AGGTCGAAGA AGGTCCAGCC TCTCAAAGTT ACGGACTACA AGTAGCGGCT TTAGCCGGTG TACCAACTGC CGTCATAGAC AAGGCCAAAA AACACCTACA CCGTTTGGAA AATCAAACCG CAGCACAACA GCAAACCTCT GGCACAGCCT CTTCAGCAAA AGAATCTGTG CAACAATTTG ATTTATTCGC TCAACCCGCT TTACCGGAAG CCATAGAAAC CATGCTGACC GACCTGAAAG CCTTATCGGT TGATGATTTA ACACCGAGAC AAGCACTTGA AAAATTGTAC GAAGTCACTA ATACAGTTAA AAATGCATCC GAATAA
|
Protein sequence | MADTIKHTPM MQQYLAVKAD YPNQLLFYRM GDFYELFYED AVKASELLEI TLTARGKSGG NPIPMAGIPH HSAEGYLAKL VKLGQSVAIC EQIGDPSISK GPVERKVVRV ITPGTLVEDA LLEDKSENLL AAIFQQADEY GLATLDVASG RFEATLLPDS TQLSAEVERL KPAEIILPDD PLFKQNLPES IQNRPGLVDY PSWHFEKDSC RKRLIDHFGT QDLVAFGCDQ LPAVISAAGV ILHYAQSMLQ NTLAHVFSLQ TYQADDALAL DAMSRRNLEL DTNLTGGKNH TLFAILDNAT TAMGSRLMNR WLNQPLRNRD IINDRFNAIE DIIEQHSQEE FRSALKPIGD LERILSRVSL YSARPRDILH LGRSLNQLPE LQALLKQQTA NKWQQLSKQL GLYPELASQL ETALVESPPM LMRDGGVFAE GYDSELDELR NLKNQAGDYL LALEAREKER TGITTLKVGY NRVHGYYIEV SKLQSDNVPA DYVRRQTLKA QERYITPELK EFEDKVLSAN EKALAREKWL YQQLLERLNQ DLQALQRTAA ALAETDVLVS LARQAINLNL TRPTLSSEPG IDIKQGRHLT VEALSNQPFI PNDTCFDEQR RLQIITGPNM GGKSTFMRQT ALIAIMAYMG SFVPAESATL GPIDRIFTRI GASDDLTSGR STFMVEMTET ANILHHASPE SLILMDEVGR GTSTFDGLAL AWAIAEQMAQ SIQGYCLFAT HYFELTTLVE QFNNTVNIHL SAIEHQDKIV FMHQVEEGPA SQSYGLQVAA LAGVPTAVID KAKKHLHRLE NQTAAQQQTS GTASSAKESV QQFDLFAQPA LPEAIETMLT DLKALSVDDL TPRQALEKLY EVTNTVKNAS E
|
| |