Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_1981 |
Symbol | |
ID | 3744950 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | - |
Start bp | 2205363 |
End bp | 2207243 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637770012 |
Product | DNA mismatch repair protein |
Protein accession | YP_375866 |
Protein GI | 78187823 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0323] DNA mismatch repair enzyme (predicted ATPase) |
TIGRFAM ID | [TIGR00585] DNA mismatch repair protein MutL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.201794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCCA TTGCCAGGCT TCCGGACAAT GTAGCCAACA AGATTTCAGC GGGGGAGGTC GTCCAGCGGC CGGCTTCCGT CGTCAAGGAA CTCCTTGAAA ACGCCATAGA CTCCGGCGCC GACCGGATTT CAGTCGTTAT AAAAGACGCC GGCAGGGAAC TGGTGCGCAT CATCGACAAC GGACGGGGAA TGAGCAGGGC GGATGCACTT CTTTCGGTCG AGCGGTTCGC CACAAGCAAG CTCAGGGACG TCGATGACCT CGATACCCTC GGGACCCTCG GATTCCGTGG CGAGGCACTC GCCAGCATCT CCTCCGTCTC CCATTTCGAG CTCCGCACCC GCATGACCGA TGCGCCTGTC GCGCTCCGTT TCCGTTACGA GGGCGGCATT GCAGTGGAAG AGTCTGAGGT GCAGGGTGAG GCGGGCACCT CGGTGAGCGT CCGCAATCTC TTTTACAACG TCCCCGCGCG CCGGAAGTTC CTGAAGTCCA ACGCCACCGA GTACGGCCAT ATTTTCGAAC TCGTCCGTTC GTTTTCCCTC GCCTACCCTG AAATACAGTG GCAGCTCCTG AACGACGACC AGGAGCTGTT CAACTTCCGC ACTTCCGATA TGCTGGAGCG CCTCGATACC TTTTACGGAA AAGGGTTTGC CGACAGCCTC ATCGAGGTCG GCGAAGAAAA CGACTACCTC TCCATCAGGG GATACATCGG CCGCCCGGCG CTCCAGAAGC GAAAGAAGCT CGACCAGTAC TTCTTCATCA ACCGTCGCCC GATCCAGAAC CGCATGCTCA CCCAGGCTCT CCAGCAGGCA TATGCCGAGC TGCTTGTAGA GCGCCAGGCA CCCTTCGCCC TCCTCTTTCT CGGTATCGAT CCCTCACGGG TGGATGTCAA CGTGCACCCT GCGAAGCTCG AGGTCCGGTT CGACGATGAG CGAAGCGTGC GCAACATGTT CTACCCCGTC ATCAAGCGGG CCGTGACACT GCATGACTTT TCCCCCGATC TTGCCGCAGG AGGACGGACC TCGCAGGCAG GGGATGATTC CGCTTCCCGG GGGTTCACTC ATGCCGGCGG GGGTGGATTC AGGACCCTTG CTTTTCAGGA GGTCCCGGAA CGGGCCATTA CGACCGGAGA GCTCTACGGC AGCTATCGCG AAGGGGCATT CGGCAGTTCC CGCCCGGCAG TTCCGCAGCC TTCACACCAG GAGGTGATGT TCCCTGTTCC TGAAGTCCCG GCGGCCCGTG AGGATATCTC ACAGCTGCTC CGCTCGAGCA TGCACGAGGG CCCGGAAGGC GCCGGAGTGG AGCCGAAAGG GGAGGAACCG AAGATCTGGC AGCTCCACAA CAAGTACCTC ATCTGCCAGA TCAAGACCGG GCTCATGATC ATCGACCAGC ACGTGGCTCA TGAGCGGGTG CTCTACGAAC GCGCGGTGGA GGTGATGGAG AGCCGCGTGC CGAACTCCCA GCAGCTGCTC TTTCCGCAGA AGGTCGAGTT CCGGCCGTGG GAGTATGAAG TGTTCGAGGA GATCAAAGAC GATCTGTACC GGCTGGGCTT CAACCTTCGT TCGTTCGGGA CCCGGGCGGT GATGATCGAG GGCGTCCCGC AGGATGTGCG GCCCGGAAGC GAGGCCACCA TCATGCAGGA CATGATTGCC GAGTACAGGG AGAACGCCAC CCGGCTGCGG CTGGAGAGGC GCGACAATCT GGCGAAATCA TACTCCTGCC GCAACGCCAT CATGGCGGGC CAGAAACTCT CGATGGGGGA GATGCGCACC CTCATCGACA ATCTTTTCGC CACCAGGGAA CCTTACTCAT GCCCGCATGG CAGGCCCGTC ATCATCAAGA TGACGCTGAC CGAGCTCGAC CATATGTTCG GCAGGTCCTG A
|
Protein sequence | MPSIARLPDN VANKISAGEV VQRPASVVKE LLENAIDSGA DRISVVIKDA GRELVRIIDN GRGMSRADAL LSVERFATSK LRDVDDLDTL GTLGFRGEAL ASISSVSHFE LRTRMTDAPV ALRFRYEGGI AVEESEVQGE AGTSVSVRNL FYNVPARRKF LKSNATEYGH IFELVRSFSL AYPEIQWQLL NDDQELFNFR TSDMLERLDT FYGKGFADSL IEVGEENDYL SIRGYIGRPA LQKRKKLDQY FFINRRPIQN RMLTQALQQA YAELLVERQA PFALLFLGID PSRVDVNVHP AKLEVRFDDE RSVRNMFYPV IKRAVTLHDF SPDLAAGGRT SQAGDDSASR GFTHAGGGGF RTLAFQEVPE RAITTGELYG SYREGAFGSS RPAVPQPSHQ EVMFPVPEVP AAREDISQLL RSSMHEGPEG AGVEPKGEEP KIWQLHNKYL ICQIKTGLMI IDQHVAHERV LYERAVEVME SRVPNSQQLL FPQKVEFRPW EYEVFEEIKD DLYRLGFNLR SFGTRAVMIE GVPQDVRPGS EATIMQDMIA EYRENATRLR LERRDNLAKS YSCRNAIMAG QKLSMGEMRT LIDNLFATRE PYSCPHGRPV IIKMTLTELD HMFGRS
|
| |