Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_0614 |
Symbol | |
ID | 4030988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 674305 |
End bp | 677028 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637969146 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_575963 |
Protein GI | 92116234 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATCC AGCCCGCCGT CCCGACATCG CCGACCAACG CTGCCCCCGA GGCGGCGCGG GTCACGCCGA TGATGGAACA ATATCTTGAA ATCAAGGCGG CCAATCCCGG TTTGCTGCTG TTCTACCGGA TGGGCGATTT CTACGAGTTG TTCTTCGAGG ATGCCGAAAT CGCCTCCTGC ACGCTCGGCA TTACGCTGAC CAAACGCGGC AAGCATCAGG GCGCGGACAT CCCGATGTGC GGCGTTCCGG TGGAACGCTC GGACGATTAC CTGCATCGGC TGATCGCAGC CGGGCACCGC GTCGCCGTGT GCGAGCAGAT GGAGGACCCG GCGGCGGCGC GCAGGCGCGG CAACAAGAGC GTGGTTCGCC GCGATGTGGT TCGCCTCATC ACACCGGGAA CGCTGACCGA AGACACGCTG CTCGATGCCA GGGCCAACAA TTACCTGCTG GCGCTGGCGC GTGCGCGGGC GTCCTCCGGC GGAAACCGCA TCGCGCTGGC GTGGATCGAC ATCTCGACGG CGGAATTTAT TGTCACCGAA TGCAGTACGG GCGAACTCGC GGCGACGCTC GCGCGCATCA ATCCCAATGA AGTGATCGTT TCCGACGCGC TCTATGGCGA TCCGGACATG GCCGCGCTGC TGCGCGAACT GCCGTCGGTC ACGCCGCTGA CCCGCGACGT GTTCGACGGC GCGACCGCCG AACGCCGGCT GTGCGACTAT TTCGCCGTCG CCACCATGGA CGGCCTCAGC GCCATGTCGT GGCTGGAAGC GACCGCGGCC GCCGCCGCCG TCACCTACGT CGACCGCACC CAGATCGGCA AACGGCCGCC GCTGTCGCCG CCATCGCGCG AGGCCGCCGG CAGCACCATG GCGATCGATC CCGCCACCCG CGCCAACCTC GAACTGACAC GCACGCTCGC CGGCGAACGG CGCGGCTCGC TGCTCGACGC CATCGACCGC ACAATGACGG CGGCGGGGTC GCGGCTCTTG GCGCAACGCC TGGCGGCGCC GCTGACCGAT ATCGCCGCCA TCGCGCGGCG GCTCGATGCC GTCGCCGCGT TCACATCCGA CAGCGCGGCG CGCGACGACA TCCGCACCAT CCTGCGGACT GCTCCCGACA TGTCGCGGGC ACTGGCGCGG CTCTCGGTCG GACGCGGCGG CCCGCGCGAC CTCGCGGGCC TGCGCGACGG CATCATGGCT GCGGACCGAA CACTGGCGCG GCTTTCGGCG CTCCCCGATC CGCCGCAGGA CATCGTCGCG GCGATGCAGG CGCTGCGGCG GCCGTCCCGC GAGTTGGCGC GCGAACTCGG CGAGGCGCTG GCCGAAAACC TGCCGCTGAT GAAACGCGAC GGCGGCTTTG CCCGCGAGGG CTATGAGCCG ACGCTCGACG AGGCACGCAA GCTGCGCGAC GACTCGCGGC TCGTCGTCGC CGCGATGCAG GCGCGCTACA CTGAGGAGAC CGGCGTCAAG ACGCTGAAGA TCCGACACAA CAACGTGCTC GGCTATTTCG TCGAGGTGAC GGCGCAGCAC GGCGACAAGC TGACGAGCGC GCCGCTGAAT GCCACATTCA TCCATCGCCA GACGCTGGCC GGTCAGGTCC GCTTCACCAC ATCCGAGCTT GGCGAGATCG AGGCCAGGAT CGCCAATGCC GGCGACCGCG CGCTCGGCCT CGAGCTTGAG ATTTTCGACC GGCTTGCCGC GGTGGTGGTG GAGGCCGGCG ACGACCTGCG CGCCGCCGCG CACGCTTTCG CGCAACTGGA CGTCGCCGCG TCGCTCGCAA AACTCGCCAC CGACGAGAAT TTCACCCGCC CGGAGGTCGA TGCTTCTCTC GGCTTCGCCA TCGAGGGCGG CCGGCATCCG GTGGTGGAGC AGGCGCTGAA GCGCGCCGGC CAGCCGTTCA TCGCCAATGC CTGCGACCTG TCGCCGGGCC CAGGCCAGAC CTCGGGGCAG ATCTGGCTGC TCACCGGCCC CAACATGGCC GGCAAATCGA CGTTTCTGCG CCAGAACGCG CTGATCGCGC TGATGGCCCA GATCGGCAGC TTCGTGCCGG CGACGCGCGC GCGGATCGGC ATGATCGACC GGCTGTTCTC GCGGGTCGGC GCCGCCGACG ATCTGGCGCG CGGCCGGTCG ACCTTCATGG TCGAGATGGT GGAAACCGCC GTGATCCTCA ATCAGGCATC GGAACGCGCG CTGGTTATCC TCGACGAAAT CGGACGCGGT ACCGCGACCT TCGATGGCCT GTCGATCGCG TGGGCGGCGA TCGAGCACCT GCATGAGGCC AACAAGTGTC GCGCACTATT CGCGACGCAC TATCATGAAC TCACCGCGCT GTCGGCAAAA CTGCCGCGTC TGTTCAACGC CACCGTGCGG GTCAAGGAAT GGCACGGCGA GGTGGTATTC CTGCACGAGG TGCTGCCCGG CGCCGCCGAC CGTTCCTACG GCATCCAGGT GGCGAAGCTC GCCGGACTGC CGCCGTCGGT CGTCGCGCGC GCAAAATCGG TGCTGGCCAA ACTGGAGGCG CAGGATCGCG GATCGACCGT GCGCGCGCTG GTGGACGATC TGCCGCTGTT CGCGGTGCCG TCGCGCGCCG CCGACGAATC CGCGCCGCCG GGCGAGGCCG CACCACTGAT CGAGGCGCTG AAAGCCTTGC ATCCCGACGA GATGTCGCCG CGCGAGGCGC TGGAGGCGCT TTACGCGCTG AAGGCGAAGC TGCCGAAGCC ATAG
|
Protein sequence | MTIQPAVPTS PTNAAPEAAR VTPMMEQYLE IKAANPGLLL FYRMGDFYEL FFEDAEIASC TLGITLTKRG KHQGADIPMC GVPVERSDDY LHRLIAAGHR VAVCEQMEDP AAARRRGNKS VVRRDVVRLI TPGTLTEDTL LDARANNYLL ALARARASSG GNRIALAWID ISTAEFIVTE CSTGELAATL ARINPNEVIV SDALYGDPDM AALLRELPSV TPLTRDVFDG ATAERRLCDY FAVATMDGLS AMSWLEATAA AAAVTYVDRT QIGKRPPLSP PSREAAGSTM AIDPATRANL ELTRTLAGER RGSLLDAIDR TMTAAGSRLL AQRLAAPLTD IAAIARRLDA VAAFTSDSAA RDDIRTILRT APDMSRALAR LSVGRGGPRD LAGLRDGIMA ADRTLARLSA LPDPPQDIVA AMQALRRPSR ELARELGEAL AENLPLMKRD GGFAREGYEP TLDEARKLRD DSRLVVAAMQ ARYTEETGVK TLKIRHNNVL GYFVEVTAQH GDKLTSAPLN ATFIHRQTLA GQVRFTTSEL GEIEARIANA GDRALGLELE IFDRLAAVVV EAGDDLRAAA HAFAQLDVAA SLAKLATDEN FTRPEVDASL GFAIEGGRHP VVEQALKRAG QPFIANACDL SPGPGQTSGQ IWLLTGPNMA GKSTFLRQNA LIALMAQIGS FVPATRARIG MIDRLFSRVG AADDLARGRS TFMVEMVETA VILNQASERA LVILDEIGRG TATFDGLSIA WAAIEHLHEA NKCRALFATH YHELTALSAK LPRLFNATVR VKEWHGEVVF LHEVLPGAAD RSYGIQVAKL AGLPPSVVAR AKSVLAKLEA QDRGSTVRAL VDDLPLFAVP SRAADESAPP GEAAPLIEAL KALHPDEMSP REALEALYAL KAKLPKP
|
| |