Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_1897 |
Symbol | |
ID | 8419740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | - |
Start bp | 2178954 |
End bp | 2181257 |
Gene Length | 2304 bp |
Protein Length | 767 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645038483 |
Product | MutS2 family protein |
Protein accession | YP_003198759 |
Protein GI | 258406017 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1193] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01069] MutS2 family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.557227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0324553 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATCCC GTACGCTACG TTTATTGGAA TTTCCCCAAC TTTTGGAACA CCTTGCTTCG CGGGCCCAGT CCGAAGCCGG GCAAGCGGCT TGCCGCGATC TGGCCCCGAT GGCGGATCGC AATGCCTTGC ACCACCGGCA CGCTCTGGTC GCCGAGGGGC TGGAATGGGC CGGGCAATGG CTGAATGCCG TCCTGCCGTT TCCCGGACTG CAAGGTGTTT TCGAGTATGT ACAGCAGGAA GACCGCTTTC TCGATGCCGA TGGATTGTGG GGCGTGGCCC AGGTTCTTCA GCGTGCTGTC GCCGTCACCG AAGCCATCGA GACCATCAGT GACGAGAGGG CTGTGGCGCT GCACGCCTGG CGCGCCCGGT GCCCCTGGCC GGAGAAGCTT ACTTCCGCGC TCAAGCGGTG TCTGGGACCC GACGGACGGC TGACCGACGA GAGTTCCCCC GGTCTGTTGG CCGTGCGGGT CGAATTGCGG CGCATCCAGC AGCAGTGCAC AAAAAAAGTC CAGGATTCCC TGCACGATGA GGCCCTGTCG GCCTATCTCC AGGACGAGTA TTTTACCGTC TCCTCGGACC GCTATGTGCT GGCCCTGAAA GCCAATTTCA AGGGCCGTAT CCCTGGTATT GTCCACGACT ACTCCCAAAG CGGGGACACC TGCTATCTGG AACCCTTTTT CCTGGTCGAT CTCAATAACG CCCTCCAGGA ACTCAAGCAG CAGGAACGTG AGGAAGAACG CGAAGTCCTG CGGTATCTGA CCGGGCTATT GCATCAGGAA CGCGACGAGG TGGAGGCGCT GTACGACTGG CTGGTGCAGA CCGATGTCCT GGCCGCCAAG GTCCGCTTGG CCCAGGCGAT GGACGGGGCC CCACTGGTAC CCGACCCCGG CGCTGCCCTG CGTTTGAGCG AGGCGCGTCA CCCTTTGCTC GCCCTGGGCG CTGAAACGGT GCAGCCGGTG GACATCGCCC TGCATGAAGG CCAGCGCGCC CTGGTCATCA GCGGGGGGAA CGCGGGCGGC AAGACCGTCT GCTTGAAGAC CCTGGGCCTG GTGGCACTCA TGGCCCACGC CGGCCTGCCG GTCCCTGTGG CGGCCGACAG CACCCTGCCG TTGTGGGAGA CCATATTTGC CGTTCTCGGT GACGAGCAGA GTCTGGAAGA CCATTTGAGC ACCTTCACCG CACAGATCGG CCATTTTCAA CGCGCCTGGC CCCAGATGGG TCCCTCGAGC CTGGTCATTC TCGACGAATT CGGCGCCGGG ACCGACCCGA GTCAGGGCGC CGCCCTGGCC CAGGCAGTCC TCGACGGATT GCTGTCCGCC GGGGCCTGGA TCGGTGCCGC GACCCACTTC CCGGCCTTGA AGGCCTATGC TATGGGCACA GAGGGAGTCC GGGCCGCGTC GGTTCTCTTT GATCCCGATT CCCGTCGTCC GCTGTACCGT CTGGCCTACG ACCAGGTCGG AGCGAGTCAG GCCCTGGATG TGGCCGAAGA CCAGGGGCTG CCGCAACAGA TTTTGGATCG GGCCAAGGAA TACCTGCTTC TGGAAAGCGA CGAGACCGAC CGCCTTTTGG AGCGGCTCAA CCGCTTGGCG GCCCGGCGCC AGGAGACCCT GGACGAACTC GAGGGTCAAA AGGCGCAGCT CGAACGCGAG CGTCAACGGC TACAGGACCG GTTTGAACGG GAAAGCCGTG CCCTGCTCAA AGATCTCAAG GCCGCCTCCC AGGAGATTGT CCGGGAGTGG AAGGCCGGCA AGAAAAGCCG GAAAAAGGCC CAGGAAGAAT TGGCTCGGAT GCGGCACCAG ATCGAAGCCC AGGCCGAAGC CGAGGCGCAG TCCCAGCCGC TGACTCTGGA GGAGATCGAG CCGGGGCATC GGTTGATCTA TGTGTCCTGG GGGAAGAAGG GCGCCGTCGA GGAGGTCGAC AGCCGCAAAA AGCGGATCAA GCTCGATCTC GGCGGCGTGT CGTTATGGGC TTCACTTGCT GAGGTCCAGC GCGTTGCAGG CGAGGGCACG AAGTCGCCGT CTTCCTCGGT CCAATTCGTT CAGACCGAAG AGTCCGGCCT GCCTTTGCGC CTCGATGTGC GGGGCATGCG GGCTGAAGAG GCCCGCAACG AAGTCGAGCG GTTTCTGGAT CGCGCGCGCC TCGAAGGCCG GCGCCATCTG GAAATCGTCC ACGGCAAGGG CGAGGGGGTG CTGCGCCGCG AGGTGCAGGA CATGCTCAGA CGGGCGCCCG GTGTTGTCCA TTTCGAACTC GGCCGCCCCG AGCAGGGCGG CGACGGAGTG ACGCAGGTGG AATTGGGTGA TTGA
|
Protein sequence | MESRTLRLLE FPQLLEHLAS RAQSEAGQAA CRDLAPMADR NALHHRHALV AEGLEWAGQW LNAVLPFPGL QGVFEYVQQE DRFLDADGLW GVAQVLQRAV AVTEAIETIS DERAVALHAW RARCPWPEKL TSALKRCLGP DGRLTDESSP GLLAVRVELR RIQQQCTKKV QDSLHDEALS AYLQDEYFTV SSDRYVLALK ANFKGRIPGI VHDYSQSGDT CYLEPFFLVD LNNALQELKQ QEREEEREVL RYLTGLLHQE RDEVEALYDW LVQTDVLAAK VRLAQAMDGA PLVPDPGAAL RLSEARHPLL ALGAETVQPV DIALHEGQRA LVISGGNAGG KTVCLKTLGL VALMAHAGLP VPVAADSTLP LWETIFAVLG DEQSLEDHLS TFTAQIGHFQ RAWPQMGPSS LVILDEFGAG TDPSQGAALA QAVLDGLLSA GAWIGAATHF PALKAYAMGT EGVRAASVLF DPDSRRPLYR LAYDQVGASQ ALDVAEDQGL PQQILDRAKE YLLLESDETD RLLERLNRLA ARRQETLDEL EGQKAQLERE RQRLQDRFER ESRALLKDLK AASQEIVREW KAGKKSRKKA QEELARMRHQ IEAQAEAEAQ SQPLTLEEIE PGHRLIYVSW GKKGAVEEVD SRKKRIKLDL GGVSLWASLA EVQRVAGEGT KSPSSSVQFV QTEESGLPLR LDVRGMRAEE ARNEVERFLD RARLEGRRHL EIVHGKGEGV LRREVQDMLR RAPGVVHFEL GRPEQGGDGV TQVELGD
|
| |