Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1761 |
Symbol | |
ID | 7085728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1981910 |
End bp | 1983016 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643698780 |
Product | Smr protein/MutS2 |
Protein accession | YP_002355409 |
Protein GI | 217970175 |
COG category | [S] Function unknown |
COG ID | [COG2840] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCC GCGCTCCCGC GCCGGGTCCG GCCGGCACCC CAGGACGCGA CGCGCAGGGC GCGGTGCCGA AGCCCGCCTC GCCCTTCGCA GCCCTGCGCA AGCAGCTGCA GCAGCGCGCG CTCCCGGCAC CCGTGTCGTC GCCCGCCCCG GCGAAGCGTA TCCGCCACCC GCTTGAGGAA GACTCCTCCA GCGCCGGTGC CGATCGTGCA CAGGAGCCCG ACGCGGAGGC GCTCGAGCTC TTCCGGCGCA GCGTCGGCGC GGTGCGCCCG GTGCGCGGCA CGGACCGCGT CGAGATCCAC CGCCCCCGCC CTGCCCCGCG GCCGCGCACG CAAGCCGTGG AGGACGAGGA AACCGAGGAG CCCGTCCGTG CGCGACCCGA GACCGACCCG CTACGCGCGG CCTACGAGGG CGTGATGCCG CTGAGGGATA CCGGCCGAGT GGCGCTCGAC ACACCGCTGC GTCACCACGC CCGCCATGCA GGCGGTACGC ATCCCGCGCC GGCGCTGCGA CCCGACGCGA TCGTGCTGCC CGCGGACGCC GACGTCAGCG ATCCGGCAGC GCTCTTTCTT GCGGTAGTGG GCAATGCCCG CCCGGTCACC GACCGCAACC GCGTGGAGCT GGAGCGCCCG CAGCCGGCAC CTGCGCCGCT CAAGCGCGAG GAGGACGAGC GCGCGGCGCT CGGCGAATCG CTCGCCGCAC CGCTCACCTT CGAGGATCGC CTGGACATGG GCGACGAGGC GGCCTTCCTG CGGACCGGGC TGCCGCGCCG GGTGCTGACC GATCTGCGCC GCGGGCGCTG GGTGCTGCAG GGCCAGATCG ACCTCCACGG CCTCACCCGC GACGAGGCGC GCGCCGCGCT GGCGAACTTC CTGCACGACG CGCTTGCCCA GGGCAAGCGC TGCATCCGGG TGATCCACGG CAAGGGCCAC GGCTCGCCCG GGAAGGTGTC GATCCTGAAA CAGCTGTCGC GCGGCTGGCT GGCGCAGCGC GAGGAGATCC TCGCCTTTTG CCAGGCCGGC CCCCACGATG GCGGCGGCGG CGCCCTGCTG GTGCTGCTGC GCGCGCAGAA CGCCGCGCCG CGCGCCCGAA TGCCGTTACC CGCCTGA
|
Protein sequence | MSRRAPAPGP AGTPGRDAQG AVPKPASPFA ALRKQLQQRA LPAPVSSPAP AKRIRHPLEE DSSSAGADRA QEPDAEALEL FRRSVGAVRP VRGTDRVEIH RPRPAPRPRT QAVEDEETEE PVRARPETDP LRAAYEGVMP LRDTGRVALD TPLRHHARHA GGTHPAPALR PDAIVLPADA DVSDPAALFL AVVGNARPVT DRNRVELERP QPAPAPLKRE EDERAALGES LAAPLTFEDR LDMGDEAAFL RTGLPRRVLT DLRRGRWVLQ GQIDLHGLTR DEARAALANF LHDALAQGKR CIRVIHGKGH GSPGKVSILK QLSRGWLAQR EEILAFCQAG PHDGGGGALL VLLRAQNAAP RARMPLPA
|
| |