Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1047 |
Symbol | |
ID | 7084031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1147900 |
End bp | 1149873 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698065 |
Product | ATP-dependent DNA helicase RecQ |
Protein accession | YP_002354705 |
Protein GI | 217969471 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0514] Superfamily II DNA helicase |
TIGRFAM ID | [TIGR00614] ATP-dependent DNA helicase, RecQ family [TIGR01389] ATP-dependent DNA helicase RecQ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.763377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGCCC GGGCGTCGCA GACGCCCGGC TGCCACCGGT CCGTGTCATC GGACGAGACC CGGCGCGGTA AAATGCGCGC CATGCCCACC GACTTCCCCC GCCTGCCCGG CGAACCACGC ACGCCCCCCC CGACCACGTT CGACCAGCGC GACGTTCCCC TCGGCGACGC CGCGCATCGC GTGCTCGAGC ACGTCTTCGG CTACCCCGCC TTCCGCGGCG AGCAGGGGGA GATCGTCGAG CACGTGGCTG GCGGCGGCGA CGCGCTGGTG CTGATGCCGA CCGGTGGCGG CAAGTCGCTG TGCTACCAGA TCCCGGCGCT GCTGCGCCAC GGCACCGCGA TCGTGGTGTC GCCGTTGATC GCGCTGATGC AGGACCAGGT GAGCGCGCTG GTCGAGGCCG GCGTGCGCGC CGCCTTCCTC AACTCCAGCC TGGACATGGA GCGCGCACGC GCGGTGGAGC GCGCGCTCTG GGACGGCGAG CTCGAGCTGC TCTACGTCGC CCCCGAGCGC CTGATGACAC CGCGCTTCCT CGACCAGCTC GACCACCTGC GCGACACCGG CCGGCTCTCG CTGTTCGCGA TCGACGAGGC GCACTGCGTG TCGCAGTGGG GCCACGACTT CCGCCCCGAG TACCTGCAGC TCTCCATCCT GCCCGAGCGC TACCCGGCCA TCCCGCGCAT CGCGCTCACC GCCACCGCCG ACCGCCAGAC CCGCGAGGAG ATCGCCGAGC GCCTCAACCT GCAGGCGGCG CGCCGCTTCG TCTCCAGCTT CGACCGCCCC AACATCCGCT ACACCATCGT CGAGAAGAAC GACCCGCGCC GCCAGCTGCT CGACTTCATC CGCGAGGAAT GTCCCGGCCA GGCCGGCATC GTGTATTGCC TGTCGCGGCG CAAGGTCGAG GAGACCGCCG CCTGGCTGCA GGAGCAGGGC CTCGCCGCCC TGCCCTACCA CGCCGGCATG ACGCAGGAGA TCCGCGCCGA GCACCAGAGC CGCTTCCTGC GCGAGGACGG GCTGATCATG GTGGCGACGA TCGCCTTCGG CATGGGCATC GACAAGCCCG ACGTGCGCTT CGTCGCCCAT CTGGACCTGC CGCGCTCGAT CGAGGGCTAT TACCAGGAGA CCGGCCGCGC GGGGCGCGAC GGCCTGCCGG CGCAGGCCTG GATGGCCTGG GGCGCGCAGG ACGTGGTGCA GCAGCGCCGC ATGATCGACG AGTCGGAGGC GAACGAGGAG TTCAAGCGCC TGGCGCGCAA CCGGCTCGAC GTGCTGGTCG GCCTGGTCGA GGCCACCGAC TGCCGCCGCC AGCACCTCCT TGCCTACTTC GGTGAACAAT CGACCCCCTG CGGCAACTGC GACAACTGCC TGCACCCGCC GCAGACGTGG GATGCCACCG AGGCGGCGCG CAAGGCCTTG AGCTGCGTAT TCCGCACCGG CCAGCGCTAC GGCGCCGGCC ACCTGATCGA CGTGCTGCGC GGCGAGCTCA CCGAAAAGGT GGTCGAGCGC CGCCACCAGG ACATCACCAC CTTCGGCATC GGCAGCGAGC TCGACGAGAA GCGCTGGCGC ACGGTGTTCC GCCAGCTCGT CGCGCGCGAG TTGGTCGCGG TGGACCACGA GCGCTACAAC GCGCTGCGCC TCACCGACGC GGCCCGCCCG CTGCTGCGCG GCGAGGCCGA GTTCCACCTG CGCCTGGAGC CCGAGCGCAG CCGCAGCCGC GCCCGGCGGC GCAGCGGCGC AAGCCTGGAT ATCCCCGACG GCATCCCCAC CACGCTCTTC GACCGCCTGC GCGCCTGGCG CTTCGCCACC GCCAAGGAGC GCAACGTGCC GGCCTACGTG GTCTTCCAGG ACGCGACGCT GCGCGAGATC GCCATCGCCC GTCCGCACAC GCTGGCGGAG CTGGCCGGCA TCAGCGGCGT GGGCGATCGC AAGCTGGAGC ACTACGGGGC GGCGATCCTG CAGCTGGTCG CCGAAGCGGG CTGA
|
Protein sequence | MRARASQTPG CHRSVSSDET RRGKMRAMPT DFPRLPGEPR TPPPTTFDQR DVPLGDAAHR VLEHVFGYPA FRGEQGEIVE HVAGGGDALV LMPTGGGKSL CYQIPALLRH GTAIVVSPLI ALMQDQVSAL VEAGVRAAFL NSSLDMERAR AVERALWDGE LELLYVAPER LMTPRFLDQL DHLRDTGRLS LFAIDEAHCV SQWGHDFRPE YLQLSILPER YPAIPRIALT ATADRQTREE IAERLNLQAA RRFVSSFDRP NIRYTIVEKN DPRRQLLDFI REECPGQAGI VYCLSRRKVE ETAAWLQEQG LAALPYHAGM TQEIRAEHQS RFLREDGLIM VATIAFGMGI DKPDVRFVAH LDLPRSIEGY YQETGRAGRD GLPAQAWMAW GAQDVVQQRR MIDESEANEE FKRLARNRLD VLVGLVEATD CRRQHLLAYF GEQSTPCGNC DNCLHPPQTW DATEAARKAL SCVFRTGQRY GAGHLIDVLR GELTEKVVER RHQDITTFGI GSELDEKRWR TVFRQLVARE LVAVDHERYN ALRLTDAARP LLRGEAEFHL RLEPERSRSR ARRRSGASLD IPDGIPTTLF DRLRAWRFAT AKERNVPAYV VFQDATLREI AIARPHTLAE LAGISGVGDR KLEHYGAAIL QLVAEAG
|
| |