Gene Tmz1t_1047 details

Gene Information

Locus tag: Tmz1t_1047
Symbol:
ID: 7084031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1147900
End bp: 1149873
Gene Length: 1974 bp
Protein Length: 657 aa
Translation table: 11
GC content: 72%
IMG OID: 643698065
Product: ATP-dependent DNA helicase RecQ
Protein accession: YP_002354705
Protein GI: 217969471
COG category: [L] Replication, recombination and repair
COG ID: [COG0514] Superfamily II DNA helicase
TIGRFAM ID: [TIGR00614] ATP-dependent DNA helicase, RecQ family
            [TIGR01389] ATP-dependent DNA helicase RecQ
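
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive, 1-based coordinates and that the coding sequence ends in a single stop codon that is not counted in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
START_BP = 1147900
END_BP = 1149873
GENE_LENGTH_BP = 1974
PROTEIN_LENGTH_AA = 657


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # prints 1974
print(protein_length_from_cds(GENE_LENGTH_BP))  # prints 657
```

Both results agree with the Gene Length and Protein Length fields under the stated assumptions.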


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.763377
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGCCC GGGCGTCGCA GACGCCCGGC TGCCACCGGT CCGTGTCATC GGACGAGACC 
CGGCGCGGTA AAATGCGCGC CATGCCCACC GACTTCCCCC GCCTGCCCGG CGAACCACGC
ACGCCCCCCC CGACCACGTT CGACCAGCGC GACGTTCCCC TCGGCGACGC CGCGCATCGC
GTGCTCGAGC ACGTCTTCGG CTACCCCGCC TTCCGCGGCG AGCAGGGGGA GATCGTCGAG
CACGTGGCTG GCGGCGGCGA CGCGCTGGTG CTGATGCCGA CCGGTGGCGG CAAGTCGCTG
TGCTACCAGA TCCCGGCGCT GCTGCGCCAC GGCACCGCGA TCGTGGTGTC GCCGTTGATC
GCGCTGATGC AGGACCAGGT GAGCGCGCTG GTCGAGGCCG GCGTGCGCGC CGCCTTCCTC
AACTCCAGCC TGGACATGGA GCGCGCACGC GCGGTGGAGC GCGCGCTCTG GGACGGCGAG
CTCGAGCTGC TCTACGTCGC CCCCGAGCGC CTGATGACAC CGCGCTTCCT CGACCAGCTC
GACCACCTGC GCGACACCGG CCGGCTCTCG CTGTTCGCGA TCGACGAGGC GCACTGCGTG
TCGCAGTGGG GCCACGACTT CCGCCCCGAG TACCTGCAGC TCTCCATCCT GCCCGAGCGC
TACCCGGCCA TCCCGCGCAT CGCGCTCACC GCCACCGCCG ACCGCCAGAC CCGCGAGGAG
ATCGCCGAGC GCCTCAACCT GCAGGCGGCG CGCCGCTTCG TCTCCAGCTT CGACCGCCCC
AACATCCGCT ACACCATCGT CGAGAAGAAC GACCCGCGCC GCCAGCTGCT CGACTTCATC
CGCGAGGAAT GTCCCGGCCA GGCCGGCATC GTGTATTGCC TGTCGCGGCG CAAGGTCGAG
GAGACCGCCG CCTGGCTGCA GGAGCAGGGC CTCGCCGCCC TGCCCTACCA CGCCGGCATG
ACGCAGGAGA TCCGCGCCGA GCACCAGAGC CGCTTCCTGC GCGAGGACGG GCTGATCATG
GTGGCGACGA TCGCCTTCGG CATGGGCATC GACAAGCCCG ACGTGCGCTT CGTCGCCCAT
CTGGACCTGC CGCGCTCGAT CGAGGGCTAT TACCAGGAGA CCGGCCGCGC GGGGCGCGAC
GGCCTGCCGG CGCAGGCCTG GATGGCCTGG GGCGCGCAGG ACGTGGTGCA GCAGCGCCGC
ATGATCGACG AGTCGGAGGC GAACGAGGAG TTCAAGCGCC TGGCGCGCAA CCGGCTCGAC
GTGCTGGTCG GCCTGGTCGA GGCCACCGAC TGCCGCCGCC AGCACCTCCT TGCCTACTTC
GGTGAACAAT CGACCCCCTG CGGCAACTGC GACAACTGCC TGCACCCGCC GCAGACGTGG
GATGCCACCG AGGCGGCGCG CAAGGCCTTG AGCTGCGTAT TCCGCACCGG CCAGCGCTAC
GGCGCCGGCC ACCTGATCGA CGTGCTGCGC GGCGAGCTCA CCGAAAAGGT GGTCGAGCGC
CGCCACCAGG ACATCACCAC CTTCGGCATC GGCAGCGAGC TCGACGAGAA GCGCTGGCGC
ACGGTGTTCC GCCAGCTCGT CGCGCGCGAG TTGGTCGCGG TGGACCACGA GCGCTACAAC
GCGCTGCGCC TCACCGACGC GGCCCGCCCG CTGCTGCGCG GCGAGGCCGA GTTCCACCTG
CGCCTGGAGC CCGAGCGCAG CCGCAGCCGC GCCCGGCGGC GCAGCGGCGC AAGCCTGGAT
ATCCCCGACG GCATCCCCAC CACGCTCTTC GACCGCCTGC GCGCCTGGCG CTTCGCCACC
GCCAAGGAGC GCAACGTGCC GGCCTACGTG GTCTTCCAGG ACGCGACGCT GCGCGAGATC
GCCATCGCCC GTCCGCACAC GCTGGCGGAG CTGGCCGGCA TCAGCGGCGT GGGCGATCGC
AAGCTGGAGC ACTACGGGGC GGCGATCCTG CAGCTGGTCG CCGAAGCGGG CTGA
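
The gene sequence is printed in blocks of ten bases, so the blocks have to be joined before any analysis. The sketch below shows one way to do that and to run basic checks (GC fraction, first codon, terminal stop codon). The helper names are hypothetical. The stop codon set is the standard one for translation table 11, and only the first and last lines of the sequence are used as demonstration input.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq):
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCGGGCCC GGGCGTCGCA GACGCCCGGC TGCCACCGGT CCGTGTCATC GGACGAGACC"
last_line = "AAGCTGGAGC ACTACGGGGC GGCGATCCTG CAGCTGGTCG CCGAAGCGGG CTGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head), head[:3])   # number of bases on the first line and the first codon
print(ends_with_stop(tail))  # whether the sequence ends in a stop codon
print(gc_fraction(head))     # GC fraction of the first line only
```

Applied to the full joined sequence, `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.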
 
Protein sequence
MRARASQTPG CHRSVSSDET RRGKMRAMPT DFPRLPGEPR TPPPTTFDQR DVPLGDAAHR 
VLEHVFGYPA FRGEQGEIVE HVAGGGDALV LMPTGGGKSL CYQIPALLRH GTAIVVSPLI
ALMQDQVSAL VEAGVRAAFL NSSLDMERAR AVERALWDGE LELLYVAPER LMTPRFLDQL
DHLRDTGRLS LFAIDEAHCV SQWGHDFRPE YLQLSILPER YPAIPRIALT ATADRQTREE
IAERLNLQAA RRFVSSFDRP NIRYTIVEKN DPRRQLLDFI REECPGQAGI VYCLSRRKVE
ETAAWLQEQG LAALPYHAGM TQEIRAEHQS RFLREDGLIM VATIAFGMGI DKPDVRFVAH
LDLPRSIEGY YQETGRAGRD GLPAQAWMAW GAQDVVQQRR MIDESEANEE FKRLARNRLD
VLVGLVEATD CRRQHLLAYF GEQSTPCGNC DNCLHPPQTW DATEAARKAL SCVFRTGQRY
GAGHLIDVLR GELTEKVVER RHQDITTFGI GSELDEKRWR TVFRQLVARE LVAVDHERYN
ALRLTDAARP LLRGEAEFHL RLEPERSRSR ARRRSGASLD IPDGIPTTLF DRLRAWRFAT
AKERNVPAYV VFQDATLREI AIARPHTLAE LAGISGVGDR KLEHYGAAIL QLVAEAG