Gene Tmz1t_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0972 
Symbol 
ID7085075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1073023 
End bp1075293 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content65% 
IMG OID643697994 
ProductDEAD-like helicase 
Protein accessionYP_002354634 
Protein GI217969400 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTCG ATCTCGAAAC CGTTCCCGAA ACCGCTGTGC AGGGCGACCT GCTCGAAACG 
GCCGCTTCAC CCCTCACGCT CAGCCTGCAG GACTTCGTAT CGGAGTTCGG CGACGAGCTT
CTCGACTCGC TCAACCGCGC CAATCCTCCG GTCTACACCG GCCAGGTGCG GGTGCATCGG
CAACTGATCC TCGCCGCGCT CAAGCGCAAG CTGTTCCCGG CGCAAGCCGA TGTGGTCCAC
GCCGTCACCG AGCTGTTGGT CGACCGTGGC GAACGCGCCG CGATCGTCAA TGGCGAGATG
GGCTGCGGCA AGACGACGGT GGGTATTGCC ACCGCCGCCG TGCTCAACGC CGAAGGCTAC
CGCCGCACCC TGGTGCTCTC GCCACCCCAC CTGGTCTACA AGTGGCGGCG CGAAATCCAG
GAGACGGTGG CCGGCGCCAA GGTCTGGGTG CTCAATGGCC CGGACACGCT GGTCAAGCTG
CTGAAACTGC GCGAGCAGTT GGGCGTGCCG GCCCAGGGCC AGGAGTTCTT CGTCCTGGGC
CGTGTGCGGA TGCGGATGGG GTTCCACTGG AAGCCTATCT TCGTTCGGCG GCGCACGCCT
CACGGCGACG TGGGGGCCTG CCCGGGTTGC GGGCATGTCA TCACCGACCT GGACGGCGAG
CCTGTCAACA CCATCGATCT AGAAGCGGAA GAGTCCCGCC GCAAATGCAG CCACTGCCGC
GCACCGCTGT GGTCGTTGAT CCGTCCCAGA GGCCTGTCCG CCAGCGACCA GTCCTCGAGC
GTGCTCAAGG CACTGAAGCG TATTCCAACC ATCGGGGAAG TCACCGCGCA GAAGCTGATG
CAGAAGTTCG GTGACGCCTT CCTCGCGTCG ATGCTCGGGG ACAACATCCA CGAGTTCATC
AACCTGATGG ATGGCAACGG CGAGCTGGTC TTCTCGGACC GGCAAGCCCA TCGGATGGAA
CGCGCGATGG CCAACATGGA GTTCGGCTTC GGCGAGGGCG GCTACCAGCC GTCCGAGTTC
ATCAAGAGGC AGCTTCCCCA AGGTACGTTC GACCTGCTCA TCGCCGACGA GGCGCACGAG
TACAAGAACG GCGGCTCCGC CCAGGGCCAG GCCATGGGGG TGTTGGCGGC CAAGGCGCGC
AAGACGCTGC TGCTCACCGG CACACTGATG GGCGGCTACG GCGACGACCT GTTCCACCTG
CTGTTCCGCG CCCTGCCGGG GCGGATGATC GAAGACGGCT ACCGGCCGAC GAAGAGCGGC
AGCATGACCT CGGCCGCGAT GGCGTTCATG CGCGATCACG GTGTCTTGAA GGACATCTAC
TCCGAGAGCA CCGGCACGGC GCACAAGACG GCCAAAGGCA CCAAGGTATC GGTGCGCACG
GTCAAGGCGC CGGGTTTCGG TCCGAAGGGC GTGCTGCGTT GCGTTCTGCC GTTCACGGTC
TTCCTCAAGT TGAAGGACAT CGGCGGCAAT GTGCTGCCGC CCTACGACGA GGAGTTCCGC
GAAGTCGCGA TGGACACGGC GCAGGCCGCG GCCTACCGCG ATCTGGCGGG TCGCCTGACC
CAGGAGCTGA AGCAGGCCCT GGCGAAGCGC GACACGACGC TGCTCGGTGT AGTCCTCAAC
GTGCTGCTGG CCTGGCCGGA CTGCTGCTTC CGGTCGGAAA CGGTGGTGCA TCCGCGCACG
CGCAACACCT TGGCGTTCGT TCCGGCTCAG TTCAACGGGC TGGAGGTGAT GCCCAAGGAA
CGCGAGCTGA TCGAGATCTG CAAGCAGGAG AAGGCAGAAG GTCGCAAGAC CCTGGTCTAT
TCGGTCTACA CCGGCACCCG CGACACCACG TCGCGTTTGA AGGTGCTGCT GGAGCAGGAA
GGCTTCAAGG TGGCGGTGCT GCGCGCGAGC GTGGATGCCT CCCGCCGCGA GGACTGGATC
GCCGAGCAGT TGGACCGTGG CATCGACGTG CTCATCACCA ATCCCGAGCT GGTGAAAACC
GGCCTGGACT TGCTGGAGTT CCCGACCATC GTGTTCCTCC AGTCCGGCTA CAACGTGTAT
TCGTTGCAGC AGGCCGCCCG GCGCTCATGG CGCATCGGCC AGAAGCAGCC GGTGCGCGTG
ATCTACCTCG GCTACGCCAA CTCCTCGCAG ATGACCTGCC TGGGGTTGAT GGCCCGGAAG
ATCATGGTGT CGCAATCCAC CTCGGGCGAC GTTCCTGAAT CCGGGCTCGA TGTCCTGAAC
CAGGATGGCG ACTCGGTGGA GGTGGCACTG GCTCGGCAGT TGGTGCATTG A
 
Protein sequence
MSLDLETVPE TAVQGDLLET AASPLTLSLQ DFVSEFGDEL LDSLNRANPP VYTGQVRVHR 
QLILAALKRK LFPAQADVVH AVTELLVDRG ERAAIVNGEM GCGKTTVGIA TAAVLNAEGY
RRTLVLSPPH LVYKWRREIQ ETVAGAKVWV LNGPDTLVKL LKLREQLGVP AQGQEFFVLG
RVRMRMGFHW KPIFVRRRTP HGDVGACPGC GHVITDLDGE PVNTIDLEAE ESRRKCSHCR
APLWSLIRPR GLSASDQSSS VLKALKRIPT IGEVTAQKLM QKFGDAFLAS MLGDNIHEFI
NLMDGNGELV FSDRQAHRME RAMANMEFGF GEGGYQPSEF IKRQLPQGTF DLLIADEAHE
YKNGGSAQGQ AMGVLAAKAR KTLLLTGTLM GGYGDDLFHL LFRALPGRMI EDGYRPTKSG
SMTSAAMAFM RDHGVLKDIY SESTGTAHKT AKGTKVSVRT VKAPGFGPKG VLRCVLPFTV
FLKLKDIGGN VLPPYDEEFR EVAMDTAQAA AYRDLAGRLT QELKQALAKR DTTLLGVVLN
VLLAWPDCCF RSETVVHPRT RNTLAFVPAQ FNGLEVMPKE RELIEICKQE KAEGRKTLVY
SVYTGTRDTT SRLKVLLEQE GFKVAVLRAS VDASRREDWI AEQLDRGIDV LITNPELVKT
GLDLLEFPTI VFLQSGYNVY SLQQAARRSW RIGQKQPVRV IYLGYANSSQ MTCLGLMARK
IMVSQSTSGD VPESGLDVLN QDGDSVEVAL ARQLVH