Gene Tmz1t_3044 details


Gene Information

Locus tag: Tmz1t_3044
Symbol:
ID: 7874514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3297919
End bp: 3299043
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 68%
IMG OID: 643699967
Product: chaperone protein DnaJ
Protein accession: YP_002890019
Protein GI: 237653705
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain
TIGRFAM ID: [TIGR02349] chaperone protein DnaJ
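
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical, not part of the record.

```python
# Values copied from the record above.
start_bp = 3297919
end_bp = 3299043

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that yields no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1125 374"
```

Under these assumptions the results agree with the reported Gene Length (1125 bp) and Protein Length (374 aa).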


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.952099
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAAAC GCGATTACTA CGAAGTCCTG GGCGTCAACC GCGACGCCGG CGACGACGAG 
ATCAAGAAGG CCTACCGCAA GCTGGCCATG AAGTTTCATC CGGACCGCAA TCCGGACAAC
AAGGAAGCCG AGGAGAAGTT CAAGGAGGCC AAGGAGGCCT ACGAGATGCT CTCCGACCCG
CAGAAGAAGG CTGCCTACGA CCGCTACGGC CACGCCGGCG TCGATCCGTC GATGGGCGCG
GGCCCCGGCG CGCAGGGCTT CGACGGCTTC GCCGACGCCT TCGGCGACAT CTTCGGCGAC
CTCTTCGGGG GCGGCGGACG CGGCGGGCGC TCAAACGTCT ATCGCGGCGC CGACCTGCGC
TACAACCTCG AGATCACCCT GGAAGAGGCC GCGCGCGGCG CCGAGAAGAC GATCCGCATC
CCCACCGTCG AGGAGTGCGG CACCTGCCAC GGCAGCGGCG CCAAGCCCGG CACCCATCCC
AAACCCTGCC CGACCTGCCA GGGCCACGGC CAGGTGCGCG TGCAGCAAGG CTTCTTCTCG
ATCCAGCAGA CCTGCCCGAA GTGCCACGGC AGCGGCAAGA TCATCCCCGA CCCGTGCCGC
GACTGCGGCG GCGCCGGCCG CACCAAGAAG CAGAAGACGC TCGAGGTGAA GATCCCCGCC
GGCATCGACG ACGGCATGCG CCTGCGCCAC GCCGGCCACG GCGAGCCCGG CCTCAACGGC
GGCCCGCCGG GCGACCTCTA CGTCGAGATC CACATCCGCA AGCACGCGGT GTTCGAGCGC
GACCACGACG ACCTGCACTG CGAGATGCCG ATCAGCATCA CCACCGCGGC GCTCGGCGGC
GAGATCGAGA TCCCGACGCT GGAAGGCATG GCGCGGCTGA AGATCCCCGC GGAGACGCAG
AGCGGCAAGG TCTTCCGGCT GCGCGGCAAG GGCATCAAGA ACGTGCGCAG CCACGTGCAC
GGCGACCTGA TGTGCCACGT GGTGGTCGAG ACCCCGGTGA ACCTGACCGA GCGTCAGAAG
GAGTTGCTGC GCGAGTTCGA GGAGAGCGCC AGCGGCAACG CCACCCGCCA CAACCCCAAG
GCGCAGGGGT GGATGGACAA GGTGCGGGAC TTCTTCGGCG GCTGA
 
Protein sequence
MSKRDYYEVL GVNRDAGDDE IKKAYRKLAM KFHPDRNPDN KEAEEKFKEA KEAYEMLSDP 
QKKAAYDRYG HAGVDPSMGA GPGAQGFDGF ADAFGDIFGD LFGGGGRGGR SNVYRGADLR
YNLEITLEEA ARGAEKTIRI PTVEECGTCH GSGAKPGTHP KPCPTCQGHG QVRVQQGFFS
IQQTCPKCHG SGKIIPDPCR DCGGAGRTKK QKTLEVKIPA GIDDGMRLRH AGHGEPGLNG
GPPGDLYVEI HIRKHAVFER DHDDLHCEMP ISITTAALGG EIEIPTLEGM ARLKIPAETQ
SGKVFRLRGK GIKNVRSHVH GDLMCHVVVE TPVNLTERQK ELLREFEESA SGNATRHNPK
AQGWMDKVRD FFGG
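
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it assumes the codon-to-amino-acid assignments of NCBI translation table 11 (which match the standard table for sense and stop codons), it ignores the table's alternative start codons (the gene here begins with ATG, so that does not matter), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of a DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 if empty)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGTCCAAAC GCGATTACTA CGAAGTCCTG"))  # prints "MSKRDYYEVL"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared against the reported GC content.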