Gene Tmz1t_3902 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3902 
Symbol 
ID7873550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4299378 
End bp4301372 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content56% 
IMG OID643700841 
Producthydrolase (HAD superfamily)-like protein 
Protein accessionYP_002890864 
Protein GI237654550 
COG category[R] General function prediction only 
COG ID[COG5610] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGCGC ACCCACGCGA CCTGTTTTAT GAATTGGGTC TGCGTGTCGC CCCTGCGTCC 
AGCGGAGAGT CGGCGCGCGA ACGCTTTGCG CGGCGTTTCC AGCGCGCTCG CATTCGGGCA
GAAAAACTTG CCAACTGGAG TGCGCGCCGC AGGGGGCACG CGCACGCATC GATCGACGCC
ATCTACCAGA AGGTTAACTG GTTTATACGT CTGGATCGAC CAGCCGCCGA ACTGATCGAG
GCTGAACTGA CGCTGGAGGA GGAAAGCCTT TACCCTATCG AAGAAACCAT GCTTCAGCTG
AATGCCTTAC GTGCAGCTGG GCATCGCATC CTGTTCATTT CCGACATGTA CATTCCTGCG
TCAATGCTGC GCCCCATGCT TGAGCGCATG GGCGTAATGG AGGAAGGCGA CCGGTTGTAT
GTGTCATGTG ATATTGGCGT CTCGAAGCAC AATGGAAAGC TGTTCCAACA TGTCCTGCAA
GCCGAGGGGC TGAGGGGAGA GCAACTGCAG CACACCGGCG ACAACGCTCA TGCAGACATT
CGAATGGCCG AAAAACTGGG CATCCTGACC CGGCATTTCA CCGCAGCCCA TTTGACTGAA
CACGAGACGC GGATCGCGGG TTCCCGGCTG CCCCGTCATC CTGGCGCGTC GCGGCAGGCC
GGTTTCAGCC GGCGCTGCAG GCTGGCCATG CATTTGATCC ACGGCAATCC AACTCATGTG
CTGGATGACG TGATCTTCAG TGTAATCGTA CCGTTTCTGC TGGCCTATGT CCTATGGATA
CTGGATGACG CGCGAAAGCG CGGCATACAA CGACTGTATT TTGTCGCCCG CGATGGTGAG
GTGTTGCTCA AGATCGCCCG CGAACTGAAA CCCGATGGTA TCGAGCTACG CTATCTGTAT
GGATCAAGGC GTGCCTGGCT GCCTCCGTCC ATTTCAACTG ACGATGTCGA TTGGAAGCGT
TTGCTTGCAG TAGCAGGGAA CGCAAATGCA CCTGTCGACA TCACGGCGCG CGCTGGATTG
AGTGAGGTCG AACAAGCCAG TGTCCGGGTC ATTCTGAGTC TGGACGAAAA TACCTGGCGC
ACGGCGCTGG CATTTGAAGA TGCATGCACA TTCATCGACA CACTGACAGC AAACCCGCGA
TCCAGAAAAG TTCTATTGGA TTCGGTAGCT GCCAAGCGAG AAGCCGCACT GCACTATTTA
CGCCAGGAAG GCCTTATGGA CGGGGTCAAC TGGGCCTTGG TGGATGCGGG CTGGTCGCTG
AACGGACAAG CCGCGCTGAA ACGCATGCTC TCAACCGTTT CGCCATCCAG TCACCAAATT
CAAGGCTATT ACATCGGCCT TGCTCGTGAT TGCTTGCCCG AAGCTCGCGC AGGCAGGGCC
TATGCCTTCT GCCCTCCCCC TGGCAGCATT TTTTCTCGCC GTCGCGTAGT GCTCGAGCAC
TGTTTTCTGC CAGCCAGTCA CGCAAGTACG CGCAGCTATT TTCTCAAGGG AGGCCACGCA
ACTCCTGATT TCAGCGCTGA CTCCCGGAAC AACGAAGAAC TCGAATACGC TCTTCGACTT
CATACCGTAG CTTTAGCGTC AGCAAGATTG CTGAAGCAAC AGCCGGCAAT CGGCGACTTG
ATGCGAAGGT TCCGTACTCA ACTGACGAAT TCGGCCGCCG GATTCATCTG CGCCCCTGGC
GTGCAAGATG CAATTGCCTA TTCTATGCTT ACTGCGGTAG CCGACATGCG ACAGGAACGC
GAGTTTTCCC GGAGGTTGTG CCGACCGCTG TCTTTGGCCG ATGTGTGGAC GACGATGGGC
ATGGCTTTTT CAAGGAGGAT GGCCTTTAAA TCGCCTGCCT GGATGTGGCT GGAGGGATCG
ATCGCCCTTT CACCTTCATA CGTCGCCTTT CCCCTGAGGT TCATGCTTCG GATCGATGAC
TTTCTGAACA GAATCAAGTC ACTGCGTTTC CCACGGTGCA AATCTTCATC GCGGATCGAT
CAAGGCAAGA CCTGA
 
Protein sequence
MYAHPRDLFY ELGLRVAPAS SGESARERFA RRFQRARIRA EKLANWSARR RGHAHASIDA 
IYQKVNWFIR LDRPAAELIE AELTLEEESL YPIEETMLQL NALRAAGHRI LFISDMYIPA
SMLRPMLERM GVMEEGDRLY VSCDIGVSKH NGKLFQHVLQ AEGLRGEQLQ HTGDNAHADI
RMAEKLGILT RHFTAAHLTE HETRIAGSRL PRHPGASRQA GFSRRCRLAM HLIHGNPTHV
LDDVIFSVIV PFLLAYVLWI LDDARKRGIQ RLYFVARDGE VLLKIARELK PDGIELRYLY
GSRRAWLPPS ISTDDVDWKR LLAVAGNANA PVDITARAGL SEVEQASVRV ILSLDENTWR
TALAFEDACT FIDTLTANPR SRKVLLDSVA AKREAALHYL RQEGLMDGVN WALVDAGWSL
NGQAALKRML STVSPSSHQI QGYYIGLARD CLPEARAGRA YAFCPPPGSI FSRRRVVLEH
CFLPASHAST RSYFLKGGHA TPDFSADSRN NEELEYALRL HTVALASARL LKQQPAIGDL
MRRFRTQLTN SAAGFICAPG VQDAIAYSML TAVADMRQER EFSRRLCRPL SLADVWTTMG
MAFSRRMAFK SPAWMWLEGS IALSPSYVAF PLRFMLRIDD FLNRIKSLRF PRCKSSSRID
QGKT