Gene Tmz1t_3592 details

Gene Information

Locus tag: Tmz1t_3592
Symbol:
ID: 7873097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3938806
End bp: 3941010
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 64%
IMG OID: 643700532
Product: protein of unknown function DUF181
Protein accession: YP_002890562
Protein GI: 237654248
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain; [TIGR03549] conserved hypothetical protein TIGR03549
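
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
span = gene_span(3938806, 3941010)
print(span)                           # 2205
print(expected_protein_length(span))  # 734
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.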


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0680194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAATCA AGGTCAACTT CCTCGACAAG CTTCGCCTCG AAGCCAGGTT CGACGACTTC 
ACGGTGATCG CCGATCAGCC GATCCGTTAC AAGGGCGACG GCTCGGCGCC GGGGCCGTTC
GACTATTTCC TGGCCTCGTC GGCCTTGTGC GCGGCCTACT TCGTGAAGCT GTACTGCGAC
ACCCGCAACC TCCCCACCGA CAACATCCGC CTGTCGCAGA ACAACATCGT CGACCCCGAG
AACCGCTACA AGCAGATCTT CAAGATCCAG GTCGAGCTGC CGGAGGACCT CTCCGCCAAG
GACCGCCAGG GCATCCTGCG CTCGATCGAG CGCTGCACCG TGAAGAAGGT GGTGCAGACC
GGGCCGGAGT TCGTCATCGA GGAGGTGGAG AACCTGGACG CCGATGCGCA GGCCCTGCTG
ACGCTGAATC CGGATCCGGA CGCCCACACC TACATCCTCG GCAAGGATCT GCCGCTGGAG
CAGACGATCG CCAACATGTC GAAGCTGCTG GCGGACCTCG GCATCAAGAT CGAGATCGCG
TCGTGGCGCA ACCTCGTGCC CAACGTGTGG TCGCTGCACA TCCGCGATGC GCATTCGCCG
ATGTGCTTCA CCAACGGCAA GGGGGCGAGC AAGGAGAGCG CCTTGGCGTC GGCGCTGGGC
GAATACATCG AGCGCCTCAA CTGCAACCAC TTCTACAACG ACCAGTTCTG GGGCGAGGAC
ATCGCCGACG CGGCGTTCGT GCATTACCCC AACGAGCGCT GGTTCAAGCC CGGCCGCAAG
GATGCGCTGC CGGCCGGGCT GCTCGACGAG TATTGCCTGG AGATCTACGA CCCCGAGGGC
GAGCTGCGCG CCTCGCATCT GTACGACACC AACTCCGGCA ACGTGGCGCG CGGCATCTGC
GCGCTGCCCT ACGTGCGCCA GTCGGACGGC GAGGTGGTGT ATTTCCCCAC CAACCTGATC
GACAACCTCT ACCTCAGCAA CGGCATGAGC GCCGGCAACA CGCTGCCCGA GGCGCAGGTG
CAGTGCCTGT CGGAGATCTT CGAGCGCGCG GTCAAGCGCG AGATCATCGA AGGCGAGATG
GCGCTGCCCG ACGTGCCGCC GCAGGTGCTG GCGAAGTATC CCGGCATCGT GGCCGGCATC
CAGGGGCTGG AGGCGCAGGG CTTTCCGGTG CTGGTGAAGG ATGCGTCGCT GGGCGGCAAG
TATCCGGTGA TGTGCGTGAC CCTGATGAAC CCGCGCACGG GCGGCGTGTT CGCCTCGTTC
GGGGCGCACC CGAGCCTGGA GGTGGCGCTG GAGCGCAGCC TGACCGAGCT GCTGCAGGGG
CGCAGCTTCG AGGGCCTCAA CGACCTGCCC GCGCCCACCT TCGAGAGCAA CGCGGTCACC
GAGCCGAATA ATTTCGTCGA GCACTTCATC GATTCGAGCG GCGTGGTGTC GTGGCGCTTC
TTCAGCGCCA AGGCCGATTA CCCGTTCGTC GAATGGGATT TCTCTTCCCA CGGTGAAAGG
TCCAACGCCG AGGAAGCCGC GACCCTGTTC GGCATCCTCG AGGACATGGG CAAGGAAGTG
TACATGGCGG TGTTCGACCA GCTCGGCGCC ACCGTCTGCC GCATCGTCGT GCCGGGGTAT
TCCGAGGTCT ATCCGGTGGA TGACCTGATC TGGAACAACA CCAACAAGGC GCTGGCCTTC
CGCGCCGACA TCCTCAACCT GCATCGCCTC GACGATGAGG CCCTCGAAGC CCTGCTCGAG
CGCCTGGAAG AAAGCGAGCT CGACGATTAC ACCGACATCA TCGAGCTGAT CGGCATCGAG
TTCGACGAAA ACACGGTGTG GGGTCAGCTC ACGATCCTGG AGCTGAAGCT GCTGATCCAT
CTCGCCCTGC AGCAATTGGA AGAGGCGAAG GAGCGCGTCG AGGCCTTCCT GCAATACAAC
GACAACACGG TCGATCGCGT GCTGTTCTAC CAGGCCCTGA ACGTGGTGCT GGAGGTGATG
CTGGACGACG AGCTGGCGCT GGAAGACTAC GAGGCCAACT TCCGCCGCAT GTTCGGCGAC
GCGCGCATGG ACGCGGTGAT CGGCTCGGTG GAGGGCCGCG TGCGCTTCCA TGGCCTGACG
CCGACGAGCA TGAAGCTCGA AGGGCTCGAC CGCCACCAGC GCCTGATCGA CAGCTACCGC
AAGCTGCACC GGGCGCGGGC ACGCGCGGCC GGCGTGTCCG CGTAG
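
The GC content reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch, not the method used by the source database, which is not stated in this record. The function name `gc_content` is hypothetical, and the sketch assumes that whitespace and any characters other than A, C, G and T should be ignored.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases of a sequence."""
    # Keep only unambiguous bases; this drops the spaces and line breaks
    # used to group the sequence into blocks of ten.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the complete gene sequence (all lines) as a single string
# and compare the rounded result with the GC content field of the record.
print(gc_content("ATGC"))  # 50.0
```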
 
Protein sequence
MEIKVNFLDK LRLEARFDDF TVIADQPIRY KGDGSAPGPF DYFLASSALC AAYFVKLYCD 
TRNLPTDNIR LSQNNIVDPE NRYKQIFKIQ VELPEDLSAK DRQGILRSIE RCTVKKVVQT
GPEFVIEEVE NLDADAQALL TLNPDPDAHT YILGKDLPLE QTIANMSKLL ADLGIKIEIA
SWRNLVPNVW SLHIRDAHSP MCFTNGKGAS KESALASALG EYIERLNCNH FYNDQFWGED
IADAAFVHYP NERWFKPGRK DALPAGLLDE YCLEIYDPEG ELRASHLYDT NSGNVARGIC
ALPYVRQSDG EVVYFPTNLI DNLYLSNGMS AGNTLPEAQV QCLSEIFERA VKREIIEGEM
ALPDVPPQVL AKYPGIVAGI QGLEAQGFPV LVKDASLGGK YPVMCVTLMN PRTGGVFASF
GAHPSLEVAL ERSLTELLQG RSFEGLNDLP APTFESNAVT EPNNFVEHFI DSSGVVSWRF
FSAKADYPFV EWDFSSHGER SNAEEAATLF GILEDMGKEV YMAVFDQLGA TVCRIVVPGY
SEVYPVDDLI WNNTNKALAF RADILNLHRL DDEALEALLE RLEESELDDY TDIIELIGIE
FDENTVWGQL TILELKLLIH LALQQLEEAK ERVEAFLQYN DNTVDRVLFY QALNVVLEVM
LDDELALEDY EANFRRMFGD ARMDAVIGSV EGRVRFHGLT PTSMKLEGLD RHQRLIDSYR
KLHRARARAA GVSA
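
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch under stated assumptions. Translation table 11 assigns codons to amino acids in the same way as the standard genetic code and differs in the set of permitted start codons; since this gene starts with ATG, the standard assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch assumes the sequence is given in the coding orientation and contains only A, C, G and T.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; same assignments as table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGGAAATCA AGGTCAACTT CCTCGACAAG"))
# MEIKVNFLDK
print(translate("AAGCTGCACC GGGCGCGGGC ACGCGCGGCC GGCGTGTCCG CGTAG"))
# KLHRARARAAGVSA
```

Both results match the corresponding ends of the protein sequence listed above. The last line can be translated on its own because every preceding line holds 60 bases, so it begins in frame.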