Gene Tmz1t_3879 details

Gene Information

Locus tag: Tmz1t_3879
Symbol:
ID: 7873530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4278519
End bp: 4281995
Gene Length: 3477 bp
Protein Length: 1158 aa
Translation table: 11
GC content: 71%
IMG OID: 643700821
Product: transglutaminase domain protein
Protein accession: YP_002890844
Protein GI: 237654530
COG category: [S] Function unknown
COG ID: [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
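
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the only reading under which the listed gene length matches) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4278519
end_bp = 4281995

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.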


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCATCC ACGTCAAGCT CAACCACGTC ACCCGTTACC GCTACGACCG CCCGGTCGCG 
CTGTCGCCGC AGGTCGTGCG CCTGCGCCCG GCGCCGCACT GCCGCACGCC CATCCATGCC
TACGCGCTGA AGGTCGACCC CGCCGGCCAC TTCATCAACT GGCAGCAGGA CCCGCAGTCG
AACTATCTCG CCCGCCTCGT CTTCCCCGAC AAGGCTACCG AACTGCGCAT CGAGGTCGAC
CTGGTGGCCG AGCTGTCGGT CATCAACCCC TTCGATTTCT TCCTCGAACC CGCGGCGGAG
AAGATCCCCT TCGCCTACGA GGACGCCCTG CGCGTCGAGC TCGCGCCCTA CCTGGCGAAG
GCGCCGGCTG CCGCGCTGGG GCCGAAGTTC ATCGACTGCC TCGAATCCAT CCCGCGCGCG
CCGCGGGCCA GCGTGGATTT CCTGGTCGCG CTCAACCAGC GCCTGCAGCA GGACATCCGC
TACCTGGTGC GCCTGGAGCC GGGCGTGCAG ACGCCGGAAG AGACGCTGAC GCTGGCGAGC
GGCTCGTGCC GCGACTCGGC CTGGCTGCTG GTGCAGTTGC TGCGCCACCT CGGCCTGGCG
GCGCGCTTCG TGTCCGGCTA CCTGATCCAG CTCGTGCCGG ACGTGAAGTC GCTCGACGGC
CCCTCGGGCA CCGCGGTGGA CTTCACCGAC CTGCACGCCT GGTGCGAGGT CTATCTGCCC
GGTGCCGGCT GGGTGGGCCT GGATCCGACC TCCGGCTTGT TCGCCGGCGA GGGCCACATC
CCGCTCGCCT GCTCGCCCGA GCCCTCGTCG GCGGCGCCGA TCACCGGCTT CACCGATGAA
TGCGAATGCG AGTTCGAGCA CCACATGAAG GTCGAGCGCG TGTGGGAGGC GCCGCGCGTC
ACCAAGCCCT ATACCGACGA ACAGTGGGCG GCGATCGAGG TGCTGGGCCA GCGCATCGAC
GCCGACCTGA AGGCCCGCGA CGTGCGCCTC ACGCAAGGCG GCGAGCCCAC CTTCGTGGCA
CTCGACGACC GCGACGGCGC GGCCTGGAAC AGCGACGCGC TCGATCCGCA GAACCGTGAT
GCCAGCAAGC CGAGCAAGCG CACCCTGGCC GAAAACCTGA TGTTCCGCCT CAAGGACCAC
TATGCGCCGC ACGGCCTGCT GCACTTCGGC CAGGGCAAGT GGTATCCGGG CGAGCAGTTG
CCGCGCTGGT CGCTCAATTG CTACTGGCGG CGCGACGGCG AGCCGATCTG GCGCAAGCCC
GAGCTGTTCG CGCGCGAGCT CTCGGGCACT GCGGTGGACG AGGCCGTGGC GGCGCGCTTC
CTCACGCGCG TCGGCGAGCG CCTCGGCGTG GATACGCAGT GGATGATGGC GGCCTTCGAG
GACGCCTGGT ACTACCTGTG GCGCGAGCGC CGGCTGCCGG CCAACGTCGA TCCGCACGAT
GCGCGCGTCG ACGACCCGCT CGAACGCGCG CGGCTCGCCA AGGTCTTCGA CCAGGGGCTG
AGGCAGGTCG TCGGGCACGT GCTGCCGCTG GTGCGCGATC ACGTCGGCGC CGATCGCTGG
CAGAGCGGGC CGTGGTTCCT GCGTGCCGAG CGCCTGTACC TGATCCCCGG CGATTCGCCG
ATCGGCTACC GCCTGCCGCT CGATTCCCAG CCCTGGGCGG GCAAGGGCGA CCTGCCCTTG
ATCCACCCCG CCGACCCCAA CCAGCCCTTC CCGACGCTGC CCGCGCACGC CGAGCTGCGC
CAGCAGCTGC GGCCGACGGG CACGACGCTC GGCGCCGACA GCGGCCTCGG CTTCCCCTTC
GCCGCCCACG GCGGCGGGTG GACGCCGAGC CGTGTCGAAC ATGCCGGCTT CCATGGGGCG
CAGGGCGAGG CCGCGGTGCC GCGGCCCGGC GCGTCCGCGC TGGATCGCCG CGACGAGCGC
ACCGACCCCG CTCGCCGTCC GGCACCCTTC GAGTCCGCGG CCTGGATCAC TCGCAGCGCG
CTGTGCGCCG AGCCGCGCGG CGGCCTGCTC CATCTCTTCA TGCCCCCGAC TGCCGCGCTG
GAGGACTACC TCGAGATCGT CACCGCCATC GAGGACACCG CCGCCGAATT CAAGCTGCCG
GTGGTGCTCG AAGGCTACGA GCCGCCGTCC GACCCGCGCC TGGCTCACTT CCGCATCACC
CCCGACCCGG GCGTCATCGA GGTCAACATC CACCCCGCGG CGAGCTGGGA CGAACTCGTC
GAGCGCACCG GCACGCTCTA CGAGGAGGCT CGGCAGGCGC GCCTGTCGAC CGAGAAGTTC
ATGCTCGACG GCCGTCATAC CGGCACCGGC GGTGGCAACC ACTTCGTGCT CGGCGGTGCC
ACGCCGGGCG ACTCGCCCTT CCTGCGGCGG CCCGACCTGG TGCGCAGCCT GTTGAGCTAC
TGGCACAACC ACCCCAGCCT GTCCTACCTG TTCTCGGGGC TCTTCATCGG CCCGACCTCG
CAGGCGCCGC GGGTGGACGA AGCGCGCCAC GACTCGGTGC ACGAGCTCGA AGTCGCCTTC
CAGCAGATGC CGGAGCCGGG CATCCAGGTG CCGCCGTGGC TCATCGACCG CCTGCTGCGC
AACCTGCTGA TCGACGCCAG CGGCAACACC CACCGAGCCG AGTTCTGCAT CGACAAGCTC
TACAGCCCCG ACAGCGCCAC CGGGCGGCTG GGCCTGCTCG AGCTGCGCGC CTTCGAGATG
CCGCCGCACG CGCGCATGAG CCTGGCCCAG CAGCTGCTGC TGCGCGCGCT GATCGCGCGC
TTCTGGCAGC AGCCTTATGC CCCGGGTCGG CTGCGGCGCT GGGGTACCGA GCTGCACGAC
CGCTTCCTGC TGCCGCACTG GGTGGCGGAG GATTTCCGCG ACGTCGTCGG CGAGCTGCGC
GAATTCGGCT ACCCGCTCGA GTTCGACTGG TTCGCGCCGC ACTTCGAGTT CCGCTTCCCC
AGGTACGGCG AGTTCGCCGC GCGCGGCGTG CGGGTCGAGC TGCGCATGGC GCTCGAGCCC
TGGCACGTGA TGGGGGAGGA GGGGGCGCCC GGTGGCACGG TGCGCTACGT CGATTCCTCG
GTGGAGCGCC TGCAGGTCAA GGTGTCCGGG CTCAACGACG ACCGCCACGT GCTGACCTGC
AACGGCCGCG CCGTGCCGCT GCAGCCCACC GGCACGGTGG GCGAGTTCGT CGCCGGGGTG
CGCTACCGCG CCTGGCAGCC GCCGTCCTGC CTGCACCCGA CCATCGGCGT GCACGCGCCG
CTGGTGTTCG ACCTGGTCGA TGGCTGGATG CAGCGCTCGA TGGGCGGCTG CGTGTATCAC
GTGGCGCATC CGGGCGGGCG CAACCACGAG AGCTACCCGG TGAACCCCTA CGAGGCCGAA
GGCCGCCGCC TGGCCCGCTT CACCACCACC GGTCACACGC CGGGCCGCGT CGCACCCGCG
CCCGCCACGC GCAACGCCGA TTTCCCCTTC ACCCTCGACC TGCGCCGCCA GGGCTGA
 
Protein sequence
MSIHVKLNHV TRYRYDRPVA LSPQVVRLRP APHCRTPIHA YALKVDPAGH FINWQQDPQS 
NYLARLVFPD KATELRIEVD LVAELSVINP FDFFLEPAAE KIPFAYEDAL RVELAPYLAK
APAAALGPKF IDCLESIPRA PRASVDFLVA LNQRLQQDIR YLVRLEPGVQ TPEETLTLAS
GSCRDSAWLL VQLLRHLGLA ARFVSGYLIQ LVPDVKSLDG PSGTAVDFTD LHAWCEVYLP
GAGWVGLDPT SGLFAGEGHI PLACSPEPSS AAPITGFTDE CECEFEHHMK VERVWEAPRV
TKPYTDEQWA AIEVLGQRID ADLKARDVRL TQGGEPTFVA LDDRDGAAWN SDALDPQNRD
ASKPSKRTLA ENLMFRLKDH YAPHGLLHFG QGKWYPGEQL PRWSLNCYWR RDGEPIWRKP
ELFARELSGT AVDEAVAARF LTRVGERLGV DTQWMMAAFE DAWYYLWRER RLPANVDPHD
ARVDDPLERA RLAKVFDQGL RQVVGHVLPL VRDHVGADRW QSGPWFLRAE RLYLIPGDSP
IGYRLPLDSQ PWAGKGDLPL IHPADPNQPF PTLPAHAELR QQLRPTGTTL GADSGLGFPF
AAHGGGWTPS RVEHAGFHGA QGEAAVPRPG ASALDRRDER TDPARRPAPF ESAAWITRSA
LCAEPRGGLL HLFMPPTAAL EDYLEIVTAI EDTAAEFKLP VVLEGYEPPS DPRLAHFRIT
PDPGVIEVNI HPAASWDELV ERTGTLYEEA RQARLSTEKF MLDGRHTGTG GGNHFVLGGA
TPGDSPFLRR PDLVRSLLSY WHNHPSLSYL FSGLFIGPTS QAPRVDEARH DSVHELEVAF
QQMPEPGIQV PPWLIDRLLR NLLIDASGNT HRAEFCIDKL YSPDSATGRL GLLELRAFEM
PPHARMSLAQ QLLLRALIAR FWQQPYAPGR LRRWGTELHD RFLLPHWVAE DFRDVVGELR
EFGYPLEFDW FAPHFEFRFP RYGEFAARGV RVELRMALEP WHVMGEEGAP GGTVRYVDSS
VERLQVKVSG LNDDRHVLTC NGRAVPLQPT GTVGEFVAGV RYRAWQPPSC LHPTIGVHAP
LVFDLVDGWM QRSMGGCVYH VAHPGGRNHE SYPVNPYEAE GRRLARFTTT GHTPGRVAPA
PATRNADFPF TLDLRRQG
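
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial code), in which certain alternative start codons are read as methionine at the first position. The following is a minimal sketch of that relationship, not the database's own translation routine. The function name `translate_cds` is hypothetical, and the start-codon set is a deliberately small subset (ATG, GTG, TTG) chosen for illustration. The example translates only the first 30 bases of the gene sequence shown above.

```python
# Codon table built in the conventional T, C, A, G ordering. Table 11 assigns
# the same amino acids as the standard code and differs in the start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a small subset of start codons, used here for illustration only.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    # Drop the spaces that separate the 10-base groups in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
first_30_bases = "GTGTCCATCC ACGTCAAGCT CAACCACGTC"
print(translate_cds(first_30_bases))
```

The printed residues can be compared against the first ten residues of the protein sequence (MSIHVKLNHV).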