Gene Tmz1t_0642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0642 
Symbol 
ID7084580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp727333 
End bp728484 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID643697668 
Producttransglutaminase domain protein 
Protein accessionYP_002354310 
Protein GI217969076 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCGCC GCAACTTCCT GCGCGCCGGC ACGCTCGCGG CCGGCCTCGC GCTGCCCGCG 
CTCGCGCGCG CCCAACAATC CGCCGCGACC ACGGCGACCA CGCCGCTCGC CACCCCCGCC
CCGCTCGCCG AGCCCCCCGC CTTCGCGCCC ACGCCGGACG CCGGCTGGCG CCGCTTCGAG
TTGACGAGCC GCGTCGAACC CCTGCCCGGC GACGGTCCGC TGCGCGTGTG GGTGCCGCTG
CCGGCGATGC ACGAGACCGC CTGGCAGCGC CCGATGGGCA GCCTGTGGCA GGGCAACGCC
GACCTCATGG AACGTGTGCG CGACCGCTCC GGCGCCGAGA TGGTCTACGC CGAGTGGGCG
CCGGGCATCG CCCAGCCCCG GCTCGAGATC CTCAGCCGCT TCGCCACCCG CGACCGCGCC
ATCGACTTCA GCCAGCCCAG CGCCGCGCCC CAGGTACTGC CGCGCGCCGA ACGCATGCAC
TACCTGCGCG CCACCGCGCT GCTGCCCACC GACGGCATCG TGCGCGATAC CGCACGCGAC
ATCGTCCACG GCGCGAAGAC CGACGAGGAC AAGGCGCGCG CCATCTACGA ATGGATCGTG
GACAACACCT TCCGCGAGCC CAAGGTGCGC GGCTGCGGCA TCGGCGACAT CCGCACGATG
CTGGAGACCG GCAACCTCGC GGGCAAGTGC GCCGACCTCA ACGCCCTCTT CGTCGGCCTG
GCGCGCGCAG CCGGGCTGCC GGCACGCGAC GTCTACGGCC TGCGCGTCGC CGACTCGCGC
TTCGGCTACA AGAGCCTGGG CAAGAGCGGC AACGTCTCCA AGGCCCAGCA CTGCCGAGCC
GAGGTCTTCC TGGAGCGCTT CGGCTGGGTG CCGGTGGACC CCGCGGACGT GCGCAAGGTC
GTCCTGGAGG AACCGCCGGG CAAGCTCTCC ATGGTCGATC CCAAGGTCGC CGCGGTGCGC
AAGCAGCTCT TCGGCGCCTG GGAGATGAAC TGGCTGGCCT ACAACGACGC CCACGACCTG
CGCCTGCCCG GCAGCACCGG CAGCGAGATC CCCTTCCTGA TGTACCCCCA GGGCGAACTC
GCCGGCCAGC GCTTCGACAG CCTGGACCCC GACGCCTTCA GCTACACGCT GAGCGCACGG
GAAATTGCCT GA
 
Protein sequence
MNRRNFLRAG TLAAGLALPA LARAQQSAAT TATTPLATPA PLAEPPAFAP TPDAGWRRFE 
LTSRVEPLPG DGPLRVWVPL PAMHETAWQR PMGSLWQGNA DLMERVRDRS GAEMVYAEWA
PGIAQPRLEI LSRFATRDRA IDFSQPSAAP QVLPRAERMH YLRATALLPT DGIVRDTARD
IVHGAKTDED KARAIYEWIV DNTFREPKVR GCGIGDIRTM LETGNLAGKC ADLNALFVGL
ARAAGLPARD VYGLRVADSR FGYKSLGKSG NVSKAQHCRA EVFLERFGWV PVDPADVRKV
VLEEPPGKLS MVDPKVAAVR KQLFGAWEMN WLAYNDAHDL RLPGSTGSEI PFLMYPQGEL
AGQRFDSLDP DAFSYTLSAR EIA