Gene Tmz1t_3011 details

Gene Information

Locus tag: Tmz1t_3011
Symbol:
ID: 7874400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 3262227
End bp: 3264245
Gene Length: 2019 bp
Protein Length: 672 aa
Translation table: 11
GC content: 75%
IMG OID: 643699932
Product: transglutaminase domain protein
Protein accession: YP_002889986
Protein GI: 237653672
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
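
The start, end, gene length, and protein length fields above are consistent with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a CDS whose last codon is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3262227, 3264245)
print(length_bp)                  # 2019
print(protein_length(length_bp))  # 672
```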


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.340664
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGCGG AGCGCCCGCC GCGCCTGGGC GGCCTCCTGC ACCGCAGCGG CCCGCCGCGC 
CGCACGCGCG CCGCGCGCCC GACGCCGACG CTGCGGCGCG ACCAGGGTGT CTGGCTGCTC
GCCGCGGCGG CGCTGGCGAT CGCCCCGCAT GCGGTCTGGC TGCCGGGATG GGTCGCGGCG
CTCGGGCTCG GCCTGCTCGC CTGGCGCGCG CTGCTGCTGT GGCGCGGCAG CCGGCCGCCT
CCGCGCGCGC TGCTGTTCGC GCTCGCGCTC GCCGCCGCCG CGGGGGTGCG ACTGGAGTTC
GGGCACTTCT TCGGCCGCGA GCCGGGCGTG GCGGTACTGG TCCTGCTGCT CGGCCTCAAG
CTGCTGGAGA CGCGCGCGGC GCGTGACATC CGCGCCGGGG TGCTGCTGTC CCTGTTCCTG
CAGCTCGCGA TCTTCTTCGA GGATCAGTCC CTGCCGGTGG CCGTGCCGGT CCTCGCCGGC
ACCCTGCTCG CACTCGGCGC CCTGGTGGCG CTCGCCGACC CGGACGGTGG CGAGCGCGAG
CGCCTGCGCA CCGCGGCCAC CCTGCTCGCC CAGGGCCTGC CCTTCATGCT CATCCTGTTC
GTGCTCTTCC CGCGCATCCA GGGCCCGCTG TGGGGCCTGC CGGCCGACGC CTTCTCGGCG
CGCACCGGGC TATCGGACAC GATGCGACCG GGCTCGATCA GCGCCCTCGG ACAATCCGAC
GAGATCGCCC TGCGCGCCGC CTTCGCCGGC GCGCCGCCAC CACCCGCGCA GCGCTACTGG
CGCGGCCCGG TGCTCACCCG CTTCGACGGC CGCGAGTGGC ACGCCGAGGC AGCCGCCGAG
TCCTTCGCAC CCAGCTACAC ACCACAGGGC GAACGCATCG ACTACCTGAT CACCATCGAG
CCGCACCTGC GCCGCTGGCT GCTCGCCCTC GAACACCCCG GGCCTGCGCA GCCGCCGATC
CGCTACACCG GCGACCTGCG CGCGCTCGCC GCAGAGCCCC TGCGGGCGCG AGCGCGCTTC
ACGCTCGGCG CGTATCCGCA CACGCCGGTC GGCATGGACG AGGCGCCCGC GGTGCTTGCC
GCGGCCACCG CCCTGCCCGC GGAGAGCAAC CCCCGCAGCC GCCGGCTCGC CGCCGAACTC
GCCGTCGGTG CGCGCGACCA CGCCGAGATC CTGGAGCGCG TGCTCGCCCG CCTGCGCGCC
CTGCGCCTGG GCTATACGCT GCGTCCACCC ATGCTCGGCC GCCACGCCGC CGACGAGTTC
CTGTTCGACA CCCGGCGCGG CTTCTGCGAG CACTTCGCCT CCGCCTTCGC CGTGCTGATG
CGCGCCGCAG GCGTGCCGAC GCGGATCGTC ACCGGCTACC AGGGCGGCGA CATCAACCCG
ATCGACGGTC AGCTCGTGGT TCGCCAGTCC GACGCCCACG CCTGGGCCGA AGTCTGGCTG
CAGGGACGCG GCTGGTTGCG AGTCGACCCA ACCGCCCTCG CCGCCCCCGA GCGCATCGAT
GGCGGGCTGG CCGCGGCGCT CGCCGACGCG GGCGAGCTAC CGTTCATGCT GCGCGCCGAC
ATGGCCTGGC TGCGCGGCCT GCGCCACCGC TGGGAGGCCG TCGCGAACCT GTGGAACCAG
CACGTCCTTG GCTACAATCC CGAGCGTCAG CGCGAACTCC TCGCCCGCAT CGGCCTCGGC
ACGGGCAGAC TGGCGCCGCC CCTGGGGGCG CTCGTCGCCA CAGCGGTGCT GTTGTTCGCC
GCCCTCTATG CCTGGAGCCT GCGCCGCCCG CACGTGCGCG ACCCGCTCAC GTGCACCTGG
GAACGCTTCT GCGCGAAGAT GGCCGCCGCT GGCGCGGCCC GTCCGGCCTG GCAGGGACCG
CAGGACTATG CGGACGAACT GGCCGCACGC TTCCCGGCGC ATGCCTCGGA ACTACGCGGC
ATCTGCATGC TCTATGCCCG CCTGCGCTAC GGACCGCCCG CCCCGGAGGA GCAACTCCGG
CTCCTGTACA ACCGCATCGC CTCACTGCGC CTCGAATGA
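
The gene sequence is listed in blocks of ten bases, so it has to be joined before its length or base composition can be computed. The sketch below is one way to do that and to compute a GC fraction. The helper names are hypothetical, and the example input is only the first line of the listing, so its value is not the whole-gene figure. Applied to the full 2019 bp listing, the result can be compared with the GC content field of the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGAGCGCGG AGCGCCCGCC GCGCCTGGGC GGCCTCCTGC ACCGCAGCGG CCCGCCGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```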
 
Protein sequence
MSAERPPRLG GLLHRSGPPR RTRAARPTPT LRRDQGVWLL AAAALAIAPH AVWLPGWVAA 
LGLGLLAWRA LLLWRGSRPP PRALLFALAL AAAAGVRLEF GHFFGREPGV AVLVLLLGLK
LLETRAARDI RAGVLLSLFL QLAIFFEDQS LPVAVPVLAG TLLALGALVA LADPDGGERE
RLRTAATLLA QGLPFMLILF VLFPRIQGPL WGLPADAFSA RTGLSDTMRP GSISALGQSD
EIALRAAFAG APPPPAQRYW RGPVLTRFDG REWHAEAAAE SFAPSYTPQG ERIDYLITIE
PHLRRWLLAL EHPGPAQPPI RYTGDLRALA AEPLRARARF TLGAYPHTPV GMDEAPAVLA
AATALPAESN PRSRRLAAEL AVGARDHAEI LERVLARLRA LRLGYTLRPP MLGRHAADEF
LFDTRRGFCE HFASAFAVLM RAAGVPTRIV TGYQGGDINP IDGQLVVRQS DAHAWAEVWL
QGRGWLRVDP TALAAPERID GGLAAALADA GELPFMLRAD MAWLRGLRHR WEAVANLWNQ
HVLGYNPERQ RELLARIGLG TGRLAPPLGA LVATAVLLFA ALYAWSLRRP HVRDPLTCTW
ERFCAKMAAA GAARPAWQGP QDYADELAAR FPAHASELRG ICMLYARLRY GPPAPEEQLR
LLYNRIASLR LE
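
The gene begins with GTG while the protein begins with M. This follows from translation table 11, which uses the standard amino acid assignments but accepts several alternative start codons, each read as methionine when it initiates translation. The sketch below is a minimal pure-Python illustration, not the database's own pipeline. The function name and the `has_start` parameter are hypothetical, and the start codon set is the one listed for NCBI table 11. Ambiguous bases such as N are not handled and raise `KeyError`.

```python
BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str, has_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        # An initiating start codon is read as methionine.
        if pos == 0 and has_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate_cds("GTGAGCGCGG AGCGCCCGCC GCGCCTGGGC"))  # MSAERPPRLG
print(translate_cds("CTCGAATGA", has_start=False))        # LE
```

The two outputs match the first ten and last two residues of the protein sequence listed above.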