Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3011 |
Symbol | |
ID | 7874400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3262227 |
End bp | 3264245 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643699932 |
Product | transglutaminase domain protein |
Protein accession | YP_002889986 |
Protein GI | 237653672 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.340664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGCGG AGCGCCCGCC GCGCCTGGGC GGCCTCCTGC ACCGCAGCGG CCCGCCGCGC CGCACGCGCG CCGCGCGCCC GACGCCGACG CTGCGGCGCG ACCAGGGTGT CTGGCTGCTC GCCGCGGCGG CGCTGGCGAT CGCCCCGCAT GCGGTCTGGC TGCCGGGATG GGTCGCGGCG CTCGGGCTCG GCCTGCTCGC CTGGCGCGCG CTGCTGCTGT GGCGCGGCAG CCGGCCGCCT CCGCGCGCGC TGCTGTTCGC GCTCGCGCTC GCCGCCGCCG CGGGGGTGCG ACTGGAGTTC GGGCACTTCT TCGGCCGCGA GCCGGGCGTG GCGGTACTGG TCCTGCTGCT CGGCCTCAAG CTGCTGGAGA CGCGCGCGGC GCGTGACATC CGCGCCGGGG TGCTGCTGTC CCTGTTCCTG CAGCTCGCGA TCTTCTTCGA GGATCAGTCC CTGCCGGTGG CCGTGCCGGT CCTCGCCGGC ACCCTGCTCG CACTCGGCGC CCTGGTGGCG CTCGCCGACC CGGACGGTGG CGAGCGCGAG CGCCTGCGCA CCGCGGCCAC CCTGCTCGCC CAGGGCCTGC CCTTCATGCT CATCCTGTTC GTGCTCTTCC CGCGCATCCA GGGCCCGCTG TGGGGCCTGC CGGCCGACGC CTTCTCGGCG CGCACCGGGC TATCGGACAC GATGCGACCG GGCTCGATCA GCGCCCTCGG ACAATCCGAC GAGATCGCCC TGCGCGCCGC CTTCGCCGGC GCGCCGCCAC CACCCGCGCA GCGCTACTGG CGCGGCCCGG TGCTCACCCG CTTCGACGGC CGCGAGTGGC ACGCCGAGGC AGCCGCCGAG TCCTTCGCAC CCAGCTACAC ACCACAGGGC GAACGCATCG ACTACCTGAT CACCATCGAG CCGCACCTGC GCCGCTGGCT GCTCGCCCTC GAACACCCCG GGCCTGCGCA GCCGCCGATC CGCTACACCG GCGACCTGCG CGCGCTCGCC GCAGAGCCCC TGCGGGCGCG AGCGCGCTTC ACGCTCGGCG CGTATCCGCA CACGCCGGTC GGCATGGACG AGGCGCCCGC GGTGCTTGCC GCGGCCACCG CCCTGCCCGC GGAGAGCAAC CCCCGCAGCC GCCGGCTCGC CGCCGAACTC GCCGTCGGTG CGCGCGACCA CGCCGAGATC CTGGAGCGCG TGCTCGCCCG CCTGCGCGCC CTGCGCCTGG GCTATACGCT GCGTCCACCC ATGCTCGGCC GCCACGCCGC CGACGAGTTC CTGTTCGACA CCCGGCGCGG CTTCTGCGAG CACTTCGCCT CCGCCTTCGC CGTGCTGATG CGCGCCGCAG GCGTGCCGAC GCGGATCGTC ACCGGCTACC AGGGCGGCGA CATCAACCCG ATCGACGGTC AGCTCGTGGT TCGCCAGTCC GACGCCCACG CCTGGGCCGA AGTCTGGCTG CAGGGACGCG GCTGGTTGCG AGTCGACCCA ACCGCCCTCG CCGCCCCCGA GCGCATCGAT GGCGGGCTGG CCGCGGCGCT CGCCGACGCG GGCGAGCTAC CGTTCATGCT GCGCGCCGAC ATGGCCTGGC TGCGCGGCCT GCGCCACCGC TGGGAGGCCG TCGCGAACCT GTGGAACCAG CACGTCCTTG GCTACAATCC CGAGCGTCAG CGCGAACTCC TCGCCCGCAT CGGCCTCGGC ACGGGCAGAC TGGCGCCGCC CCTGGGGGCG CTCGTCGCCA CAGCGGTGCT GTTGTTCGCC GCCCTCTATG CCTGGAGCCT GCGCCGCCCG CACGTGCGCG ACCCGCTCAC GTGCACCTGG GAACGCTTCT GCGCGAAGAT GGCCGCCGCT GGCGCGGCCC GTCCGGCCTG GCAGGGACCG CAGGACTATG CGGACGAACT GGCCGCACGC TTCCCGGCGC ATGCCTCGGA ACTACGCGGC ATCTGCATGC TCTATGCCCG CCTGCGCTAC GGACCGCCCG CCCCGGAGGA GCAACTCCGG CTCCTGTACA ACCGCATCGC CTCACTGCGC CTCGAATGA
|
Protein sequence | MSAERPPRLG GLLHRSGPPR RTRAARPTPT LRRDQGVWLL AAAALAIAPH AVWLPGWVAA LGLGLLAWRA LLLWRGSRPP PRALLFALAL AAAAGVRLEF GHFFGREPGV AVLVLLLGLK LLETRAARDI RAGVLLSLFL QLAIFFEDQS LPVAVPVLAG TLLALGALVA LADPDGGERE RLRTAATLLA QGLPFMLILF VLFPRIQGPL WGLPADAFSA RTGLSDTMRP GSISALGQSD EIALRAAFAG APPPPAQRYW RGPVLTRFDG REWHAEAAAE SFAPSYTPQG ERIDYLITIE PHLRRWLLAL EHPGPAQPPI RYTGDLRALA AEPLRARARF TLGAYPHTPV GMDEAPAVLA AATALPAESN PRSRRLAAEL AVGARDHAEI LERVLARLRA LRLGYTLRPP MLGRHAADEF LFDTRRGFCE HFASAFAVLM RAAGVPTRIV TGYQGGDINP IDGQLVVRQS DAHAWAEVWL QGRGWLRVDP TALAAPERID GGLAAALADA GELPFMLRAD MAWLRGLRHR WEAVANLWNQ HVLGYNPERQ RELLARIGLG TGRLAPPLGA LVATAVLLFA ALYAWSLRRP HVRDPLTCTW ERFCAKMAAA GAARPAWQGP QDYADELAAR FPAHASELRG ICMLYARLRY GPPAPEEQLR LLYNRIASLR LE
|
| |