Gene Information |
Locus tag | Tmz1t_2301 |
Symbol | |
ID | 7085286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2587687 |
End bp | 2589534 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699320 |
Product | peptidase M61 domain protein |
Protein accession | YP_002355936 |
Protein GI | 217970702 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
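The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2587687
END_BP = 2589534
GENE_LENGTH_BP = 1848
PROTEIN_LENGTH_AA = 615


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one (assumes the CDS ends with a stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(computed_length)

# Compare the computed values with the ones listed in the record.
print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length and protein length are consistent with the listed coordinates.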
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.9294 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCC GTTCCACCCG CTTGTCCGCC GCCGCGTCGG CGCTCCATGG CCGTCCGCCG CCCGCGGTCG AATACCGCAT CCGCCCGGCG AACCCCGGCG CCCACCTCTT CGAGGTGAGC TGCACGGTGG CCGAGCCCGA CCCCGCGGGG CAGGTCTTCA GCCTGCCGGC GTGGATCCCG GGCAGCTACA TGATCCGCGA GTTCGCGCGC AACATCGTGC GCCTGCGTGC CGAGGCCGAC GGCGAGCCCT GCGCGCTGGA GAAGCTCGAC AAGCACACCT GGCGCGCGGC TGCGGTGCCG GGCGCGCGCG TGCTGAGCGT GCATTACGAG GTCTATGCCT GGGACCTGTC GGTGCGCACC GCCCATCTCG ACACCACACA CGGCTTCTTC AACGGCACGA GCGTCTTCCT CGCGGTGGCG GGGCGTACCG AAGCACCTTG CGTGGTGACC ATCGAACGCC CCGAGGGCGA CTGCGGGCGC GACTGGAAGC TCGTCACCGC GCTGCCGCCC GAGCACGGCC ACCCGGGCCA GGCCTGCCGC TTCGGCCGCT TCCGCGCCGC GGACTACGAC GAGCTCATCG ACCACCCGGT CGAGATGGGC CGTTTCACGC TGGCACGCTT CGAGGCCGCG GGCGTGCCGC ACGACATCGC CCTCACCGGC CGCCACGACT GCGACCTCGA GCGCCTGTGC GCCGACCTGC GCCGGGTGTG CGAATGGCAG ATCGCGCTCT TCGGCACGCC GGCGCCGGTG GACTATTACG CCTTCCTGAC CATGGTGGTG GGAGAGGGCT ACGGCGGGCT GGAGCATCGC GCCTCGACCG CGCTGATCTG CAGCCGCGCC GAGCTGCCGT GGAAGGGCAT GGAGGGTCTG CCCGACGGCT ACAAGAGCTT CCTCGGCCTG TGCAGCCACG AGTATTTCCA CACCTGGAAC GTCAAGCGCA TCAAGCCGGT GGCGTTCACG CCCTACGATC TCGCGCGCGA GAACCACACC CGGCTGCTGT GGGCCTTCGA GGGCTTCACC TCGTATTACG ACGACCTCGC CCTGGTGCGC AGCGGCGTGA TCGGCATCGA TGACTACCTG GGGCTGCTCG GCAAGACCAT CGCCAACGTC TTGCGCGGCA GCGGCCGGCT CAAGCAGAGC GTGGCGGAGT CCTCCTTCGA CGCGTGGACC AAGTACTACC GCCAGGACGA GAACGCACCC AATGCCATCG TCAGCTACTA CGCCAAGGGT GCGCTGATCG CGCTCGCGCT CGACCTGCAG CTGCGCGCGG GCAGCGAGGG TGCGGCCAGC CTGGACGACG TGATGCGGCT GCTGTGGCGG CGCCACGGCC TCACCGGCGT GGGCGTGCCG GAGGATGGCA TCTTCGCCGC GGTGCGCGAC GCGGGCGGCG AACGCCTCGG CGCGCGCCTG GCGAAATGGC TGCAGAAGGC GGTGGACGGC TGCGAGGATC TGCCGCTGGC GCGCCTGCTG CGTCCCTTCG GCGTGAGCCT GCGCGCCGAG GCGGCGGGGA CCGCGCCGGT GCTCGGGATG AAGCTCGGCG GGGGCAGTGG CGAGGCGAAG GTCGCCAATG TGTACGACGA CGGTCCGGCG CAGGCGGCGG GCGTCTCGGC CGGGGACGTG CTGATCGCGC TCGACGGGCT GAGGATCTCC AGCGCCAAGG GGCTGGAGGA TCTGCTCGCC CGTCGTGGTG CGGGCGACGA GGTGGAACTG CATCTCTTCC GTCGCGACGA GCTGATGAGC TTCCGTGCGG TGCTCGCTGC ACCGCCTGCC GAGCGCCAGG AGCTCAAGCT GGCGCCGCGC 
GCCGATAGCG CGGCAGCGAA GCTGCGGCGG GGTTGGTTGG GGGGGTGA
|
Protein sequence | MSARSTRLSA AASALHGRPP PAVEYRIRPA NPGAHLFEVS CTVAEPDPAG QVFSLPAWIP GSYMIREFAR NIVRLRAEAD GEPCALEKLD KHTWRAAAVP GARVLSVHYE VYAWDLSVRT AHLDTTHGFF NGTSVFLAVA GRTEAPCVVT IERPEGDCGR DWKLVTALPP EHGHPGQACR FGRFRAADYD ELIDHPVEMG RFTLARFEAA GVPHDIALTG RHDCDLERLC ADLRRVCEWQ IALFGTPAPV DYYAFLTMVV GEGYGGLEHR ASTALICSRA ELPWKGMEGL PDGYKSFLGL CSHEYFHTWN VKRIKPVAFT PYDLARENHT RLLWAFEGFT SYYDDLALVR SGVIGIDDYL GLLGKTIANV LRGSGRLKQS VAESSFDAWT KYYRQDENAP NAIVSYYAKG ALIALALDLQ LRAGSEGAAS LDDVMRLLWR RHGLTGVGVP EDGIFAAVRD AGGERLGARL AKWLQKAVDG CEDLPLARLL RPFGVSLRAE AAGTAPVLGM KLGGGSGEAK VANVYDDGPA QAAGVSAGDV LIALDGLRIS SAKGLEDLLA RRGAGDEVEL HLFRRDELMS FRAVLAAPPA ERQELKLAPR ADSAAAKLRR GWLGG
|
| |
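The sequences above are shown in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch of how one might strip that formatting and compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon at the end.

    Simplification: only ATG is accepted as a start codon here.
    """
    seq = clean_sequence(raw)
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Toy input for illustration only; paste the full gene sequence from the
# record in its place to check it against the listed GC content.
toy_fragment = "ATG GCC TGA"
print(round(gc_percent(toy_fragment), 1), looks_like_complete_cds(toy_fragment))
```

Because `clean_sequence` discards all whitespace, a gene sequence that is wrapped across several lines can be passed in unchanged.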