Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0090 |
Symbol | |
ID | 7083473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 102550 |
End bp | 104247 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643697137 |
Product | protein of unknown function DUF342 |
Protein accession | YP_002353786 |
Protein GI | 217968552 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGC AGCCCACGGA AGGCCCGGAA CTGGAGGTGT CGTTCGACGA ACTCACGCGC GTCCTGAGCG TGAGCATCGC CCATGACCCG CATTTTCCGC GCATCGACGC CTTGTGGCTG CGCCGACGGC TCGAAGCCGC GGGCTACGCC GACCTGCAGA TCCGCCCCGA CCCGATCCGC CGCCTGATCG CCCAGTACAA CGCCGGCGAG GCGGTGGCGC CGGTCGAGAT CGCGCAGTGC GTGGATGCGT CGATGCAGAT CGGAGTCTCG CTGGACGGAC TGGTCGCACG CCTGAGCATC GTGCCGGCCA AGGGTGGCAA GCCGGCGAGC AAGACCGAAC TGCTCGCCCT GATCGAATCG CGCGGCATCG TCGAGGGCCT GCTGCTGGAG GAGATGAACC GCGCCATCGC CGACGGTCAG GCCGACGACC TCGCCATCGC GCGCGGCCGG GAACCGGAAC CGGGCAAGGA CGGCTGGCTC GAGTACCTGC TGCCGGAGAC GCGCGAGCGC GTACCCAGCG TGCGCCCCTG CGGCCGCACC GACTACCGCG ACCTCGGCGA GATCCTCGTC GTCCACGCCG GCGATGCGCT GATGCGCCGC CATCCCCCGC AGCCCGGAGT CGACGGCGTC AATGTGTACG GCCGGCCGAT CGTGGCGCGG CGCGGTCGCG AGCAGCGCTT CGCCCCCGGC CTGCGCGGCA CCGCGATCTC GCCCGAGGAC CCCGAGCTGC TCGTGGCTGC CTGCGACGGC CAGCCGGTGC GGGTGCGCAA CGGCGCGATG GTGGAGCCGA TCTTCACCGT GGATGCGGTC AACCTCGCCA CCGGCAACAT CGACTTCGAC GGCAGCGTGC GCATCCGCAA CGATGTGCAG GCCGGGATGA CGGTGCGCGC CAGCGGCGAC ATCGAGGTCG GCGGCGTGGT CGAGCCGGCC ACGCTGGAGG CCGGCGGCAG CATCGTGGTC AAGGGCGGCG TGCTCGGCGG GCTGGGCGGC AAGACCGCCG GCAAGGATTA CAGCGCGCAC GCGATCCGCT GCGAGGGCAG CTTCTGCGCC ACCTACGCGC AGCAGGCGCG CATCAGCGCG GGCGACTCGA TCTTCATCGA CGACGTCGCC ATGCAGTGCC AGCTCGAGGC ACGCAACCAC ATCCGCGTAG GCAAGCGCCT GCGCGGCCAG ATCGTCGGCG GCCATTGCCG CGCCAGCCTG TCCATCCACG CGCGCACGAT CGGCGCGAAC AGCCGCATCC GCACCGAGCT CGAGATCGGC ATGGACAACG GGCTGGAACA CGCCATCCAG GAGAAGGCCG AGGCGCGCGA CGCACTCGAG AACCGCTTGC TCGAGATCGG CAAGATGCTC ACCTTCGCCG ACCGCCATCC CGATCGCGTG ACACCCGAGA TGCTGGGACG TGCCGAGCAG ACGGCGAGCG CGCTGTCGGG CGAGATCGAG AGCTTGCGCA GCGAGGAGGA GGATCTGCAG CACCGCCTCG CGCTCACCCG CGAGGCACGG GTGAATGCGG AGCGCGAGAT GTTCGAGGGC TGCATCGTGC GCATGGGCGA GCAGCTGTTC AAGCTGTCGC AGGACCGCGG GCCGACCACG GTGCGACTGG CCACCCAGGG GCTGGGCGTG TTCCCGCTCG AGGACGACAG CCGCTTCGAC GAGCCGCAGC GCCCGGCGGC GGCCAGCCAG GGCTCGTCCC GGCGCTGA
|
Protein sequence | MDKQPTEGPE LEVSFDELTR VLSVSIAHDP HFPRIDALWL RRRLEAAGYA DLQIRPDPIR RLIAQYNAGE AVAPVEIAQC VDASMQIGVS LDGLVARLSI VPAKGGKPAS KTELLALIES RGIVEGLLLE EMNRAIADGQ ADDLAIARGR EPEPGKDGWL EYLLPETRER VPSVRPCGRT DYRDLGEILV VHAGDALMRR HPPQPGVDGV NVYGRPIVAR RGREQRFAPG LRGTAISPED PELLVAACDG QPVRVRNGAM VEPIFTVDAV NLATGNIDFD GSVRIRNDVQ AGMTVRASGD IEVGGVVEPA TLEAGGSIVV KGGVLGGLGG KTAGKDYSAH AIRCEGSFCA TYAQQARISA GDSIFIDDVA MQCQLEARNH IRVGKRLRGQ IVGGHCRASL SIHARTIGAN SRIRTELEIG MDNGLEHAIQ EKAEARDALE NRLLEIGKML TFADRHPDRV TPEMLGRAEQ TASALSGEIE SLRSEEEDLQ HRLALTREAR VNAEREMFEG CIVRMGEQLF KLSQDRGPTT VRLATQGLGV FPLEDDSRFD EPQRPAAASQ GSSRR
|
| |