Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3717 |
Symbol | |
ID | 7873716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4083466 |
End bp | 4084389 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643700663 |
Product | DNA binding domain protein, excisionase family |
Protein accession | YP_002890687 |
Protein GI | 237654373 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1910] Periplasmic molybdate-binding protein/domain |
TIGRFAM ID | [TIGR01764] DNA binding domain, excisionase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.222458 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGC GCCCCGCGCA GCCGGCAGGC ACCGACGCGG CATCCGCGAC CTGCCTCAAC GCACGCGAGG CCGCGGCCTT CCTGCAGCTC AACGAGAAGA AGCTCTACGA ACTCGCCAAC AGCCGCGAAA TCCCCGCCGC ACGCGTGGGC GGCAAGTGGC TGTTCCCGCG CGCACTGCTC GAGGAGTGGC TGCTCGAGCA GGCGCATGGC GGCGCGCTCA GCGACCGGCT GGTGATCACC GGCAGCGACG ACCCGCTGCT GGCCGCCACG GTGGGCGCGC TCGCTCCCGT GCTGGGGGGC GATGCCTTCG TCGCCTACAG CCCCACCGGC ACCCTGCCCG GGCTCGAGCT GCTCGCGCGT CGCCGCGCCA GCGTGTGTGC GCTGCACTGG GGCGGGGTGG AGCACAGCCT GCAGCAGCAC GCCATGCTGC TGCGCCGCTT CGCCCCGCAT CGCCATTGGG CGCTGGTGCG CCTGGCGCTG CGCGAGCAGG GCGTGATCCT GCGCCGCGGC CTGGCGGTCG ACGGCATCGA GACGCTCGCC GCCTTCGACT ATCGCTGGGC GATGCGCCAG GCCGGCGCGG GTTCGGCGCA CTTTCTCGAG TCCGTGCTCG CCAGCCGCGG TTTCTCGGCG GCCGACTGCA GCGTGGTCGC CACCGCCCGC AGCGAGCGCG AGGCCGCGGC CCTGGTCGCA CGCGAGGACG CCGACTGCGC GCCCGGCACG CGCGCCGCAG CGACCGAGTT CGGCCTCGGC TTCCTGCCCT TGGGCTGGGA GGCCTTCGAC CTCGCGCTGC CGCGCGACGT GATGTTCCGC CGCCTCTTCC AGGATCTGCT CGCCGCGCAC GGCGACGCGC GCTCTCAGGC GCTGGCGCAC CGGCTCGGCG GCTACGACCT CGGCCCGCTC GGGCGCGTGC TCGGCCTCGA CTGA
|
Protein sequence | MSARPAQPAG TDAASATCLN AREAAAFLQL NEKKLYELAN SREIPAARVG GKWLFPRALL EEWLLEQAHG GALSDRLVIT GSDDPLLAAT VGALAPVLGG DAFVAYSPTG TLPGLELLAR RRASVCALHW GGVEHSLQQH AMLLRRFAPH RHWALVRLAL REQGVILRRG LAVDGIETLA AFDYRWAMRQ AGAGSAHFLE SVLASRGFSA ADCSVVATAR SEREAAALVA REDADCAPGT RAAATEFGLG FLPLGWEAFD LALPRDVMFR RLFQDLLAAH GDARSQALAH RLGGYDLGPL GRVLGLD
|
| |