Gene Information |
Locus tag | Tmz1t_0798 |
Symbol | |
ID | 7084190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 879856 |
End bp | 881538 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697822 |
Product | nitrite and sulphite reductase 4Fe-4S region |
Protein accession | YP_002354463 |
Protein GI | 217969229 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) |
TIGRFAM ID | |
|
|
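The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes 1-based inclusive coordinates (as the reported gene length implies) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(879856, 881538)
print(length)                  # 1683
print(protein_length(length))  # 560
```

Both results agree with the Gene Length (1683 bp) and Protein Length (560 aa) rows.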
Plasmid Coverage information |
Number of covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACCAAT ACGACGAACA CGACCAGCGC CTGCTCGACC AGCGCGTCGC CCAGTTCCGC GACCAGATGC GGCGTCACCT GGCCGGCGAG CTCTCGGCCG ACGAGTTCCG TCCGCTGCGC CTGCAGAACG GGCTGTACAT CCAGCGCCAC GCGCCGATGT TCCGCATCTC GGTGCCCTAC GGCCACCTCG CCAGCCGCCA GCTGAGGAAG CTCGCCCACC TCGCGCGCAG CTACGACCAC GGCTACGCCC ACTTCACCAC CCGCACCAAC GTGCAGTTCA ACTGGCCGAA GCTGGAAGAC GTGCCGGAGA TGCTCGCCGA ACTGGCCAGC GTGCAGATGC ACGCCAACCA GACCTCGGGC AACTGCATCC GCAACATCAC CGCCGACCCC TTCGCCGGCG TGGCCGCCGA CGAGGTCGCC GATCCGCGCC CCTTCGCCGA GATCCTGCGC CAGTGGGCGA CCTACAACCC GGAATTCGCC TTCCTGCCGC GCAAGTTCAA GATCGCCTTC AACAGCGCCG CCGAAGACCG CGTGGTGCTG CGGGTGTACG ACATCGGCCT CGACCTGGTC CGTAACGAGG CCGGCGAGCT CGGCTTCCGC GTGCTGGTGG GCGGCGGCCT CGGCCGCACC CCGATCCTGG GCGAGGAGAT CAAGCCCTTC CTGCCCTGGC AGCACCTGAT CAGCTACTGC GAGGCCATCC TGCGCGTCTA CAACCGCTGG GGCCGCCGCG ACAACCTGTG GAAGGCGCGC ATCAAGATCC TCGTCAAGGC GCTCGGGCCG GAAGAGTTCG GCCGCCAGGT GGAGGAAGAA TGGGCCAACG GCAAGGACGG CCCCAACACC ATCACCGCGG CGGAGCTGGC GCGCGTGTCG GCCCACTTCG CCCCGCCTGC CTACGAGACC CTGCCCGCGC AGGACGCCGC CTTCGACGAA CTCCTCGCCG GCAACAAGGC CTTCGCCACC TGGGTCAAGC GCAATGTGCG CGCCCACCGC CAGCCCGGCT ACGCGGCGGT GGTCCTGTCG CTGAAGAAGA CCGGCACCGC CCCGGGCGAC GTCACCGCCG AGCAGATGGA CCTGATCGCC GATCTCGCCG ACCGCTTCAG CTTCGGCGAG GCGCGCATGC TGCACGAGCA GAACGTCGCG CTCGCCGACG TGCGCCGCAC CGACCTCTTC GAACTGTGGC AGGCGGCGCG TGCCGCCGGC CTGGCGACGC CGAACATCGG CCTGCTGACC GACATCATCT GCTGCCCCGG CGGCGACTTC TGCAGCCTCG CCAACGCACG CTCGATCCCG ATCGCCGACG CCATCCAGCG CCGCTTCGAC GACCTCGACT ACCAGCACGA CATCGGCGAG ATCGACCTCA ACATTTCCGG CTGCATGAAC TCCTGCGGCC ACCACCACGT CGGCCACATC GGCATCCTCG GCGTCGATAA GAACGACGAG GAGTGGTACC AGGTCAGCAT CGGCGGCACA CAGGGCAACG GCACCACGCT CGGCAAGGTG ATCGGTCCGT CCTTCGCCCT CGACGAGATC GTGGGCGTGA TCGAGAAGCT GCTCGAAACC TACGTCGAAC TGCGCCATGA GGACGAGCGC TTCATCGACA CCGTACAACG CGTCGGCACC GAGCCCTTCA AGGCACGCGT GTATGCCGAC CGCCAGCCGC AAAGGAAAGC CGCCCATGTC TGA
|
Protein sequence | MYQYDEHDQR LLDQRVAQFR DQMRRHLAGE LSADEFRPLR LQNGLYIQRH APMFRISVPY GHLASRQLRK LAHLARSYDH GYAHFTTRTN VQFNWPKLED VPEMLAELAS VQMHANQTSG NCIRNITADP FAGVAADEVA DPRPFAEILR QWATYNPEFA FLPRKFKIAF NSAAEDRVVL RVYDIGLDLV RNEAGELGFR VLVGGGLGRT PILGEEIKPF LPWQHLISYC EAILRVYNRW GRRDNLWKAR IKILVKALGP EEFGRQVEEE WANGKDGPNT ITAAELARVS AHFAPPAYET LPAQDAAFDE LLAGNKAFAT WVKRNVRAHR QPGYAAVVLS LKKTGTAPGD VTAEQMDLIA DLADRFSFGE ARMLHEQNVA LADVRRTDLF ELWQAARAAG LATPNIGLLT DIICCPGGDF CSLANARSIP IADAIQRRFD DLDYQHDIGE IDLNISGCMN SCGHHHVGHI GILGVDKNDE EWYQVSIGGT QGNGTTLGKV IGPSFALDEI VGVIEKLLET YVELRHEDER FIDTVQRVGT EPFKARVYAD RQPQRKAAHV
|
| |
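As a consistency check on the two sequence rows, the sketch below translates short excerpts of the gene sequence and compares them with the matching ends of the protein sequence. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# Standard codon-to-amino-acid mapping (shared by translation table 11),
# built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 33 bases of the gene sequence in this record.
gene_head = "ATGTACCAAT ACGACGAACA CGACCAGCGC"
gene_tail = "CGCCAGCCGC AAAGGAAAGC CGCCCATGTC TGA"

print(translate(gene_head))  # MYQYDEHDQR
print(translate(gene_tail))  # RQPQRKAAHV
```

The two outputs match the first and last ten residues of the protein sequence row. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content row.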