Gene Information |
Locus tag | Tmz1t_1147 |
Symbol | |
ID | 7084676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1264823 |
End bp | 1267624 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698162 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_002354802 |
Protein GI | 217969568 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | |
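The coordinate and length fields above agree with one another, which a little arithmetic confirms. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in one uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1264823, 1267624)
print(length_bp)                  # 2802
print(protein_length(length_bp))  # 933
```

Both values match the Gene Length and Protein Length rows of the record.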
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000889263 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCATC CCCTGCCCGA ACCTGCCCGG CCTTCGCCCG CTGGCCGGCG CACGATCCCG ATCCGCGCCG CGGCGCCGGG CGAGACCCCT GCCACCTGTC CCTACTGCGG CGTCGGCTGC GGCGTGCTGA TCGAGCACGA CGGCACCCGC ATCACCGGCG TGCGCGGCGA CCCTGCACAT CCGGCCAACT TCGGCCGCCT GTGCACCAAG GGCTCGACCC TGCACCTCAC CGCCGGCCAC GATACCCGCC TGCTCTACCC CGAATACCGC GCCCGCCGCG GCGAGGCCCG CACGCGCATG GGCTGGGACG CTGCGATCGA CGTGGCCGCG CAGCGCTTCG CCGACGTGAT CGCCGAGCAT GGGCCGGACG CGGTCGCCTT CTACATCTCC GGCCAGCTCC TCACCGAGGA CTACTACGTC TTCAACAAGG CGATGAAGGG GCTGATCGGC AGCAACAACG TCGACACCAA CTCGCGCCTG TGCATGTCGA GCGCGGTCGC CGCCTACAAG GCCACGCTGG GTGCGGACGC GCCGCCGTGC TGCTACGAGG ACTTCGACCA CGCCGACACG ATCCTGATCG CCGGCGCCAA CCCGGCGTGG GCGCATCCGG TGGCCTTCCG CCGCATCGAG GAGGCGAGGG CGCGGCGACC GGGCATGAAG ATCGTCGTGG TGGATCCGCG CCGCACCGAC ACCGCGGCGA TGGCCGACCT GCACCTGGCC ATCCTGCCCG GCACCGACGT CTGGCTGTTC GACGCCATGC TGCACGTGCT GCTGCGCGAC GGGTTGGCCG ACGAAGCCTG GATCGCCGCC CACACCGAGG GTTTCGAGGC GCTGCGCGCG CATGTGCACG GGGTGAGCCC GGCGGTGGCC GCGGCGGTGT GCGGGGTGGC GGCGGAGGAC ATCGTCACCG CGGCGCGCTG GTGGGGCGAG GCGCCGGCGG CGCTGTCGCT GTGGTGCCAG GGGCTCAACC AGTCGACGCA CGGCACCCAC AACGGCGCCG CGCTGATCGC GCTTTCGCTC GCCACCGGCA AGATCGGCCG CCCCGGCTGC GGGCCGTTCT CGCTCACCGG CCAGCCCAAC GCGATGGGCG GGCGCGAGGT CGGCGGGCTG GCGAACCTGC TGCCGGCGCA TCGTGACCTG GCGAATCCCG CGCATCGCGC GGAGGTGGCG CGGCTGTGGG GCGTGGACAG CGTGCCTTCG GAGCCCGGCC TGACCGCGGT GGAGCTGTTC CGCGCGGCGC GCGAGGGCCG CGTCAAGGCG ATCTGGATCG CGTGCACCAA CCCGGCGCAG TCCATGCCCG AGCAGGCCCT GGTGCACGAG GCGCTGGCGG CCTGCGACTT CGTCGTGCTG CAGGAGGCCT TCGCCGGCAC CGAGACCGCG GCCTTCGCCG ACCTCGTGCT GCCGGCCGCC GCCTGGGGCG AGAAGGAGGG CACGGTGACC AACTCCGAGC GCCGCATCAG CCGCGTGCGC GCGGCCGTGC CGGCGCCGGG CGAGGCGCGT GCCGACTGGC GCATCGTGTG CGACTTCGCA CGCGCGCTCG CACCGCGCAT CGGCAAGCCG CAGGCGGCGG CGATGTTCGC CTACGCGTCG GCGCAGGCGA TCTTCGCCGA GCACGTCGCC AGCACCGCCG GGCGCGACCT CGACATCACC GGGCTGGACT ATTCCCTGCT GGAAGGTGAA GGCCCGCAGC AATGGCCGTT TCCGGCGGGA GCGGCGGTGG ATGGCGCCGA TGCCGCGGCG CGGGCGCGTC TCTACGCCGA CGGTCGCTTC GCGACGCCGA ACGGGCGCGC ACGCTTCGTC 
GTGCCGGCGC GCGCGCTCAC TGCCGAGAAG ACCGGCGTGC GCTATCCGCT CGCCCTCACC AGCGGCCGCC TGCGCGACCA GTGGCACGGC ATGAGCCGCA GCGGCAAGGT GGCGCGCCTG TACGGCCATG CGGACGAGGC ACGCATCGAG CTGCACCCGC AGGAGCTGGC GCGGCGCGGC CTCGTCGACG GCGACCTGGT CAAGGTGTCG AGCCGGCACG GCGAGCTGGT GCTGCGCGTG GCCGCGGCGA GCGCGCTGCG GCACGGCATG GCCTTCGTCG GGATGCACTG GGGGCGGCGG GCGCTGAACT CGGCCGGGAT CAACCTGCTC TTCGGCGGCG CCTGCGATCC GCTGTCGAAG CAGCCGGAGC TCAAGCACGC GGCGGTGCGC ATCGAGCCGG TCGAGCTGCC GAACCGCATG CTGGTGGTGC GTGCCGAGCG CCCCGGGCGT ACCGCCGCCG AGGAGGCTGC GCTGCTGTCG CCCTGGCTGG AGCGCTTCGC CTACGCCTCG CTCGCGCTCG CCGGCCGCGA CACGCCCGCG GTGGTGATGC GGGTCGCGCA CGACCGTCCG ATCCCGGCCG GCTGGCTCGC CGAGCTCGAC GCCTTGCTCG GCCTCGAGGG CGACGGGGTG CTGTCGTATG CCGACCCCGC GCGCGGGATC AGCAAGCGTG CGCTGATCGA GGACGGCCGC CTGGTCGGCC TGCGCCTGAC CGGCGAGACC GCGGCCGCGG GCTGGCTGAG CGAGGCGGTG GTCGAGCGCC GCGACGCGGC GGCGCTGCGT CCGTGGCTGC TGGCGCCGCT GGCGACCCCG CCGACAGGAG ATGCGGGGCG CGGGCGGGTG GTGTGCAACT GCCTGAACGT GTCCGAGGCC GACATCACGG CGGCGATCGC TGCGGGCGCC GGCGTACGCG ACTTCGACGC GCTGCAGGCC AGGCTGCGCT GCGGCACGTC CTGTGGTTCG TGCGTGCCCG AGATCCGCCG CATGCTCGCC GAGGGGAGCT GA
|
Protein sequence | MTHPLPEPAR PSPAGRRTIP IRAAAPGETP ATCPYCGVGC GVLIEHDGTR ITGVRGDPAH PANFGRLCTK GSTLHLTAGH DTRLLYPEYR ARRGEARTRM GWDAAIDVAA QRFADVIAEH GPDAVAFYIS GQLLTEDYYV FNKAMKGLIG SNNVDTNSRL CMSSAVAAYK ATLGADAPPC CYEDFDHADT ILIAGANPAW AHPVAFRRIE EARARRPGMK IVVVDPRRTD TAAMADLHLA ILPGTDVWLF DAMLHVLLRD GLADEAWIAA HTEGFEALRA HVHGVSPAVA AAVCGVAAED IVTAARWWGE APAALSLWCQ GLNQSTHGTH NGAALIALSL ATGKIGRPGC GPFSLTGQPN AMGGREVGGL ANLLPAHRDL ANPAHRAEVA RLWGVDSVPS EPGLTAVELF RAAREGRVKA IWIACTNPAQ SMPEQALVHE ALAACDFVVL QEAFAGTETA AFADLVLPAA AWGEKEGTVT NSERRISRVR AAVPAPGEAR ADWRIVCDFA RALAPRIGKP QAAAMFAYAS AQAIFAEHVA STAGRDLDIT GLDYSLLEGE GPQQWPFPAG AAVDGADAAA RARLYADGRF ATPNGRARFV VPARALTAEK TGVRYPLALT SGRLRDQWHG MSRSGKVARL YGHADEARIE LHPQELARRG LVDGDLVKVS SRHGELVLRV AAASALRHGM AFVGMHWGRR ALNSAGINLL FGGACDPLSK QPELKHAAVR IEPVELPNRM LVVRAERPGR TAAEEAALLS PWLERFAYAS LALAGRDTPA VVMRVAHDRP IPAGWLAELD ALLGLEGDGV LSYADPARGI SKRALIEDGR LVGLRLTGET AAAGWLSEAV VERRDAAALR PWLLAPLATP PTGDAGRGRV VCNCLNVSEA DITAAIAAGA GVRDFDALQA RLRCGTSCGS CVPEIRRMLA EGS
|
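The sequences above are shown in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned, translated, and summarised by GC fraction follows. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons), and every helper name is hypothetical. Only the first 30 bases and the last 12 bases of the record are used, so the GC value printed applies to that prefix and not to the whole gene.

```python
from itertools import product

# Codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


prefix = "ATGACCCATC CCCTGCCCGA ACCTGCCCGG"   # first three blocks of the gene
suffix = "GAGGGGAGCT GA"                      # last two blocks of the gene

print(translate(prefix))             # MTHPLPEPAR
print(translate(suffix))             # EGS*
print(f"{gc_content(prefix):.0%}")   # 70%
```

The translated prefix and suffix match the first ten and last three residues of the protein sequence in the record.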