Gene Information |
Locus tag | Tmz1t_3592 |
Symbol | |
ID | 7873097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3938806 |
End bp | 3941010 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643700532 |
Product | protein of unknown function DUF181 |
Protein accession | YP_002890562 |
Protein GI | 237654248 |
COG category | [S] Function unknown |
COG ID | [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03549] conserved hypothetical protein TIGR03549 |
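The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and uses variable names of its own choosing. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly.

```python
# Values copied from the Gene Information table.
start_bp = 3938806
end_bp = 3941010
gene_length_bp = 2205
protein_length_aa = 734

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted as an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: the span is 2205 bp, which is 735 codons, giving 734 residues plus the stop codon.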
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0680194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCA AGGTCAACTT CCTCGACAAG CTTCGCCTCG AAGCCAGGTT CGACGACTTC ACGGTGATCG CCGATCAGCC GATCCGTTAC AAGGGCGACG GCTCGGCGCC GGGGCCGTTC GACTATTTCC TGGCCTCGTC GGCCTTGTGC GCGGCCTACT TCGTGAAGCT GTACTGCGAC ACCCGCAACC TCCCCACCGA CAACATCCGC CTGTCGCAGA ACAACATCGT CGACCCCGAG AACCGCTACA AGCAGATCTT CAAGATCCAG GTCGAGCTGC CGGAGGACCT CTCCGCCAAG GACCGCCAGG GCATCCTGCG CTCGATCGAG CGCTGCACCG TGAAGAAGGT GGTGCAGACC GGGCCGGAGT TCGTCATCGA GGAGGTGGAG AACCTGGACG CCGATGCGCA GGCCCTGCTG ACGCTGAATC CGGATCCGGA CGCCCACACC TACATCCTCG GCAAGGATCT GCCGCTGGAG CAGACGATCG CCAACATGTC GAAGCTGCTG GCGGACCTCG GCATCAAGAT CGAGATCGCG TCGTGGCGCA ACCTCGTGCC CAACGTGTGG TCGCTGCACA TCCGCGATGC GCATTCGCCG ATGTGCTTCA CCAACGGCAA GGGGGCGAGC AAGGAGAGCG CCTTGGCGTC GGCGCTGGGC GAATACATCG AGCGCCTCAA CTGCAACCAC TTCTACAACG ACCAGTTCTG GGGCGAGGAC ATCGCCGACG CGGCGTTCGT GCATTACCCC AACGAGCGCT GGTTCAAGCC CGGCCGCAAG GATGCGCTGC CGGCCGGGCT GCTCGACGAG TATTGCCTGG AGATCTACGA CCCCGAGGGC GAGCTGCGCG CCTCGCATCT GTACGACACC AACTCCGGCA ACGTGGCGCG CGGCATCTGC GCGCTGCCCT ACGTGCGCCA GTCGGACGGC GAGGTGGTGT ATTTCCCCAC CAACCTGATC GACAACCTCT ACCTCAGCAA CGGCATGAGC GCCGGCAACA CGCTGCCCGA GGCGCAGGTG CAGTGCCTGT CGGAGATCTT CGAGCGCGCG GTCAAGCGCG AGATCATCGA AGGCGAGATG GCGCTGCCCG ACGTGCCGCC GCAGGTGCTG GCGAAGTATC CCGGCATCGT GGCCGGCATC CAGGGGCTGG AGGCGCAGGG CTTTCCGGTG CTGGTGAAGG ATGCGTCGCT GGGCGGCAAG TATCCGGTGA TGTGCGTGAC CCTGATGAAC CCGCGCACGG GCGGCGTGTT CGCCTCGTTC GGGGCGCACC CGAGCCTGGA GGTGGCGCTG GAGCGCAGCC TGACCGAGCT GCTGCAGGGG CGCAGCTTCG AGGGCCTCAA CGACCTGCCC GCGCCCACCT TCGAGAGCAA CGCGGTCACC GAGCCGAATA ATTTCGTCGA GCACTTCATC GATTCGAGCG GCGTGGTGTC GTGGCGCTTC TTCAGCGCCA AGGCCGATTA CCCGTTCGTC GAATGGGATT TCTCTTCCCA CGGTGAAAGG TCCAACGCCG AGGAAGCCGC GACCCTGTTC GGCATCCTCG AGGACATGGG CAAGGAAGTG TACATGGCGG TGTTCGACCA GCTCGGCGCC ACCGTCTGCC GCATCGTCGT GCCGGGGTAT TCCGAGGTCT ATCCGGTGGA TGACCTGATC TGGAACAACA CCAACAAGGC GCTGGCCTTC CGCGCCGACA TCCTCAACCT GCATCGCCTC GACGATGAGG CCCTCGAAGC CCTGCTCGAG CGCCTGGAAG AAAGCGAGCT CGACGATTAC ACCGACATCA TCGAGCTGAT CGGCATCGAG 
TTCGACGAAA ACACGGTGTG GGGTCAGCTC ACGATCCTGG AGCTGAAGCT GCTGATCCAT CTCGCCCTGC AGCAATTGGA AGAGGCGAAG GAGCGCGTCG AGGCCTTCCT GCAATACAAC GACAACACGG TCGATCGCGT GCTGTTCTAC CAGGCCCTGA ACGTGGTGCT GGAGGTGATG CTGGACGACG AGCTGGCGCT GGAAGACTAC GAGGCCAACT TCCGCCGCAT GTTCGGCGAC GCGCGCATGG ACGCGGTGAT CGGCTCGGTG GAGGGCCGCG TGCGCTTCCA TGGCCTGACG CCGACGAGCA TGAAGCTCGA AGGGCTCGAC CGCCACCAGC GCCTGATCGA CAGCTACCGC AAGCTGCACC GGGCGCGGGC ACGCGCGGCC GGCGTGTCCG CGTAG
|
Protein sequence | MEIKVNFLDK LRLEARFDDF TVIADQPIRY KGDGSAPGPF DYFLASSALC AAYFVKLYCD TRNLPTDNIR LSQNNIVDPE NRYKQIFKIQ VELPEDLSAK DRQGILRSIE RCTVKKVVQT GPEFVIEEVE NLDADAQALL TLNPDPDAHT YILGKDLPLE QTIANMSKLL ADLGIKIEIA SWRNLVPNVW SLHIRDAHSP MCFTNGKGAS KESALASALG EYIERLNCNH FYNDQFWGED IADAAFVHYP NERWFKPGRK DALPAGLLDE YCLEIYDPEG ELRASHLYDT NSGNVARGIC ALPYVRQSDG EVVYFPTNLI DNLYLSNGMS AGNTLPEAQV QCLSEIFERA VKREIIEGEM ALPDVPPQVL AKYPGIVAGI QGLEAQGFPV LVKDASLGGK YPVMCVTLMN PRTGGVFASF GAHPSLEVAL ERSLTELLQG RSFEGLNDLP APTFESNAVT EPNNFVEHFI DSSGVVSWRF FSAKADYPFV EWDFSSHGER SNAEEAATLF GILEDMGKEV YMAVFDQLGA TVCRIVVPGY SEVYPVDDLI WNNTNKALAF RADILNLHRL DDEALEALLE RLEESELDDY TDIIELIGIE FDENTVWGQL TILELKLLIH LALQQLEEAK ERVEAFLQYN DNTVDRVLFY QALNVVLEVM LDDELALEDY EANFRRMFGD ARMDAVIGSV EGRVRFHGLT PTSMKLEGLD RHQRLIDSYR KLHRARARAA GVSA
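As a quick sanity check that the gene sequence is listed in coding orientation (the gene is on the minus strand, but the sequence starts with ATG), the first 30 nucleotides can be translated and compared with the first ten residues of the protein. This is a minimal sketch: the codon dictionary is a hand-picked subset covering only the codons in this prefix, not a full implementation of translation table 11, and the names `PARTIAL_CODON_TABLE` and `translate_prefix` are illustrative.

```python
# First three 10-nt blocks of the gene sequence, as printed in the record.
gene_prefix = "ATGGAAATCA AGGTCAACTT CCTCGACAAG"

# First block of the protein sequence, as printed in the record.
protein_prefix = "MEIKVNFLDK"

# Partial codon table: only the codons that occur in gene_prefix.
# These assignments are the same in the standard code and in table 11.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GAA": "E", "ATC": "I", "AAG": "K", "GTC": "V",
    "AAC": "N", "TTC": "F", "CTC": "L", "GAC": "D",
}

def translate_prefix(dna: str) -> str:
    """Translate a short in-frame DNA string using the partial table."""
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    return "".join(PARTIAL_CODON_TABLE[c] for c in codons)

translated = translate_prefix(gene_prefix)
print(translated)
print(translated == protein_prefix)
```

For a full-length check, a complete codon table (for example from a bioinformatics library) would be needed in place of the partial dictionary.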
|
| |