Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2513 |
Symbol | |
ID | 7873952 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2713928 |
End bp | 2715145 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643699435 |
Product | protein of unknown function DUF344 |
Protein accession | YP_002889492 |
Protein GI | 237653178 |
COG category | [S] Function unknown |
COG ID | [COG2326] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.197532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA CCGAGGAAAC CGTCCCCGTA GTGGACACCG CCTCCACCCG CAGCAAGAAG CCGCGCACGG CTCCGCGCGC CAAGGACGCG GGCGCCGCGC GCGCGAAGTC CAAGGCCGCC GGCCCCGCAC GTCCGATCGC GGCCGGCGAC ACCGCGCCCG ATTTCAGCAG CACGGGCATC GAGGCCTTCG AGATCGAGCA GGCGATCGAC GCCGCGCAGG ACTCCAAGCT CGTCGCGGTC AAGGACATCC TGGCCCGAGC GGCTGCGGAT CCGCAGGCCA AAAGGGAGGA CGCGCTCGAG GCCATCCTCG ACGGCGCCTC GCCCGACGAC ATCCACATGC TGCGCCGCGC GCTGATGCGC GCCGACCTGC CGCAGGCGCC GGGCGCCCAT CCGGACGACG AGCTCTCGCC CGACTGGCGC AAGGGCGGCT ACCCCTACCG CAACCTGATG TCGCGCAAGG CCTACGAAAA GCAGAAGTAC CGCCTGCAGG TCGAGTTGCT CAAGCTGCAG GCCTGGGTCA AGGAGACCGG TCAGCGCGTG GTGATCCTTT TCGAGGGCCG TGACGCAGCC GGCAAGGGCG GCACGATCAA GCGCTTCATG GAGCACCTCA ACCCGCGCGG CGCGCGCGTG GTGGCGCTGG AGAAGCCCTC CGAGCTCGAG CGCGGCCAAT GGTACTTCCA GCGCTACATC CAGCACCTGC CCACCGCCGG CGAGATCGTG CTGTTCGACC GCTCCTGGTA CAACCGCGCC GGCGTCGAGC GCGTCATGGG CTTCTGCACG CCCGACGAGT ACAACGAATT CATGCGCCAG GTCCCCGAGT TCGAGCGCAA CCTGGTGCGC AGCGGCATCC ACCTGATCAA GTTCTGGTTC TCGGTGAGCC GCGAGGAGCA GCGTCGCCGC TTCAAGGAGC GCGAGTCGCA CCCGCTCAAG CAGTGGAAGC TGTCGCCGAT CGACCTCGCC TCGCTCGACA AGTGGGACGA ATACACCAAG GCCAAGGAGG CGATGTTCTT CTACACCGAC ACCGCCGACG CGCCATGGAC CGTAGTCAAG TCCGACTGCA AGAAGCGAGC GCGCCTCAAC GCGCTGCGCT ACGTGCTGCA CAAGCTGCCC TATGCCAACA AGGATGTCTC CCTCATCGGC CCGCTCGATC CCTTGCTGGT AGGCCGTGCC AACGTGGTGT ACGAGCGCGG CGAGAAGGAC GCCGTGGCGC TGCTGTAA
|
Protein sequence | MNKTEETVPV VDTASTRSKK PRTAPRAKDA GAARAKSKAA GPARPIAAGD TAPDFSSTGI EAFEIEQAID AAQDSKLVAV KDILARAAAD PQAKREDALE AILDGASPDD IHMLRRALMR ADLPQAPGAH PDDELSPDWR KGGYPYRNLM SRKAYEKQKY RLQVELLKLQ AWVKETGQRV VILFEGRDAA GKGGTIKRFM EHLNPRGARV VALEKPSELE RGQWYFQRYI QHLPTAGEIV LFDRSWYNRA GVERVMGFCT PDEYNEFMRQ VPEFERNLVR SGIHLIKFWF SVSREEQRRR FKERESHPLK QWKLSPIDLA SLDKWDEYTK AKEAMFFYTD TADAPWTVVK SDCKKRARLN ALRYVLHKLP YANKDVSLIG PLDPLLVGRA NVVYERGEKD AVALL
|
| |