Gene Information |
Locus tag | Tmz1t_3902 |
Symbol | |
ID | 7873550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4299378 |
End bp | 4301372 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643700841 |
Product | hydrolase (HAD superfamily)-like protein |
Protein accession | YP_002890864 |
Protein GI | 237654550 |
COG category | [R] General function prediction only |
COG ID | [COG5610] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | |
|
|
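The length fields in the table above follow from the coordinates. The sketch below checks that arithmetic. It assumes that Start bp and End bp are 1-based and inclusive (consistent with the listed Gene Length) and that the CDS ends in a single stop codon that is not counted in the protein. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(4299378, 4301372)
print(length)                  # 1995
print(protein_length(length))  # 664
```

Both values agree with the Gene Length and Protein Length rows for Tmz1t_3902.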
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGCGC ACCCACGCGA CCTGTTTTAT GAATTGGGTC TGCGTGTCGC CCCTGCGTCC AGCGGAGAGT CGGCGCGCGA ACGCTTTGCG CGGCGTTTCC AGCGCGCTCG CATTCGGGCA GAAAAACTTG CCAACTGGAG TGCGCGCCGC AGGGGGCACG CGCACGCATC GATCGACGCC ATCTACCAGA AGGTTAACTG GTTTATACGT CTGGATCGAC CAGCCGCCGA ACTGATCGAG GCTGAACTGA CGCTGGAGGA GGAAAGCCTT TACCCTATCG AAGAAACCAT GCTTCAGCTG AATGCCTTAC GTGCAGCTGG GCATCGCATC CTGTTCATTT CCGACATGTA CATTCCTGCG TCAATGCTGC GCCCCATGCT TGAGCGCATG GGCGTAATGG AGGAAGGCGA CCGGTTGTAT GTGTCATGTG ATATTGGCGT CTCGAAGCAC AATGGAAAGC TGTTCCAACA TGTCCTGCAA GCCGAGGGGC TGAGGGGAGA GCAACTGCAG CACACCGGCG ACAACGCTCA TGCAGACATT CGAATGGCCG AAAAACTGGG CATCCTGACC CGGCATTTCA CCGCAGCCCA TTTGACTGAA CACGAGACGC GGATCGCGGG TTCCCGGCTG CCCCGTCATC CTGGCGCGTC GCGGCAGGCC GGTTTCAGCC GGCGCTGCAG GCTGGCCATG CATTTGATCC ACGGCAATCC AACTCATGTG CTGGATGACG TGATCTTCAG TGTAATCGTA CCGTTTCTGC TGGCCTATGT CCTATGGATA CTGGATGACG CGCGAAAGCG CGGCATACAA CGACTGTATT TTGTCGCCCG CGATGGTGAG GTGTTGCTCA AGATCGCCCG CGAACTGAAA CCCGATGGTA TCGAGCTACG CTATCTGTAT GGATCAAGGC GTGCCTGGCT GCCTCCGTCC ATTTCAACTG ACGATGTCGA TTGGAAGCGT TTGCTTGCAG TAGCAGGGAA CGCAAATGCA CCTGTCGACA TCACGGCGCG CGCTGGATTG AGTGAGGTCG AACAAGCCAG TGTCCGGGTC ATTCTGAGTC TGGACGAAAA TACCTGGCGC ACGGCGCTGG CATTTGAAGA TGCATGCACA TTCATCGACA CACTGACAGC AAACCCGCGA TCCAGAAAAG TTCTATTGGA TTCGGTAGCT GCCAAGCGAG AAGCCGCACT GCACTATTTA CGCCAGGAAG GCCTTATGGA CGGGGTCAAC TGGGCCTTGG TGGATGCGGG CTGGTCGCTG AACGGACAAG CCGCGCTGAA ACGCATGCTC TCAACCGTTT CGCCATCCAG TCACCAAATT CAAGGCTATT ACATCGGCCT TGCTCGTGAT TGCTTGCCCG AAGCTCGCGC AGGCAGGGCC TATGCCTTCT GCCCTCCCCC TGGCAGCATT TTTTCTCGCC GTCGCGTAGT GCTCGAGCAC TGTTTTCTGC CAGCCAGTCA CGCAAGTACG CGCAGCTATT TTCTCAAGGG AGGCCACGCA ACTCCTGATT TCAGCGCTGA CTCCCGGAAC AACGAAGAAC TCGAATACGC TCTTCGACTT CATACCGTAG CTTTAGCGTC AGCAAGATTG CTGAAGCAAC AGCCGGCAAT CGGCGACTTG ATGCGAAGGT TCCGTACTCA ACTGACGAAT TCGGCCGCCG GATTCATCTG CGCCCCTGGC GTGCAAGATG CAATTGCCTA TTCTATGCTT ACTGCGGTAG CCGACATGCG ACAGGAACGC GAGTTTTCCC GGAGGTTGTG CCGACCGCTG TCTTTGGCCG ATGTGTGGAC GACGATGGGC 
ATGGCTTTTT CAAGGAGGAT GGCCTTTAAA TCGCCTGCCT GGATGTGGCT GGAGGGATCG ATCGCCCTTT CACCTTCATA CGTCGCCTTT CCCCTGAGGT TCATGCTTCG GATCGATGAC TTTCTGAACA GAATCAAGTC ACTGCGTTTC CCACGGTGCA AATCTTCATC GCGGATCGAT CAAGGCAAGA CCTGA
|
Protein sequence | MYAHPRDLFY ELGLRVAPAS SGESARERFA RRFQRARIRA EKLANWSARR RGHAHASIDA IYQKVNWFIR LDRPAAELIE AELTLEEESL YPIEETMLQL NALRAAGHRI LFISDMYIPA SMLRPMLERM GVMEEGDRLY VSCDIGVSKH NGKLFQHVLQ AEGLRGEQLQ HTGDNAHADI RMAEKLGILT RHFTAAHLTE HETRIAGSRL PRHPGASRQA GFSRRCRLAM HLIHGNPTHV LDDVIFSVIV PFLLAYVLWI LDDARKRGIQ RLYFVARDGE VLLKIARELK PDGIELRYLY GSRRAWLPPS ISTDDVDWKR LLAVAGNANA PVDITARAGL SEVEQASVRV ILSLDENTWR TALAFEDACT FIDTLTANPR SRKVLLDSVA AKREAALHYL RQEGLMDGVN WALVDAGWSL NGQAALKRML STVSPSSHQI QGYYIGLARD CLPEARAGRA YAFCPPPGSI FSRRRVVLEH CFLPASHAST RSYFLKGGHA TPDFSADSRN NEELEYALRL HTVALASARL LKQQPAIGDL MRRFRTQLTN SAAGFICAPG VQDAIAYSML TAVADMRQER EFSRRLCRPL SLADVWTTMG MAFSRRMAFK SPAWMWLEGS IALSPSYVAF PLRFMLRIDD FLNRIKSLRF PRCKSSSRID QGKT
|
| |
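The sequence rows above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a nucleotide string. It builds the codon table by hand on the assumption that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model). Only the first 30 bases of the gene are used for the demonstration, and the helper names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


prefix = "ATGTATGCGC ACCCACGCGA CCTGTTTTAT"  # first three blocks of the gene sequence
print(translate(prefix))    # MYAHPRDLFY
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first block of the Protein sequence row. Passing the full Gene sequence value (both wrapped lines joined) to the same functions would be the natural next step.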