Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3050 |
Symbol | |
ID | 7874520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3303818 |
End bp | 3304969 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699973 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_002890025 |
Protein GI | 237653711 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGAT CCGAGGGCAC GATCATGGTG CTGGTGGCGA CCCGCAAGGG CGCCTGGTTC TTCCACAGCG ACACCGCGCG CCGGACCTGG CGGGTGGAGG GGCCCCAACT GCTCGGCCAG ATCGTCAGCC ACGCGGTGCT CGACCCGCGC GACGGCCGTA CGCTGCTCGC CGCCGCCAGG ACCGGCCATC TTGGCCCCAC CCTGTTCCGC TCCACCGACC TCGGCCGGCA CTGGTCGGAG GCCGTCCAGC CGCCCGCCTT CGCGCGTGCC CCCCAAGGCA CTGCAGGGCG CACGGTGGAC CACACTTTCT GGCTCACCCC CGCCCCCGCC GGCGAACCCG GCTGCTGGTA TGCAGGCACC TCGCCGCAGG GCCTGTTCCG CTCCACCGAC GGCGGCGTGC ACTGGACGCC GTTCTCCTGC CTCAACGACG ACCCGCAGTA CATCCGGTGG ATGGGCACCG TGCAGGACGG TACGCCGGAC GGGCCCAAGC TGCATTCGAT CATCATCGAC CCGCGCGACC CCGCGCATCT GTACATCGGC ATGTCGGGCG GCGGCGTGCA CGAGTCCCGC GACGCCGGGC GCAGCTTCGC GCCACTCCTG AACGGGCTCG AGGTGGTCGA GGGCTTCGAC CGCGCCGACC CCAGCTTCCA CGACCCGCAC TGCATCCGGA TGTGCCCGAG CGCACCGGAC CGGCTCTACC AGCAGAATCA CTGCGGCGTG TACCGGCTCG ACCGCCCGGG CGACGAGTGG GTGCGGATCG GTCGCAACAT GCCTGCGGAG GTCGGCGACA TCGGCTTTCC GCTCGTGGTG CATCCGCGCG ACGCCGAGCG CGCGTGGATC TTCCCGATGG ACGGCACCGA CGTCTGGCCG CGCACGAGCC CGGGCGGCCG CCCGGCGGTG TATGGCACCC GCGACGGCGG GGCGAGCTGG CAGCGCCTGG ACCGCGGCCT GCCGCCGCAG CAGGCGTGGT GGACGGTCAA GCGCCAGTCG ATGTGCGCCG ATGGGCTGGA CCCGCTGGGG CTCTACTTCG GCACCACCAG CGGCGAGCTG TGGACGAGCG CCGACGAGGG TGAGCACTGG CAGTGCATCG CACGGCACCT GCCGGAGATC CTCGCGGTCG AGACCGGCCA TCCGAGCGCC GCCGCGACAT GA
|
Protein sequence | MNGSEGTIMV LVATRKGAWF FHSDTARRTW RVEGPQLLGQ IVSHAVLDPR DGRTLLAAAR TGHLGPTLFR STDLGRHWSE AVQPPAFARA PQGTAGRTVD HTFWLTPAPA GEPGCWYAGT SPQGLFRSTD GGVHWTPFSC LNDDPQYIRW MGTVQDGTPD GPKLHSIIID PRDPAHLYIG MSGGGVHESR DAGRSFAPLL NGLEVVEGFD RADPSFHDPH CIRMCPSAPD RLYQQNHCGV YRLDRPGDEW VRIGRNMPAE VGDIGFPLVV HPRDAERAWI FPMDGTDVWP RTSPGGRPAV YGTRDGGASW QRLDRGLPPQ QAWWTVKRQS MCADGLDPLG LYFGTTSGEL WTSADEGEHW QCIARHLPEI LAVETGHPSA AAT
|
| |