Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0997 |
Symbol | |
ID | 7083731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1097750 |
End bp | 1099408 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698019 |
Product | hypothetical protein |
Protein accession | YP_002354659 |
Protein GI | 217969425 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTGAGC TGACGCCGCA GGACATGGCT GCCAAGCTTC TGGCCACCGG CTTCGAGCGC AGCGGCCCTT CGGCCGCGAC CTTGAGCGAC CCCATCGCCG ACACGCCGAT GGTGGTGACG CTGGACCAGT TGCGGCCCTA CGACCACGAC CCGCGCGTGA CGCGCAACCC GGCCTATGCG GAGATCAAGG CGTCCATCCG CGAACGCGGG CTGGACGCGC CCCCCGCGAT CACGCGCAGG CCGGGCGAGG CGCATTACAT CATTCGCAAC GGCGGCAACA CGCGGCTCGC GATCCTGCGC GAGTTGTGGA GCGAGACCAA GGAGGAACGC TTCTTCCGCA TTGCGTGCCT GTTCCGCCCG TGGCCGGCGC GCGGCGAAAT CGTGGCGCTG ACTGGACATC TGGCCGAGAA CGAGCTGCGC GGCGGGCTGA GCTTCATCGA GCGGGCCTTG GGCATCGAGA AGGCGCGCGA GTTCTACGAG CAGGAAAGCG GCCAGGCGCT GTCACAGAGC GAACTCGCGC GGCGGCTGAC GGCCGACGGC TATCCGGTGC CGCAGTCACA CATCAGCCGC ATGAACGATG CGGTGCGCTA TCTGCTGCCG GCGATCCCGA CGCTGCTGTA CGGCGGATTG GGCCGGCATC AGGTGGACCG GCTCGCGGTG CTGCGCAAGG CGTGCGAGCG CACCTGGGAG CGGCGTGCGC TGGGCCGCAC CGTGGCCGTG GACTTCGCCA CCTTGTTTCA GGACGTGCTG ACGCAGTTCG ACACACAGCC GGACGACTTC TCGCCGCAGC GGGTGCAGGA CGAGCTGGTC GGCCAGATGG CCGAGCTGCT GGAGGCGGAC TACGACACGC TGGCGCTGGA GATCAACGAC AGCGAAAGCC GCCAGCGTGC GCTGACCAGC GAACCGGCGG CGCCGACGCC ACCGGCAGCG CCTGTCGTGC CTGCTGATCC TCCCCCGCCG GTCTCCGCGC CTCAGCAGCC ACCCGCCTCG TCTGTGCCGC GCGACACCAC GCCGGTCGCG CCTTCGGCGC CAGCAGCGAC ACCGCCTGCA TCGCCCGAAG CGCCGGAGGA CCAGCACGGG GAACGCGAAG AGCGCCTGCA AGGGCACATC GTGACACCGG CACCGACCAC CGAGCGCCTG CAGTCCATCC AGCGGATGGT CGCGGACCAG CTCGGCGACA AGCTGCCCGA CTTCGAGGCC GATGCGCTGC GTGCGATCCC CGTGCAGGTC GGCGGGCTCT ATCCCATCTC GGACGTCTGG TACGTCGAGC CGGGGCTGGA CGTGCCGGAT CGCCTGCGCG TGCACATCGC GCAGTTCGCG CGCGAGATCG GCGAGGAAGC GGCGGTCGGC GACCACATCG AGGCCAGCGT CGGCGGCATC GGCTTCGTCT GCGCGGCGCC GGTTGTGGGC CAGGCGAAGG CGCTGCCGGC GTTCGCGCGG GCGGTGCTGA CCCTGCTGCA TGTGCTGAGT GCGGCTCCGC CCTCCGCGAA CGGATTGGAC CGCGCGCGGC TGGCCGACGA GCTGGCGGCG CTGCTGCATG GCCACGGCGG CTCGGCCACA CGCCTGAGCG ATGCTGCGCT GGTGAAGCTG TTCCGTCTGC TGCGCCTGGC GCGCCGGCTG CTGGATCTGG AAGCCGGCGT AGCGAGCCAG GATTCCTGA
|
Protein sequence | MAELTPQDMA AKLLATGFER SGPSAATLSD PIADTPMVVT LDQLRPYDHD PRVTRNPAYA EIKASIRERG LDAPPAITRR PGEAHYIIRN GGNTRLAILR ELWSETKEER FFRIACLFRP WPARGEIVAL TGHLAENELR GGLSFIERAL GIEKAREFYE QESGQALSQS ELARRLTADG YPVPQSHISR MNDAVRYLLP AIPTLLYGGL GRHQVDRLAV LRKACERTWE RRALGRTVAV DFATLFQDVL TQFDTQPDDF SPQRVQDELV GQMAELLEAD YDTLALEIND SESRQRALTS EPAAPTPPAA PVVPADPPPP VSAPQQPPAS SVPRDTTPVA PSAPAATPPA SPEAPEDQHG EREERLQGHI VTPAPTTERL QSIQRMVADQ LGDKLPDFEA DALRAIPVQV GGLYPISDVW YVEPGLDVPD RLRVHIAQFA REIGEEAAVG DHIEASVGGI GFVCAAPVVG QAKALPAFAR AVLTLLHVLS AAPPSANGLD RARLADELAA LLHGHGGSAT RLSDAALVKL FRLLRLARRL LDLEAGVASQ DS
|
| |