Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0048 |
Symbol | |
ID | 7083431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 51312 |
End bp | 52766 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697097 |
Product | hypothetical protein |
Protein accession | YP_002353746 |
Protein GI | 217968512 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCTCG ACTTCACCAC CCTCGACACC CTGCGCAGCC ACCACCCCGC GTGGCGGCTA CTGCGCTCGG ACCACGCGGC GCTGGTGGCG AGCTTCCTGC ACCGCGTCTT CGTCGCCCCC AACGTGCGCG GCATGGCGGC GGCGGACCTC GCCGAGGCGC TCGAGGACGA GCTCTACGCG CTGCGTCTGC TGCGCGGCGA CGACGCCTTC CCCAAGCCGG CGCTCGACTA CCTCAACGAC TGGGCCGCGG CCGACAAGGG CTGGCTGCGC AAGTACTACA GGCCCGGCAC CGACGAGGCC CAGTTCGACC TCACCCCGGC CACAGAGAAG GCGCTCGCCT GGCTGGCGCA GCTGAGCGAG CGCCAGTTCG TCGGCACCGA GTCGCGCCTG CTCACCTTGT TCGCCCTGCT CAAGCAGATG GGCGAGGGCA GCGAGACCGA CCCCGCGCGC CGCATCGCCG AGCTGCAGCG CAAGCGCGAC GAGATCGACG CCGAGATGGC GCGCGTGCTC GCCGGCGACA TCCCGCTGCT CGACGACACC GCGCTCAAGG ACCGCTTCCA GCAGTTCATG CAGGGCGCAC GCGAGCTGCT CGCCGACTTC CGCGAGGTCG AGCACAACTT CCGCCAGCTC GACCGCCGCG TGCGCGAGCG CATCGCGCTG TGGGAGGGCA GCAAGGGCGC GCTGCTCGAA CAGATCATGG GCGAACGCGA CGCCATCGCC GACTCCGACC AGGGGCGCAG CTTCCGTGCC TTCTGGGACT TCCTGCTGTC GAGCCGGCGC CAGGAGGAAC TCAGCGACCT GCTCGACCAG GTGCTCGCCC TGCCCGCGGT GGCCGAGCTG AACCCCGACG CCCGCACCCG CCGCGTGCAT TACGACTGGC TCGAGGCGGG CGAGCACACC CAGCGCACGG TGGCGCAGCT GTCGCAGCAG CTGCGCCGCT TCCTCGACGA CCAGGCGTGG CTGGAAAACC GCCGCATCAT GGACCTGCTG CACGGCATCG AGAGCAAGGC GCTCGCCCTG CGCGCCGCGC CGCCGCCCGG CCCCGTGATG GAGATCGCCC TGCCCGGCGC GGCGATCGAG CTGGCGCTGG AGCGCCCGCT GCACACCCCG CCCACCCGGC CGGCGATCGC CGCGCTGCGC GTCGAGGCCG GCGACGAGGA TGTCGACCCG CACGCGCTCT TCGAGCAGGT CACGGTGGAC AAGGCACGCC TCGCCCGCCA CATCCGCCAC GCCCTGCAGG ACCGCGCCCA GGTCACGCTC GCCGCGCTGG TCGCCGCGCA GCCGCTGGAG CAGGGCCTGG CCGAGCTGAT CGCCTACCTG CAGCTCGGCA GCGACGCCTT CACCACCACC GTCGACGAGG CCACCCCCGA GCCGATCGAA TGGCAGGCGA GCGCCGCCGA CGGCGCCATG GTCACCCGCA AGGCCCGCCT GCCGCGGGTG ATCTTCATGC GCTGA
|
Protein sequence | MPLDFTTLDT LRSHHPAWRL LRSDHAALVA SFLHRVFVAP NVRGMAAADL AEALEDELYA LRLLRGDDAF PKPALDYLND WAAADKGWLR KYYRPGTDEA QFDLTPATEK ALAWLAQLSE RQFVGTESRL LTLFALLKQM GEGSETDPAR RIAELQRKRD EIDAEMARVL AGDIPLLDDT ALKDRFQQFM QGARELLADF REVEHNFRQL DRRVRERIAL WEGSKGALLE QIMGERDAIA DSDQGRSFRA FWDFLLSSRR QEELSDLLDQ VLALPAVAEL NPDARTRRVH YDWLEAGEHT QRTVAQLSQQ LRRFLDDQAW LENRRIMDLL HGIESKALAL RAAPPPGPVM EIALPGAAIE LALERPLHTP PTRPAIAALR VEAGDEDVDP HALFEQVTVD KARLARHIRH ALQDRAQVTL AALVAAQPLE QGLAELIAYL QLGSDAFTTT VDEATPEPIE WQASAADGAM VTRKARLPRV IFMR
|
| |