Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1758 |
Symbol | |
ID | 7085725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1978332 |
End bp | 1979612 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698777 |
Product | protein of unknown function DUF482 |
Protein accession | YP_002355406 |
Protein GI | 217970172 |
COG category | [S] Function unknown |
COG ID | [COG3146] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC CCGACGGATT CCACTTCATC CCCCGCATCG CCCGTATCGA CGCACGCCAG TGGGATGCCC TGGTCGACGC GAGCTGCGCA AGCGCCGCGC CCTGTATCCG CCACGCCTTC CTGCACGCCC TCGAAGAGAG CGGCTGCGTG GGCGGGCGCA GCGGCTGGAC GCCCGCGCAC GCCACCCTGT GGCGCGGCGG CACGTTGGGC GCGGCCATGC CGCTCTATGT GAAGACGCAT TCCCGGGGCG AGTACGTCTT CGACTGGGCC TGGGCCGAGG CCTACCAGCG CCACGGCCTC GACTACTACC CGAAGTGGCT CGCGGCCGTG CCCTTCACGC CCATCCCCGG GCCACGCCTG CTCGGCCACG ACGACGGCGC ACGCCGGGAA CTGCTCGCCG CCCTGCTCGC GAAGGCGCTG GAGAGCGGGC TGTCGTCGCT GCACCTGCTT TTCCCGCTCG ATCGCGAGCG CCCGCTGCTC GAGGCCGCCG GGCTAATGAT CCGCGAGGAC GTGCAGTTCC ATTGGAGAAA TCCACCCGCC TCCGCCGAGA CCGAGCCGAC GCCCGCGGAA TCCAGCGCGC CTATCTCCGT CCGGCAGCGG CGCTGGGCCG ACTTCGAGCA TTTCCTCGCC AGCCTCTCCG GCGCCAAGCG CAAGAAGATC CGCCAGGAAC GCCGCCGCGC AGCCGCACAC GGTCTGGATC TGCAGTGGCT GGACGGGCAT GAAGCGCGGG ACGAGGACTG GGCGTTCTTC GCGCACTGTT ACGCCAACAC CTACGCGCAG CACCGCTCCA CGCCCTATCT CGCAGCAGAT TTTTTTGTGC GGCTCGCGCA CAGCATGCCC GAGGCCGTCC GTCTGCTGGT GGCTCGCCGC GACACCCAGC CGATCGCCGC CGCCTTCTTC CTGTGCGATC GCGAGGCGCT GTACGGGCGT TACTGGGGCT GCGTCGAGGA CCTGCCCTTC CTGCACTTCG AGCTGTGCTA CTATCAGGCC ATCGAATACT GCATCGAGCA TGGTCTGACC CGCTTCGAGG GCGGCGCACA GGGGGAACAC AAGCTCGCCC GCGGTCTGCT CCCGGTGCGT ACCTTCTCCG CGCACTGGCT CGCGGATCCG CGCTTTCGCG GCGCGGTCGA CGATTGGCTG GCAAGGGAGC GTGCGGGCGT GGGGCACTAT CTCGACGAAC TGGCGGAGCA CAGCCCATTC CTTCGCGAGA AGGGACCCGG CGGCATCGAG ACCGGCGCAG CAGGAGCCGC GGAGCCCCAC CGGGGTTCGC GCCGGCAGTG A
|
Protein sequence | MSQPDGFHFI PRIARIDARQ WDALVDASCA SAAPCIRHAF LHALEESGCV GGRSGWTPAH ATLWRGGTLG AAMPLYVKTH SRGEYVFDWA WAEAYQRHGL DYYPKWLAAV PFTPIPGPRL LGHDDGARRE LLAALLAKAL ESGLSSLHLL FPLDRERPLL EAAGLMIRED VQFHWRNPPA SAETEPTPAE SSAPISVRQR RWADFEHFLA SLSGAKRKKI RQERRRAAAH GLDLQWLDGH EARDEDWAFF AHCYANTYAQ HRSTPYLAAD FFVRLAHSMP EAVRLLVARR DTQPIAAAFF LCDREALYGR YWGCVEDLPF LHFELCYYQA IEYCIEHGLT RFEGGAQGEH KLARGLLPVR TFSAHWLADP RFRGAVDDWL ARERAGVGHY LDELAEHSPF LREKGPGGIE TGAAGAAEPH RGSRRQ
|
| |