Gene Information |
Locus tag | Tmz1t_2972 |
Symbol | |
ID | 7874362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3218010 |
End bp | 3219737 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699893 |
Product | protein of unknown function DUF1302 |
Protein accession | YP_002889948 |
Protein GI | 237653634 |
COG category | |
COG ID | |
TIGRFAM ID | |
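The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though one consistent with the lengths listed) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3218010
end_bp = 3219737

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons; the last one is assumed
# to be the stop codon, which does not contribute a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1728 0 575
```

Under those assumptions the result agrees with the listed Gene Length (1728 bp) and Protein Length (575 aa).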
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.715393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC ATCTGAAGAA ACTGGTGCTG GGGCTGGCCA TGGCCGGCAT CTGTGCGCCG GCGGCGTACG CCTTCCAGTT CGAGGCCGGC TCGGTGCGCG GTTCGCTCGA TTCCACGCTG ACCGCGGGCT TCGGCAAGCG CGTCGAGGGC CGCGAGTGCA GCCACCACGG CGATCAATCC ACCCCCTGCG GCGCCGATGC CAACACCGCG GTATGGGGCA ACGGCGACGA CGGCAATCTC AACTACGACA AGGGCGACCT GTTCACCACC CACCTCAAGG GCTCGCACGA GCTGCTGCTG AGCTTTCCCG ACGAGTGGAA GTTCATGGGC CGGGTGGGCT GGCTGTACGA CTTCGTCGCC GACGACACCG CGCGCACCCC GCTCGACGGC GACGCCAAGC AGGCGGTCGG CCAGTACGTG CGCCTCTACG ACCTGTGGGT GAGCAAGGAA TTCGACCTCG GCGAGCGCCG CGGCCGCGTG CGCTTCGGCA ACCAGGTGGT GAGCTGGGGC GAGAGCCTGT TCATGATCGG CGGCATCAAC TCCAACGTGG CGATGGACCT GCAGCGCCTG TCCTCGCCCG GCGTGCAGCT GAAGGAGGCC TTCCTGCCCT CGCCGGCGCT GTCGGTCGCC GCCAACCTCG GCAGCGGAGT GAACGTCGAG GCCTACTACC AGTTCCAGTG GAAGCCGTAC AGCTTCCCGC CGGCGGGGAC GTATTTCTCG CAGAGCGATT TCTTCGACAA GGGGCGTGAC AGCCTGATCT ATTTCACGCC GGATGCCGCC AGCCGGATTC GTACGCCGGG TGATCCGCTT TACGGCAAGA CCCTCGCCGC GGCTCGGGAA GAAATGCTGA ACAGTGAACT CACGGCGATC CCGGTCGACG CCGACGACGA GCCGGGCGAC TCCGGCCAGT TCGGCGTCTC GCTGAAGTAC CGGCCCGAGG GCATGGACGT CGATCTCGGC CTCTACTACC AGCGCTTCCA CGACAAGACC CCCAACCTTC AATACTGGAA CGCGAACGGT GCGGCTCGCA TGTACTTTCT GGAGGACCGC GAGCTCTACG GCATCAGCGC CAACACCTCG GTGGGCAACT GGGCGGTGGG CGCCGAGTTG TCGTATCGGC CGGAGGATGC GGTTTCGCTC GGGGCCTGCC AGGACATCGG CTTCGGGTCA CCCGATCAGT GCGACGGCTC GATCGACGAG GAGCGCTACC AGTTCCACCT CACCGGCATC CTCAGCCTGA CGCCCGGCGA CCACGGCTGG TTCCTCGACC TGATGGGCGC GCAGACCGGC ACCTTCCTTG GCGAGGCGGT GGCGATCGCC TATCCGGGGG TGAGCAAGGA CAAGGCCTAC GTGCGCACCC GCAACGGCGT GACCTACCAG CAACTCCCCG CCGCCGGCGC ATGGACGCAC GACAGCGGCG TCGGCGACAA GCTGTCCTGG GGCTACATGT TCGACTTCAG CCTGACCTAC GACAGCACCC TGATCCCGGG CTGGCAGGTG ATCCCGGGGG TGTTCTTCTC GCATGCGGTC AATGGCAACA CGCCCAACTT CATGGCCAAC TGGATGGAGG GGGCGAAGTC GGCCAACTTC TATGTGCTGT TCAACCGCAA CCCGATGACC TGGCAGGCCG GCATCAACTA CACCCGGTTC TGGGGCGGCG ACTACAGCTT CAGCGCGCCC TACAAGGATC GCGACTTCGT CGGCGGCTTC ATCTCGCGCA ACTTCTGA
Protein sequence | MTTHLKKLVL GLAMAGICAP AAYAFQFEAG SVRGSLDSTL TAGFGKRVEG RECSHHGDQS TPCGADANTA VWGNGDDGNL NYDKGDLFTT HLKGSHELLL SFPDEWKFMG RVGWLYDFVA DDTARTPLDG DAKQAVGQYV RLYDLWVSKE FDLGERRGRV RFGNQVVSWG ESLFMIGGIN SNVAMDLQRL SSPGVQLKEA FLPSPALSVA ANLGSGVNVE AYYQFQWKPY SFPPAGTYFS QSDFFDKGRD SLIYFTPDAA SRIRTPGDPL YGKTLAAARE EMLNSELTAI PVDADDEPGD SGQFGVSLKY RPEGMDVDLG LYYQRFHDKT PNLQYWNANG AARMYFLEDR ELYGISANTS VGNWAVGAEL SYRPEDAVSL GACQDIGFGS PDQCDGSIDE ERYQFHLTGI LSLTPGDHGW FLDLMGAQTG TFLGEAVAIA YPGVSKDKAY VRTRNGVTYQ QLPAAGAWTH DSGVGDKLSW GYMFDFSLTY DSTLIPGWQV IPGVFFSHAV NGNTPNFMAN WMEGAKSANF YVLFNRNPMT WQAGINYTRF WGGDYSFSAP YKDRDFVGGF ISRNF
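The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field might be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The amino-acid string is the NCBI listing in TCAG codon order; translation table 11 uses the same amino-acid assignments as the standard code and differs from it in the permitted start codons. Only the first three blocks of the gene sequence are used here, so the full record is not reproduced.

```python
from itertools import product

# NCBI amino-acid string in TCAG codon order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(seq):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq):
    """Translate complete codons from position 0; '*' marks a stop codon."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))

# First three blocks of the Gene sequence field above.
prefix = "ATGACGACAC ATCTGAAGAA ACTGGTGCTG"
print(translate(prefix))  # MTTHLKKLVL
```

The translated prefix matches the first block of the Protein sequence field. Although the gene lies on the minus strand, the Gene sequence field begins with ATG and ends with TGA, so it appears to be given in coding orientation and no reverse complement is applied in this sketch.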