Gene Information |
Locus tag | Tmz1t_1476 |
Symbol | |
ID | 7083559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1644345 |
End bp | 1645355 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643698494 |
Product | protein of unknown function DUF1214 |
Protein accession | YP_002355131 |
Protein GI | 217969897 |
COG category | [S] Function unknown |
COG ID | [COG5361] Uncharacterized conserved protein |
TIGRFAM ID | |
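The coordinate, gene-length, and protein-length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS whose last codon is a stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1644345, 1645355)
print(length)                  # 1011
print(protein_length(length))  # 336
```

The strand field does not affect these counts, since the length depends only on the span between the two coordinates.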
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGAT ACCTGATGAT CGCCGCAGTC ACTGGAATCA TGGGACTCGA CCCAGGCGCC GGTTTCGCGG CGGAGAAGGT CACGGTGGAC AGCTTCGTGC GCGCCGAGAC CGACATGACG CTCGACCGCT ACGTCAGGCA GGGCGCGCTG GGCAAACTCA TCCACATCCG CATGCCCGTG CCGATCGACA GGCAGGACGT GATTCGCATG AATCGCGACA CGCTGTATTC CGCCGGCGTC TTCGACCTCT CCGCACCGGT CACCATCGTC AAGCCGGAAA CCGGGGGGCG ATTCCAATCG ATGCTGGTCA TCAACCAGGA CCACTCGATG CTGCCCGCAG AGCACGGCGC GGGCGAGTTC ACCTTCACCC AGGAGAAGAT GGGCACGCGC TACATGATCG TGCTCTTCCG CACCTTCGTC GATTCAAACG ACCCGACTGA TATCAAAGCG GCCAACGCCC TGCAGGACAA GATCGTGGTG AAGCAGGCGG CCCCAGGGAA GTTCGAGATT CCGGAATGGG ATGAGGCCTC ACTGAAGAAG GTCCGCGATG CCATCAACGT TCTGGCGGCG ACCCGGACCA GCGCCAAGGG CATGTTCGGT GACAAGGCCA AGCTCGATCC GATCAGCCAC CTGCTCGGCA CGGCCTTCGG CTGGGGCGGG AATCCGGAAG AAGCCGCCAT TTACGACAAC GTTGTGCCTG CGGAGAACGA CGGCAAGACG CCCCATTCGG TCACGGTCAA GGACGTGCCT GTCGATGGCT TCTGGTCCAT CACCGTTTAC AACAAAGACG GCTTCATGGA GAAGAACGAC CAGAACGTCT ACTCGCACAA CAACGTGACG GCCAAGAAGA ACCAGGACGG GAGCGTGACC ATCCACTTCG GCGCTGGCAC CGATGCGCTC AACAATGTGC CGATCACCCC GGGCTGGAAC TACATCGTCC GCATGTATCA GCCGCGCAAG GAAATCATCG ACGGCACCTG GAAGTTCCCG GTCGCCCAAC CGACGAAGTA G
|
Protein sequence | MNRYLMIAAV TGIMGLDPGA GFAAEKVTVD SFVRAETDMT LDRYVRQGAL GKLIHIRMPV PIDRQDVIRM NRDTLYSAGV FDLSAPVTIV KPETGGRFQS MLVINQDHSM LPAEHGAGEF TFTQEKMGTR YMIVLFRTFV DSNDPTDIKA ANALQDKIVV KQAAPGKFEI PEWDEASLKK VRDAINVLAA TRTSAKGMFG DKAKLDPISH LLGTAFGWGG NPEEAAIYDN VVPAENDGKT PHSVTVKDVP VDGFWSITVY NKDGFMEKND QNVYSHNNVT AKKNQDGSVT IHFGAGTDAL NNVPITPGWN YIVRMYQPRK EIIDGTWKFP VAQPTK
|
| |
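The gene sequence as listed starts with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below translates the first three 10-base blocks of the listing and compares the result with the start of the protein sequence. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons); `translate` and `gc_content` are hypothetical helper names.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to apply to table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence listed above.
prefix = "ATGAACAGAT ACCTGATGAT CGCCGCAGTC"
print(translate(prefix))  # MNRYLMIAAV
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content fields to be checked in the same way.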