Gene Information |
Locus tag | Tmz1t_1777 |
Symbol | |
ID | 7085747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1998619 |
End bp | 1999725 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643698799 |
Product | Rieske (2Fe-2S) domain protein |
Protein accession | YP_002355425 |
Protein GI | 217970191 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
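The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the numbers in this record) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1998619, End bp 1999725.
length = gene_length_bp(1998619, 1999725)
print(length, protein_length_aa(length))  # prints: 1107 368
```

Both results match the Gene Length (1107 bp) and Protein Length (368 aa) fields of the record.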
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.111763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGACA TCGCTTCCAA GGCTCGACTG GCCCAGGCAG CCTCCCAGCT GCCTGTCTCG TGGTACTTCG ACCAGCAGGT TTTCGAACTG GAGAAGAAAC TCCTCTTCGA TGCCGGTCCG GGGTACGTCG GCCATGAGCT GATGGTTCCC GAGGTCGGCA ACTACCGCTC GCTGGAGTGG CTCGACCACG CCAAGCTGCT GTTCCGCACC GAGGCGGGTG TGCACCAGAT GTCCAACGTC TGCCGCCACC GCCAGGCGAT CATGCTGCAG GGCAGCGGCA CGACCAAGCT CGTGGTCTGC CCGGTGCACC GCTGGACCTA CGACCGCCAG GGCAACCTGC TCGGCGCACC GCACTTCCCC GAGAAGCCCT GCCTGGGCCT CAGGCGCGAC GAGCTCGAAC GCTGGAATGG CCTGCTTTTC AAGGGCCCGC GCTCGGCGAG CGCCGACCTT GCCGGGATGC AGGTGGCGGG CGAATTCGAT TTCTCGGGCT ACAAGCTCGA CAAGGTCGAG GTCCACCATT GCAATTACAA CTGGAAGACC TTCATCGAGG TCTATCTGGA GGACTATCAC GTCGTGCCCT TCCACCCCGG ACTGGGCAAC TTCGTGACCT GCAAGGACCT GAGCTGGCAG TTCGGCGACT GGTATTCGGT ACAGAAGGTG GGCATCACCT CGCTCGCCAA GCCGGGCTCG GAGACCTACG CCAAGTGGCA CAAGGCGGTG ATCGACTACT ACGGCGAGAA GAAGCCGACG CATGGTGCGA TCTGGCTCAC CTACTACCCG AATGTGATGG TGGAGTGGTA CCCGCACGTG CTGGTGGTGA GCACGCTGAT TCCGACGGAT GTGGATAAGA CGACCAACGT GGTGGAGTTC TACTACCCGG AGGACATCGT CGAGTTCGAG CGCGAGTTCG TCGAGGCCGA GCAGGCCGCC TACATGGAGA CTGCGATCGA GGACGACGAG ATCGGCGAGC GCATGGATCG CGGCCGGAGG GCGCTGCTGA AGGAGGGGCG CAACGAGGTC GGCCCCTACC AGTCGCCCTT CGAGGACGGC ATGCAGCATT TCCACGAGTT CTACCGGCGC ATCATGGAGC CGCACATCGG CGGGTGA
|
Protein sequence | MSDIASKARL AQAASQLPVS WYFDQQVFEL EKKLLFDAGP GYVGHELMVP EVGNYRSLEW LDHAKLLFRT EAGVHQMSNV CRHRQAIMLQ GSGTTKLVVC PVHRWTYDRQ GNLLGAPHFP EKPCLGLRRD ELERWNGLLF KGPRSASADL AGMQVAGEFD FSGYKLDKVE VHHCNYNWKT FIEVYLEDYH VVPFHPGLGN FVTCKDLSWQ FGDWYSVQKV GITSLAKPGS ETYAKWHKAV IDYYGEKKPT HGAIWLTYYP NVMVEWYPHV LVVSTLIPTD VDKTTNVVEF YYPEDIVEFE REFVEAEQAA YMETAIEDDE IGERMDRGRR ALLKEGRNEV GPYQSPFEDG MQHFHEFYRR IMEPHIGG
|
| |
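The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned up and checked against the GC content and protein fields follows. The helper names are hypothetical; the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    s = clean(seq)
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s) - 2, 3))


# First three blocks of the gene sequence from the record.
prefix = "ATGTCCGACA TCGCTTCCAA GGCTCGACTG"
print(translate(prefix))  # prints: MSDIASKARL
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and protein sequence fields to be compared in the same way.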