Gene Information |
Locus tag | Tmz1t_2020 |
Symbol | |
ID | 7083776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2281508 |
End bp | 2282908 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699045 |
Product | sigma54 specific transcriptional regulator, Fis family |
Protein accession | YP_002355666 |
Protein GI | 217970432 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
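The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical and not part of any database tool.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return consistency checks for an unspliced CDS record.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    # Inclusive coordinates: the span covers both the start and end base.
    span = end_bp - start_bp + 1
    # Each codon is three bases; the last codon is assumed to be the stop.
    codons, remainder = divmod(span, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for locus tag Tmz1t_2020.
checks = check_cds_lengths(2281508, 2282908, 1401, 466)
print(checks)
```

For this record all three checks come out true: 2282908 - 2281508 + 1 = 1401 bp, and 1401 / 3 = 467 codons, one of which is the stop codon, leaving 466 residues.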
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.29433 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCCG ACCCCCGCCC GCTGCCCGAG CTGGTCTCCT TCCTCGAGAC CTTCCCCGAA CCGCACATCC TGTGCGATCG CGACTACCGC ATCCTCGCCG CCAACGCCGC CTACCGACGC TCATGGCCTG AGGCGCGCAG CGTGATCGGG CGCACCTGCT ACGACGTGTC GCACCACTAC AGCGTGCCCT GCGATCGCGC CGGAGAATCC TGCCCGCTCG CGCGCAGCCT GCAGTCGGGA CAGCGCGAGC GCGTGCTGCA CCTGCATCAC ACGCCGCGCG GCGAGGAGTA CGTCGACATC GAACTCTCGC CCGTGCGCGA CGCCGCTGGC GAGATCGCCT GGTTCATCGA GAAGATGGAG CCGCTGCACG TCGCCCGCGG GGTGTCCGAC CGCCGCGGCC TGATCGGACG TTCGCCGGCC TTCCAGCGCA TGCTCGAGCT GATCGCGCGG GTCGCGCCCT CGGACGCGAG CGTGTTGCTG CAGGGCGAAT CCGGCACCGG CAAGGAGCTG CTCGCGAGCG CGGTGCACGA AGCGAGCCGG CGCGCAGAGG GGCCCTTCGT GGTGGTGGAT TGTTCCGGCC TGCCCGAGAC CCTGTTCGAG AGCGAGGTCT TCGGTCACGA GCGCGGTGCC TTTACCGGTG CGACCGCGCG CAAGCCCGGG CTGGTCGAGG CGGCCAGCGG CGGCACCCTC TTCCTCGACG AGGTCGGCGA CATTCCGCTC GCCATGCAGG TCAAGCTGCT GCGCCTGCTC GAGACCGGCA CCTACCGCCG CGTCGGCTCC ACCGAGCTGC GTCGTGCCGA CATCCGCCTG GTCTCGGCCA CCCACCGTCC GCTCAAGCGC ATGATCGCGG AGGGCGGCTT CCGTCAGGAC CTGTACTTCC GCATCAACAC TTTCCCGATC ACCGTACCGC CGCTGCGCGA GCGTGAGGGC GACCTGCCGC TGCTGATCGA CTCCCTGCTC GAACGTGTCG CGCCGAAGCG CAAGCTGTCG CTGAGCCCCG CCGCGCTGCG CGTGCTGAGC AACTACGCCT TCCCGGGGAA CGTGCGTGAG CTGCGCAACG TGCTCGAGCG TGCCAGTCTG ATGTGCGACG GCGAGCTGAT CGGGCTCGAG CATCTGCCCG AGGAGGTGCT GCATCCCGAG AGCGAGAGCG CCGAGTCACG ATTTGCCAGT GGAGGTCCGT TGGCCGCAGC GATCCCGGTG CGCGATCCGC TGGATCTCGA GGAGGTGCAG CGCCAGGCCA TGTTGCGCGC CGTGCGCGCG CATCGTGGCA GTCGGCGCGA GCTGGCTCGC CGGCTGGGGA TCAGCGAGCG CACGCTGTAT CGGCGGCTGA AGTCACTCGG CCTGCTCGAG GGCCAGGCGA TGCGCGGCGG GGATGAGGAG GGTGGGGGGG CGGCGCGCTG A
|
Protein sequence | MPPDPRPLPE LVSFLETFPE PHILCDRDYR ILAANAAYRR SWPEARSVIG RTCYDVSHHY SVPCDRAGES CPLARSLQSG QRERVLHLHH TPRGEEYVDI ELSPVRDAAG EIAWFIEKME PLHVARGVSD RRGLIGRSPA FQRMLELIAR VAPSDASVLL QGESGTGKEL LASAVHEASR RAEGPFVVVD CSGLPETLFE SEVFGHERGA FTGATARKPG LVEAASGGTL FLDEVGDIPL AMQVKLLRLL ETGTYRRVGS TELRRADIRL VSATHRPLKR MIAEGGFRQD LYFRINTFPI TVPPLREREG DLPLLIDSLL ERVAPKRKLS LSPAALRVLS NYAFPGNVRE LRNVLERASL MCDGELIGLE HLPEEVLHPE SESAESRFAS GGPLAAAIPV RDPLDLEEVQ RQAMLRAVRA HRGSRRELAR RLGISERTLY RRLKSLGLLE GQAMRGGDEE GGGAAR
|
| |
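The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the database's own method: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used as input. The codon assignments are those of translation table 11, which are the same as the standard genetic code.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (translation table 11;
# '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence codon by codon, ignoring trailing partial codons."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
first_30 = "ATGCCGCCCG ACCCCCGCCC GCTGCCCGAG"
print(translate(first_30))               # MPPDPRPLPE
print(round(gc_fraction(first_30), 3))   # 0.833
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.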