Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0649 |
Symbol | |
ID | 7084587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 737604 |
End bp | 739082 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643697675 |
Product | transcriptional regulator, Crp/Fnr family |
Protein accession | YP_002354317 |
Protein GI | 217969083 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0664] cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCAT CGACTGCCGC AGAGCGCCGG GCGCTGCGTG ACCGCCTCAC CGCGGTGCCG CTGTTCGCCG GCCTGCCCGC CGTGCAGCTG GCCGAGCTCG CCGCCGCGGC GCGCCTGCTC GAGCTGCCGG CGCGCACCCT GGTGCATGGC TGCGGCGAGC CCTTCGACGA GGCATACGTG TTGTGCAGCG GGACGGTGGT GCGCTTTCGC GAGCTCGACG GCGACGCGCG CAAGGTGGTC GAGCTGGTGC ACACCCCTCA GATGCTCGGC TGCGGCGAGT TCTTCGGTGC CAGCCGTCAT GAATCGGGCT GCGAGACGGC AAGCGCCTGC GCGCTGATCG CGCTCGACGC CGGCGTGCTG CGCGCGGTGG CCGAGCGCAG CCTGGTGCTG ACCCGGCGCA TGCTGCAGGC GGTGGCGGCG CGGCTGTGCG AGGTCGAGTT CGACGTCGCC GGGCGCCATG CCAGCCGCAC CAGCGCGCAG CGCATCCTCG ATTATCTGGT CGAGCTCGCC GGCGGCAGCC TGGCACTCGC CGGCGAGACC ACGGTGGAGC TGGCCTCGAC CAAGAAGGTG CTCGCCGCGC GCATCGGCAT CACCCCCGAG GCCTTCTCGC GCAGTCTGCG CGAGCTTGCC GACAAGGGCG TGATCGTGGT CGACAAGCGC CTGATCCACA TCCAGAACGC CGCCACCCTG GACACTGGTG CGGGCGAGCA GCCGCAGCGC CTGAGCTTCT CGCGCCGCAA CAAGGGTGCG CGCCGCGACG AGACGATGTC GGTCGGTGAG CTGATCAACC TGTGCGGCCG CCTGCGCCTG CAGGTGCAGC GTCTGGCGAT CGACTGGGTG CTGATCGGGC ATGGCGTGGC GCCGGTCGAG ATGCAGCTGC GCCTGCGCCA GGACATCGCC GAGTTCGAGC GCAGCCTCGC GGTCCTGCAG GCCGCCAGCC TCGCCCCCGA GCTCGGCGAG GCGCTGGAGG ACGTGGGCCG GGTGTGGGCG GATTTTCGCG ATGCGCTGGG TGCGGCCGAG CGCCCGCCGG CGGGCGCGGC GCGGGTGCTG CAGCTGAGCG AGGACTTCAT CACCGCGGCG GATTTTCTCA ACGCCGTGGC CGGCGGCTTC GCCGCCTCCG GCAGCCTGTA CATGGTCAAC GTCGCCGGAC GCAACCGCAT GTTGTCGCAG CGGGTGGCGA AGTTCTTCCT CTTCGAGGGC TTCGAGGGCT GCCCAGACCC GCAGGCGCAG ATCGCCGCTT CGTGCCGGGC CTTCGAGCAC AACCTGCAGC GCTTGCGCGA GAATGCCGGC GAGCTGCCCG AGCTGGTGGC GCAGATCGAC GCCACGGCCG CCCTGTGGGC GCGCTTCATC GCCGCGCTCG ACCCCGCGAT CCGCCAGCCT CATCGCATCG GCCGGGTGCG CGCGGTGCTC GACGAGGGGG AGCGCCTGTT GCGCTACACC AACACCCTGG TCAAGCTCTT CGAGCGCCTC GCGCACTGA
|
Protein sequence | MTPSTAAERR ALRDRLTAVP LFAGLPAVQL AELAAAARLL ELPARTLVHG CGEPFDEAYV LCSGTVVRFR ELDGDARKVV ELVHTPQMLG CGEFFGASRH ESGCETASAC ALIALDAGVL RAVAERSLVL TRRMLQAVAA RLCEVEFDVA GRHASRTSAQ RILDYLVELA GGSLALAGET TVELASTKKV LAARIGITPE AFSRSLRELA DKGVIVVDKR LIHIQNAATL DTGAGEQPQR LSFSRRNKGA RRDETMSVGE LINLCGRLRL QVQRLAIDWV LIGHGVAPVE MQLRLRQDIA EFERSLAVLQ AASLAPELGE ALEDVGRVWA DFRDALGAAE RPPAGAARVL QLSEDFITAA DFLNAVAGGF AASGSLYMVN VAGRNRMLSQ RVAKFFLFEG FEGCPDPQAQ IAASCRAFEH NLQRLRENAG ELPELVAQID ATAALWARFI AALDPAIRQP HRIGRVRAVL DEGERLLRYT NTLVKLFERL AH
|
| |