Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3749 |
Symbol | |
ID | 7873747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4120284 |
End bp | 4121195 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700694 |
Product | transcriptional regulator, ArgP, LysR family |
Protein accession | YP_002890718 |
Protein GI | 237654404 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | [TIGR03298] transcriptional regulator, ArgP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATC ATCGCTTGCT GGCCTTCGAG GCGGTGCTGC AGGAGGGCGG ATTCGAGCGC GCGGCGCGCC GGCTGGCGCT GACCCAGTCC GCGGTGTCGC AGCGGGTCAA GCTGCTCGAG GCCGAGCTCG GCCAGGTTCT GCTGGTGCGC AGCAAGCCGG TGCGGCCCAC GCCGGCCGGG CGCAGGCTGC TGCCCTACCT TGCACAGCTG CGCCTGATGG AGGCGGAGGC GCGACGCGCG CTGTCGCCAC GCGAGGTGGA CGGGCCGCTG CGTCTCGCAG TGGGCGTCAA CGCCGATTCG CTCGCCACCT GGTTCATCGG CGCGGTCGCC GAGGTGGTCC GCGACGAGGG CATCGTGCTC GACTGCGTGG TCGACGACCA GGACCACACC CACGCGCTGC TCGCCGACGG CGAGGTGCTG GGTTGCGTGT CCACCCGCGC CGATCCGATG CGCGGCTGCG CCGCCGTACG CCTCGGGGCC ATGCCCTATC TGTGCGCCGG CTCGCCCGAC TTCCGCGCGC GCTGGTTCCC GCAGGGTCTC ACCCCGGCGG CGCTGGCGCG CGCGCCGGCG ATCGTCTTCG GCCACCACGA CGACATGCAC GAGGCCTTCC TGCTGCGCCA CTTCGGGCTG GATTCGCGGC GCTATCCACA CCACGTCGTG CCCTCGTCCG AGGGCTTCAT GGCCTTCGCG CTCGCCGGCC TGGGCTATGG ATTCGTCCCC GAGATCCAGG CACGTGCGCA CCTTGCGCGC GGCGAGCTGG TCGACCTGGC ACCCGAGCGC GAAGAGGTGG TCCTGTACTG GCACCACTGG CAGGTACAGT CTCCGGTGAT GGCGAGGCTG GCGCAGGCGA TCGGAGATGC TGCAGGGCGG GCACTCGGGG GTGAGCGGCG CGATCCAGCC GGGCCGGGCT GA
|
Protein sequence | MIDHRLLAFE AVLQEGGFER AARRLALTQS AVSQRVKLLE AELGQVLLVR SKPVRPTPAG RRLLPYLAQL RLMEAEARRA LSPREVDGPL RLAVGVNADS LATWFIGAVA EVVRDEGIVL DCVVDDQDHT HALLADGEVL GCVSTRADPM RGCAAVRLGA MPYLCAGSPD FRARWFPQGL TPAALARAPA IVFGHHDDMH EAFLLRHFGL DSRRYPHHVV PSSEGFMAFA LAGLGYGFVP EIQARAHLAR GELVDLAPER EEVVLYWHHW QVQSPVMARL AQAIGDAAGR ALGGERRDPA GPG
|
| |