Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3721 |
Symbol | |
ID | 7873720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4088485 |
End bp | 4089990 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700667 |
Product | two component, sigma54 specific, transcriptional regulator, Fis family |
Protein accession | YP_002890691 |
Protein GI | 237654377 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCGCGA AACGGATCGG GGGCGGCGGG GACGACGTGC AGATGGCGCG CGGCGTGCGG GGCGACGAGG CGAAGGCGCT CGCGCAAGGC GGAGTCGGGC GCGCCGAGCC CGCGCGCCCG GTGGAGCTCG CCGACTACCA GTGGCCTGCG CACTCGGTGC TGGTGGTCGA CGACGAGGAG GGCATGCGCA ATTTCCTCGA GCGCACGCTC GCGCGCCGCT GCGGCATGGT GCAGTCGGCC GCCGATGCCG AGCACGCCGC GGTGCTGATG GCGCGTCTGC ATTTCGACCT GCTGGTCCTC GACATCGCCC TGCCTGGCAA GTCGGGGATC GAGTGGCTGC ACGAGCTGCG CGAGAATGGT TATGCCGGCG ACGTGATCCT GATCACCGCC TTCGCCGACA TGGAGACCGC GATCGACGCG CTGCGCGGCG GCGCCTCGGA CTTCATCCTC AAGCCCTTCC GCGTCGACCA GATCCTCAAC TCGATCAAGC GCTGCTTCGA GCGTGCCGGC CTGGCGCGCG AGAACTTCGT GCTGCGCCGC GAGCTCGCCG GCCTGGGCGC CGAGCCCACC GGGCTGATCG GCCACTCGCC CGCGATGGAG CAGCTGCGCG CGCTGGTGCG CCGCGTCGGG CAGATGCCGA GCACGGTGCT GCTGCTGGGC GAATCCGGTA CCGGCAAGGA GGTGGTGGCG CGCGCGTTGC ACCAGACCAG CCCGCGCGCG CAGCGCCCCT TCGTGCCGCT CAACTGCGCG GCGATCGCCT CCGAGCTGAT CGAGAGCGAG CTCTTCGGCC ACGTCAAGGG CGCCTTCACC GGCGCCACCG AGAACCGCAA CGGTCTGTTC TACTACGCCC ACGGCGGCAC GCTCTTCCTC GACGAGATCA GCGAGCTGCC GCTGGCGATG CAGACCCGGC TGCTGCGGGT ACTCGAGGAG CGCAAGCTGC GCCCGGTCGG TTCCGAGCGC GAGCTGCCGG TGGATGTACG CATCATCGCC GCCTCCAACC GCGACCTCGC CGCCGAGGTG CGCGCCGGAC GCTTCCGCGA GGACCTCTAC TACCGCCTCG CGGTGGTGGA CATCGGCCTG CCGCCGCTGC GCGACCGTAC CGAGGACATC CCCGAGCTGA TGCGCCACTT CATGCAGCAG TTTTCCGTCC AGCTCGGCGT GCCGCCGCTG CCGCTCTCGC ACGAGGTGGT GAACCGGCTC GCCGGCTACA CCTGGCCTGG CAACGTGCGC GAACTGCGCA ACTACATCGA GCGCTCGCTG ATTCTCGGCC ACTTCCCGGC GCAGCCGAGC GCCGCACCGA CCGCCGCGCC GACGCCGGCG GGCGAGCTGG AGTCGAGCCT GGCCGAGGTC GAGCGCCGCC ACATCGAGCG TGTGACCGCC GCCTGCGAGG GCAACAAGAC CGAGGCCGCG CGCCGGCTGG GCGTGTCGCG CAAGACCCTG GAGCGCAAGT TCGCCGAGTG GGCGCTCGAG GACGCCGCGG CCGCGCGTGC GGAGCGGCGG GCCTGA
|
Protein sequence | MVAKRIGGGG DDVQMARGVR GDEAKALAQG GVGRAEPARP VELADYQWPA HSVLVVDDEE GMRNFLERTL ARRCGMVQSA ADAEHAAVLM ARLHFDLLVL DIALPGKSGI EWLHELRENG YAGDVILITA FADMETAIDA LRGGASDFIL KPFRVDQILN SIKRCFERAG LARENFVLRR ELAGLGAEPT GLIGHSPAME QLRALVRRVG QMPSTVLLLG ESGTGKEVVA RALHQTSPRA QRPFVPLNCA AIASELIESE LFGHVKGAFT GATENRNGLF YYAHGGTLFL DEISELPLAM QTRLLRVLEE RKLRPVGSER ELPVDVRIIA ASNRDLAAEV RAGRFREDLY YRLAVVDIGL PPLRDRTEDI PELMRHFMQQ FSVQLGVPPL PLSHEVVNRL AGYTWPGNVR ELRNYIERSL ILGHFPAQPS AAPTAAPTPA GELESSLAEV ERRHIERVTA ACEGNKTEAA RRLGVSRKTL ERKFAEWALE DAAAARAERR A
|
| |