Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3869 |
Symbol | |
ID | 7874110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4263562 |
End bp | 4265412 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643700811 |
Product | PAS/PAC sensor signal transduction histidine kinase |
Protein accession | YP_002890834 |
Protein GI | 237654520 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGGCT TGAACCTGAT CGACGTAATC TGGCCGATGA TGGCCGCCAC CAGCTTGACG CTGGCGGTCA TCTTTTTCTT CATCTGGATT CACCTTCGGG CCCAGCGCGA TTACCTGGCT TTCGCCCTGT TCGCCGTCTG CGTCTGGTTT TACACGTTCG GCGAGTGGTC GCTGATGCGG GCGACGACGC CCGAGGCCTA CGGCGACACC CTGCGCTGGA TGATGGTCGT GGTCCCGATC GGGGTCATCC TCCTGGCCGC GTTCATCCGC TGGCACCTGC GCGCAGGCCG CGACTGGCTG TTCTGGAGCG TCTGCGGCCT GCGGCTGCTC GGCATCCCGC TCAACTTCCT GTTCGGGGCC AATGTCGTCT ATTCGGAGAT CACCGAGCTC CAGCAACAGG CGACCATGTT CGGCGACAGC ATCGCCATGC CGGTCGGCGT CCCGGGGCCC TTCCACTGGC TGGTCAACAA CCTCGCCGAC ATCCTGCTGG TCCTCTTTCT CGCCGACGCA ACCCATTCCG CCTGGCGCCG CGGCGACGCT CTGACCCGGC GGCGCGCCCT TTTGGTCGGT GGCGGCACCA TCCTGTTCTT CCTCATCGCC GCGGGCCATG CCACGCTGCT GCACGCCGGA CTGGCATCCT CGCCATATCT GATCAGCATC AGCTTTCTCG GCATCGTCCT GGTCATGTCC TATCAGCTGG GCTTCGATGT CTTCCGCTCG GTGCACCTGG CCATCCAGCT GCGCGAGAGC GAACAGCGGA TGGAGCTGGC CGTCCATGCC GCGCGCCTCG GGCTCTGGGA ATGGGACATC GAGCGCGACG TCTTCTGGGC CAACGACACC GGCTATGCGA TGTTTTCCTT TCTGCCCGGC GAGCGCGTGG ACTTCGACAG GTTCAGCCAG CGCATCCATC CCGAGGAGAG AGCCGCAGTC GTCACCGCCG TCCACGATGC CCTCGCCAAC CATCGCCGCT ACGAGAAGGA ATACCGCATC GTCCTGCCCT CCGGCGACAC CCGCTGGATG CACGCCTGGG GGCAGGCCGA ATACAGCGCC AGCGGCAAGC CGACCAAGCT GCTCGGCGTC GTCCTCGATG CCACCGAGCG CAAGCTGGCG GAGCAGCGCC AGGACGAGAT CGAGAGACTG GCCAACCGCC AGCGCGACGA ACTCGCGCAC CTGTCGCGCG TCGCCATGCT GGGCGAGCTT TCCGGCGCTC TCGCCCATGA GCTGAACCAG CCGCTGACCT CGATCCTCAG CAATGCCCAG GCGGCCCAGC TGTTCATCGC CCGCGGCGTG GTGAGCGCCG ACGAGATCGG CCCCATCCTC GACGACATCG TCAAGGCCGA CCGCCGTGCC GGCGACATCA TCCGGCGCCT GCGGCGGTTG CTAAAGAAGG ACGACTCGAC GCGGCAGTGG CTCGACCTCA ACGAGGTGGT CAGCGAGGTC CTGCGGCTGA CCAACAGCGA CCGCGTTTCC CGCGGGATCA CCATCAAGCT CGAGTTGTCG CCGGACCTGC CTCCTGTCTA TGGCGACGAG GTGCAATTGC AGCAGGTCCT CCTGAACCTG CTGAACAACG CCTGCGATGC CATCGAGGCC GTCGACGCGC TGCCGGCGCT GACCGTCCGC ACCGTATGCG AGGCCGGCAA CGTCATCGTC TCCGTGGCCG ACCGCGGCAG CGGCATCGGC GCCGAGGACA TGGAGCGCAT CTTCGAGCCC TTCGTCACCA CCAAGGCCAA GGGCCTGGGC TTGGGCCTGT CGATCTGCCG GACCATCATC CAGGCTCACG GCGGCCGGCT CTGGGCTGAA AACCGGCAGG ACGGCGGCGC GGTCTTCCAC TTCGTGGTGC CTGTCGCGTG A
|
Protein sequence | MPGLNLIDVI WPMMAATSLT LAVIFFFIWI HLRAQRDYLA FALFAVCVWF YTFGEWSLMR ATTPEAYGDT LRWMMVVVPI GVILLAAFIR WHLRAGRDWL FWSVCGLRLL GIPLNFLFGA NVVYSEITEL QQQATMFGDS IAMPVGVPGP FHWLVNNLAD ILLVLFLADA THSAWRRGDA LTRRRALLVG GGTILFFLIA AGHATLLHAG LASSPYLISI SFLGIVLVMS YQLGFDVFRS VHLAIQLRES EQRMELAVHA ARLGLWEWDI ERDVFWANDT GYAMFSFLPG ERVDFDRFSQ RIHPEERAAV VTAVHDALAN HRRYEKEYRI VLPSGDTRWM HAWGQAEYSA SGKPTKLLGV VLDATERKLA EQRQDEIERL ANRQRDELAH LSRVAMLGEL SGALAHELNQ PLTSILSNAQ AAQLFIARGV VSADEIGPIL DDIVKADRRA GDIIRRLRRL LKKDDSTRQW LDLNEVVSEV LRLTNSDRVS RGITIKLELS PDLPPVYGDE VQLQQVLLNL LNNACDAIEA VDALPALTVR TVCEAGNVIV SVADRGSGIG AEDMERIFEP FVTTKAKGLG LGLSICRTII QAHGGRLWAE NRQDGGAVFH FVVPVA
|
| |