Gene Information |
Locus tag | Tmz1t_1326 |
Symbol | |
ID | 7084447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1463459 |
End bp | 1466467 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698343 |
Product | diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) |
Protein accession | YP_002354981 |
Protein GI | 217969747 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain |
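The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1463459
end_bp = 1466467
reported_gene_length = 3009      # "Gene Length" field, in bp
reported_protein_length = 1002   # "Protein Length" field, in aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length matches:", gene_length == reported_gene_length)
print("whole number of codons:", remainder == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the three fields agree with one another. The Strand value does not affect this check, since it changes only the orientation of the gene, not its span.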
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.121951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCC CGCGCGGCGG CGTCTTCGAG CGTCCGCTGG TCCAGGCGGC GGTGGCGGTG CTGCTGGTGT TCGGGCTGCT CGCCGCCTTC CTGCTGTTCT CCCACCGCTC GGCCGAGACG ATCACCACCA CCGCCTCGCG CAACGAGGTG CGCGTGCTGG CCACGCAGAT CGAGGGCGCG CTGCGCCGCA TCGAGATGAG CATCGGGTTC ATGCGCGACA AGTACATCCG CATCGGCCTG CACGAGAACG CGGGCACCCC CGCGTGGGAG GCCTCGCTGG TGCAGCTGCA GCTGGAGTTG CAGCATCTGG CGCGCGGCTT TCCGGAAGCC GTCGCCATCC TGGTGACCGA CGCGCAGGGC AAGATCCTGG CCAGCACCCT CGACACGCCG CCGCAATGGG ACATCGGCGA CCGCGAATAT TTCCGCCACG CCCGCGAACA CCGCCAGGAC GCGCTCGCTT TCTCCGAGCC CGTGATCTAC AAGGCCGGCC CGCGCGAGGT CGTGGTGGCC TACGAAGCGA TCCGCAACCC GGCGGGCGAT TTCGTCGGCG TGGTCGCGGT CGCGCTCGAC CTGCAGCGCC TGCAGGACCT GCTGCTCGGG GTGGACGCCG GTCGCCAGGG CATGGTCAGC ATCCGGCGCA GCGACGACGG CCGCCTGGTG CTGCGCTTGC CGGACCACGG CGCCAGCCGG GTCGCCGACA TCACCAGCAC CGAGCCTTTC CTGGCCATCC GCCGCGGCGA GACGTCCGGC AGCGCGCGCT ACGTCGGCAT CGCGGACGAA GTCGAGCGCA GCTTCGCCTT CCAGGCGCTG CGCGACTACC CCTTCTACGT GGTGATCGGC CGCGCGATCT CCGAGCAATT CGCGCCCTGG CGGCAGACCG CGACCGTCGC CACGATCATC GCCTTGGCCA CCCTGGGCCT GCTGCTGGCG CTGCAGGCTG CCCTGCACCG CAGCCGCCGC CAGCTCGCGC GCAGCAAGCG CAGCTTCGAC GCCCTCATCG ACAGCCGCCG GGAGGCGACC TGCGCCTGGC GGCCCGATAC CACGCTGCTG ACCTGCAACG AGCGCTACGC CGAACTGCTC GGCACCGAGG CCGCGTCCCT GCCGGGCACG CGCTGGATCG ACAACGTGCC GGAGGCGAAG CGCGCGGAGG TCCACGCCGC CATCGCGCGG ATGCTGCAGG GCGCCGGCAC GCTCACCACC GATCGCCGCC TCCGCCAGGC CGACGGCCGC ACGCGCTGGC TGCGCTGGCT CGACCTGCCG GTGCTCGACG AGGAGGGGCG CTGCGTCGAA GTGCACTCGA TCGGCCAGGA CATCACCGCG CAGAAGGAGG TCGAGCTGCG CCTGCGCCAG CTCGCACTGG CCGTGGAGCA GAGCCCGAAC ACGATCGTGA TCACCGACAC CGATGGTCGC ATCGAGTATG TCAACGAGGC CTTCGTGCGC ACCACCGGCT ACACCGCCGC AGAGGCGATC GGCCAGAACC CGCGCCTGCT CAACGCCGGC AAGACCTCCG CCGCCACCTA TGCCGACCTG TGGGCCACGC TCGGCCGCGG CGAGGTGTGG CGCGGCGAGT TCATCAACAC CCGCAAGGAC GGCGGCACCT ACACCGAGCT CGCCACGATC GCGCCGATCA AGCAGGCCGA CGGCAGCGTC AGCCACTATG TGGCGATCAA GGAGGACATC ACCGGGCGCC GCGAGGCCGA GGCGCGCATC CGCCAGCTCG CCTACTACGA CACCCTCACC GGCCTGCCCA ACCGCAGCCT GATGTGGGAC CGCCTGCGCC ACGCCATCGC GGCGAGCGCG 
CGCAGCGGCG CGAGCGGCAT GCTGATGCTG CTCGACATCG ACCACTTCAA GCTGCTCAAC GACACCCAGG GCCACGAGGT GGGCGATGCC CTGCTGCGCG AGGTGGCGCA GCGCCTGCGC GGCGCGCTGC GCGAGGAGGA CACGGTGGCG CGGGTGGGCG ACGACGACTT CGCGATCGTG GTCGAGAACC TGGGCGCGGA CCGCGACGAA GCGATCGGCC GCGCCGAGAA GATCGCCGAG CACCTGCACC GCTGCGTGAC CGCGCCCTGC GAGCTCGGCC TCGCGAGCGG CCCCTACCAC GTCGGCGCGA GCCTGGGCCT GACCCTGTTC CGCGGCCGCA ACGCGGCGGC CGACGCGGTG CTCAAGCAGG CCGAGGTGGC GGTGGCGCGC GCCAAGGACG ACGGCCGCAA CCTGATCCGC TTCTTCAGCG AGGCGATGCA GGCGGTGGTG GCCGCGCGCG CCGAGCTGGA GCTGAAGCTG CGCGCGGCGC TGGCCTGCAA CGGCTTCCGC CTGTACTACC AGCCGCAGCT CGACCGCAAC GGCCGCGTGA TCGGCGCCGA GGCGCTGATC CGCTGCTTCG ACGCCGAGGG CGAGATGATC TCGCCCGCCG CCTTCATCCC GCTCGCCGAG GAGACCGGCC TCATCGTGCC GATCGGCGAG TGGGTGCTGG AGACCGCCTG TGCGCAACTG CGTGCATGGC AGCGCACGCC CTCCACGGCC GGGCTGAGCC TGTCGATCAA CGTCAGCGCG CGCCAGTTCC ACCAGCCCGA CTTCGTCGGC AAGGTGGCGG CGGCGATCGA GCGCCACCGC ATCCGCCCCG GCGGCCTCGA GATCGAGCTC ACCGAGAGCG TGGTGATCGG CGACATCGAG ACCACGGTGC TGCGCATGCG CCAGATCAAG GCCCTGGGGG TGAGGTTCGC ACTCGACGAC TTCGGCACCG GCTATTCCTC GCTGTCCTAT CTCAAGCGCC TGCCCTTCGA CCAGCTCAAG ATCGACCAGA TGTTCGTGCG CGACATGGAG AAGGACACCA GCAGCGAGGC GATCGTGCGC GCGATCCTCG CGCTCAGCCG CTCGCTCGAC CTGAAGGTGG TGGCCGAGGG TGTCGAGACC GCCGCCCAGC ACGAGCTGCT GCTGACGCGC GGCTGCGAGC TGTTCCAGGG CTATCTGTTC GGCCGGCCGG TTCCGATCGA GGACTGGAAG GCAGGCTGA
|
Protein sequence | MSAPRGGVFE RPLVQAAVAV LLVFGLLAAF LLFSHRSAET ITTTASRNEV RVLATQIEGA LRRIEMSIGF MRDKYIRIGL HENAGTPAWE ASLVQLQLEL QHLARGFPEA VAILVTDAQG KILASTLDTP PQWDIGDREY FRHAREHRQD ALAFSEPVIY KAGPREVVVA YEAIRNPAGD FVGVVAVALD LQRLQDLLLG VDAGRQGMVS IRRSDDGRLV LRLPDHGASR VADITSTEPF LAIRRGETSG SARYVGIADE VERSFAFQAL RDYPFYVVIG RAISEQFAPW RQTATVATII ALATLGLLLA LQAALHRSRR QLARSKRSFD ALIDSRREAT CAWRPDTTLL TCNERYAELL GTEAASLPGT RWIDNVPEAK RAEVHAAIAR MLQGAGTLTT DRRLRQADGR TRWLRWLDLP VLDEEGRCVE VHSIGQDITA QKEVELRLRQ LALAVEQSPN TIVITDTDGR IEYVNEAFVR TTGYTAAEAI GQNPRLLNAG KTSAATYADL WATLGRGEVW RGEFINTRKD GGTYTELATI APIKQADGSV SHYVAIKEDI TGRREAEARI RQLAYYDTLT GLPNRSLMWD RLRHAIAASA RSGASGMLML LDIDHFKLLN DTQGHEVGDA LLREVAQRLR GALREEDTVA RVGDDDFAIV VENLGADRDE AIGRAEKIAE HLHRCVTAPC ELGLASGPYH VGASLGLTLF RGRNAAADAV LKQAEVAVAR AKDDGRNLIR FFSEAMQAVV AARAELELKL RAALACNGFR LYYQPQLDRN GRVIGAEALI RCFDAEGEMI SPAAFIPLAE ETGLIVPIGE WVLETACAQL RAWQRTPSTA GLSLSINVSA RQFHQPDFVG KVAAAIERHR IRPGGLEIEL TESVVIGDIE TTVLRMRQIK ALGVRFALDD FGTGYSSLSY LKRLPFDQLK IDQMFVRDME KDTSSEAIVR AILALSRSLD LKVVAEGVET AAQHELLLTR GCELFQGYLF GRPVPIEDWK AG
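Both sequence fields are displayed in space-separated blocks of 10 characters, and the Gene sequence as shown begins with ATG, so it appears to be given in coding orientation even though the Strand field is "-". As a minimal sketch of how the two fields relate, the code below strips the block spacing from the first 30 bases and translates them. The codon table is deliberately partial: it holds only the codons that occur in this excerpt, whose assignments are the same in the standard code and in translation table 11. The function names are hypothetical helpers, not part of any database tool.

```python
# First three 10-base blocks of the Gene sequence field, as displayed.
gene_excerpt = "ATGAGCGCCC CGCGCGGCGG CGTCTTCGAG"
# First 10-residue block of the Protein sequence field.
protein_excerpt = "MSAPRGGVFE"

# Partial codon table: only the codons present in the excerpt above.
CODONS = {
    "ATG": "M", "AGC": "S", "GCC": "A", "CCG": "P", "CGC": "R",
    "GGC": "G", "GTC": "V", "TTC": "F", "GAG": "E",
}

def strip_blocks(seq):
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split())

def translate(dna):
    """Translate complete codons in frame 1 using the partial table."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)

dna = strip_blocks(gene_excerpt)
print("translation matches:", translate(dna) == protein_excerpt)
print("GC fraction of excerpt:", round(gc_fraction(dna), 3))
```

The GC fraction printed here covers only the 30-base excerpt, so it is not expected to equal the whole-gene figure in the GC content field. Applying `gc_fraction` to the full, space-stripped Gene sequence would be the comparable calculation.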