Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1907 |
Symbol | |
ID | 7085676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2150358 |
End bp | 2153399 |
Gene Length | 3042 bp |
Protein Length | 1013 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643698932 |
Product | diguanylate cyclase with PAS/PAC sensor |
Protein accession | YP_002355554 |
Protein GI | 217970320 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.527158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCCCTCT CCTCCCTCCT GTATCGTTCG CGCGCCACGG TCGCCGCCCT CGTTCGCAGT GACGGGATCG CGGGCAGGCT GGTGGTTGCC TGGCTGGCGT TCACGGTACT GCTCGGCGGC TACTGGTTGC AGCTCGACCA CTCGCACACC GCGCTCGTCT CCCAGGCCGA GAGCCTGACC CGGTTGCGTG CCCTGCAGAC CGCACACGCA TTGGCGCTGC ACACCGGAGC GCTGTTCCGC AAGCTCGACT ATCTTTCCCT GCACCTGGGC GAGCATTGGC TGAAGGAAGG TCCGCAGGGC GCGCGCGACG CCCTCGTGCG TGCGCACCTT GCGCTTCCCG AAGACGCCCT GGTCGGCATC GACGTCGCCG ATGCCGAGGG GCGGCTCCTC TACTCCACCG TCGGCGAGCG CCTGCCGCTC GAACTCGAGC GCTACGGCAT CGCGGACTTC GGGCATTTTC GCGTCCATCT GGAAGGGGAC AAGGGCCTCT TCATCGGCCA GCCCTACCGC GGACGCATGA CCGGCGCCTG GATCGTGCCG TTTTCCCGTG CGCTCGTGGA GGAGGGGCGA CTGCGCGGCG TGATCGTCCT GTCGGTGTCG GTCGCCCATC TCACCCGGGC ACTGAGGGAA CTCTATCCGG GGACCTCGGA CGTGGCGGCG CTCGTGCTCG ACGATGGTCA CTTCCTCGCC CACTCCAACC GGACCGCGGG AGCCATCGGG CGCTCGGTGC CGTCCGACCG CCCCTATCTG CTCGATCGCG TGGCGCTGCA GGGCAACTTC GATGCGACCG CGCCACTGGA CGGCGTCGAG CGTTATTACG CCTGGCACCG CGTTGCGGGC TTTCCCGTCG TCGCCGCGCT CGGGCTGAGC AAGGCGGACG CGCTCGCCCC GGTGCGCGGC ACGATCCGCG ACAACCGCCT CCGCAGCGGC CTGGGCTCGC TGCTGCTCTT GGCAGCGGCC TTGCTGATCA GCCTGCTCTG GCGCCGCCAC GCCCGCCAGG GCGAAGAGCT CGGCCGCGCC CGCAGGCGGC TGGAGAAGCT GGTCGGTCAC TTTCCGGGCA TGGCCTACCA GTTCCTGCTG CGTGCGGACG GCAGCACCTC CATGCCCTAC TGCAGCCCGG GCGTGCGCGA ACTCTACGGG GTGGGGCCCG AGGGGCTGCA GGATTCGGCG ATGCGCCTGT TCGAGCGCGT CCACGCCGAC GATCTGGACC GTCTGCGCGA CAGCATCCTC CGCTCGGCCG CGAACCTCGC GCCCTGGCGA TGCGAATTCC GCGTGCGGGG CGAGGGCGAC CGGCTGCGCT GGCTGCTCGG CGAAGCCAAC CCCGAGCGCA CGCCCGAAGG CGACACGCTG TGGCACGGCT ATGTGCACGA CATCACCGAG CGCCAGGTGG CTGCGCAGCA GTTGCGCGAA AGCGAGGCGC GCCTGCGCCA GGCGGTCGAC GCCGTCCGCG ACGGCCTGTG GTCGTGGGAT CTCGCCCGCG ACCGCATCGC GCTCGACGAG CGCATCCTCG AGATGCTCGA TCGTCCGGAA CTGGGCTCGG AGATGTGCTT CGACGACTTT TGCGCGCTCG TCCATCCGGC CGATCGCCAG CGCCAGGACG CGATGCTGCG CGCCGTGCCG AGCGCGACGC CCGACGCTCC CGTGGCGGGC GACCTCCGCC TGCGCGCCGC GTCCGGGCGC TGGGTCTGGA TCCACGCCCG CGGCAGCGTC GTGGAGACCG ACGCGCAGGG TGGCCCGCGG CGTCTCGTGG GCACCTTCTC CGACATCACC GCACGGGTCG CCGCCACGCA GTTGCGTCGG GCCCTCCTCG ACCACAGCGC GGCCGCCATC GCCATGCTCG ACGCCGACCT CGACATCCTC GAAGCCAACA GCCGCGCCCA CGAGATCTTC GCGCCGCCCT CGACCCCGAT CGGCGGACTC AAGCTCACCG ACCTCGCCCT GCACGAGGAC CAGGCCATCT TCATCCCGCA GCACTACGCG GCGCTGCGGG CCGGCGGACA GGTCAACATC GAACTGCCCT TGCGCGACAA TCGTGGCAAG AAACGCTGGT TCGACGTACA TGGCGTGATG CTGGACCCGG AGGACCCCGA CAGCGACACC GTCTGGACCC TGGTCGACAT CTCCGACAAG TACCGCACGC GGATGGCGCT CGCCACCGAG CGCCTGCGCC TGAAGACGGT GCTCGAGCGC TTCCCGGGCG GCGTCCTGAT GGAAGACCAG GACGGCCTGA TCAGCTCGGT GAACCGGGGG TTCTGCGAAC TGCTCGGTCT CGCGGCCGCC CCGGACGAAC TGATCGGCCT CACGCACGCG GCGCTGTGCG AGCGGCTGGG TACGGAGCGC ATGGGGTGGC TGCATCGGCC CGACGCCGAC CTCACGGCGG AGAAGCGCGC CACGGTCGAA GTGGAGGGCG TGAAGGGGCG CACGCTGGAG ATCGACTGGC TGCCGATCGA ACACGACGGG CGACGCCTCG GCCGCGTGTG GCTGCTGCGC GACGTCACCG AGCGCAAGGA GCGCGAACGC CGGCTGGCGG AGCTCGCCGC CACCGACCCG CTGACCGGGC TGCCCAACCG GCGCAGCTTC CTGGCGTGCC TGGACGCCGC GCTCGACGAC GCCCGGCGCG AGCCCGCGCG CGGCAGCGCA CTGTTGATGA TCGACATCGA CCACTTCAAG CACGTCAATG ACACCCACGG GCACCCGGTC GGCGACGAGG TATTGCAGCA CGCGGCACGG CTGATCCGCG GCGGCCTGCG CCAGCACGAC CGGGCGGGCC GCCTCGGCGG CGAGGAATTC GCCGTGCTGC TCGACGACGT CGACGCCGAC ACCGACACCG CCCTCACCCT CGCCGAACGC CTGCGCCGTA GCGTGGAGGC GGCGCCCGCG GCGACCGCGG CCGGCGCGGT GTCGCTCACC ATCAGCCTCG GTCTTGCCCT CGTGTCGGGA GGGGATCCCG CCCGCGTCCT CGGTGACGCC GATGCCGCGC TCTACCGTGC CAAGCGCGGC GGCCGCAACC GTGTCTGCGT CGCAGTTCGC ACCCCGGGAT AG
|
Protein sequence | MPLSSLLYRS RATVAALVRS DGIAGRLVVA WLAFTVLLGG YWLQLDHSHT ALVSQAESLT RLRALQTAHA LALHTGALFR KLDYLSLHLG EHWLKEGPQG ARDALVRAHL ALPEDALVGI DVADAEGRLL YSTVGERLPL ELERYGIADF GHFRVHLEGD KGLFIGQPYR GRMTGAWIVP FSRALVEEGR LRGVIVLSVS VAHLTRALRE LYPGTSDVAA LVLDDGHFLA HSNRTAGAIG RSVPSDRPYL LDRVALQGNF DATAPLDGVE RYYAWHRVAG FPVVAALGLS KADALAPVRG TIRDNRLRSG LGSLLLLAAA LLISLLWRRH ARQGEELGRA RRRLEKLVGH FPGMAYQFLL RADGSTSMPY CSPGVRELYG VGPEGLQDSA MRLFERVHAD DLDRLRDSIL RSAANLAPWR CEFRVRGEGD RLRWLLGEAN PERTPEGDTL WHGYVHDITE RQVAAQQLRE SEARLRQAVD AVRDGLWSWD LARDRIALDE RILEMLDRPE LGSEMCFDDF CALVHPADRQ RQDAMLRAVP SATPDAPVAG DLRLRAASGR WVWIHARGSV VETDAQGGPR RLVGTFSDIT ARVAATQLRR ALLDHSAAAI AMLDADLDIL EANSRAHEIF APPSTPIGGL KLTDLALHED QAIFIPQHYA ALRAGGQVNI ELPLRDNRGK KRWFDVHGVM LDPEDPDSDT VWTLVDISDK YRTRMALATE RLRLKTVLER FPGGVLMEDQ DGLISSVNRG FCELLGLAAA PDELIGLTHA ALCERLGTER MGWLHRPDAD LTAEKRATVE VEGVKGRTLE IDWLPIEHDG RRLGRVWLLR DVTERKERER RLAELAATDP LTGLPNRRSF LACLDAALDD ARREPARGSA LLMIDIDHFK HVNDTHGHPV GDEVLQHAAR LIRGGLRQHD RAGRLGGEEF AVLLDDVDAD TDTALTLAER LRRSVEAAPA ATAAGAVSLT ISLGLALVSG GDPARVLGDA DAALYRAKRG GRNRVCVAVR TPG
|
| |