Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_2554 |
Symbol | |
ID | 7269400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3108974 |
End bp | 3111967 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643567378 |
Product | protein serine phosphatase with GAF(s) sensor(s) |
Protein accession | YP_002463859 |
Protein GI | 219849426 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG1956] GAF domain-containing protein [COG2208] Serine phosphatase RsbU, regulator of sigma subunit |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGCGCG AACACCTGCT GACATGGTAC GGTTTGATCG TCGGCGGGGT GGTATGGGGC TGGCTGCTAC TGTTACCAAC CGTACCGACA GTACCGGCGT TGGTGGTGTT GTTCGCCGTA CTGGCTTTCG CTGTCGATCT GCTCGCGTTC CGCACCCCGC CTGCCGATGT GCATAGTCTT GCCCCGTTGG TGCTGGTCAG CGCCAGTCTC GCGCTAGGAC CTATTCCCGC GGCATGGATC GCAGCCGTCG AAGGATTTGT CTCCGGCGTG ACGATTCTGT TGCAGACCAA CCGTCCGCGT ACTCTGTTTT CGCTCTTGGG ACGACCGTTG TTACGGAGCG GTTTACGTGC GCTTGGCTTG CTTGTCGGCG CATGGCTCGC CACGATGTCA AGCGGTCAAC CACTGACGGC ATTACCGATG TCGCATGTAT TTGGCTGGAC ACTGCTCAGC TTTCCCTTCG TGACTCAGCT TGGGCGGATT GTGCGTGAGC TGTTGCAGGG TGGCTACAGC GGGCTGGCAA CGTGGTGGCG CTCGGCGTGG CCGGCCATCC TCGGTGCAGA GATAGCCCCG CTGCCACTGG CATGGCTGGG AGCAGCGATT GCCCACGATC TCGGAATGCT CCATCTGATA TTGGCCGGAG GAGCATTAGT TGCATCAGCA GCGATTTTGC GTCGCTCATC GCTCAACCTC CAACGCCAGC GTCGCTCGAT GCGCGAGTTG GCACGGCTTA ACGAAGTGAG CCGGGCGATC ATTCGTAGCG AACTTGATGT CGATGCCCTT TGTGAGTTGA TCTACCGCGA GGCGAGCCGA ATAGTTGACA CCTCGTCGTT TCACCTCGGT CTCTTCAACG GTCATTCCTA CACTCTCGTG GTGCGGGTCC AAGATCGGGT ACGGTTGCCA CGGCTCACGG TCGATCTGTC GGAAAATAGC GGTCTGATCG GCTGGATTCG CGAAACCGGA CGGGCGATTC TGGTCGAGGA TTTTACACGC GAGATGGATC GCCTCCCCGC CCGTCCACGT TATCAGAGCG AACGACCACC ACGTTCGGGT ATCTATGTGC CGCTCATTGC CGGTGAAACC GTGATCGGTT CGATTTCGGT GCAAAGCTAC GAACCGTCGG CATTTGACGC TAACGATCTG CGGCTGATCT CACTCATCGC CGATCAGGCC GCTGTGGCAA TCGCACGGGC GCGGGCCTTT CACGAAGCAC GTCAGCGGGC CAATCAGTTG CAAGCGATTC GTGAGGTAAG CCAGCAAATT ACCGCTATTC TCAATCTCGA TCGGTTGTTA CCCTCGATCG TGCAACTTAT TCGTGAGCGT TTCGGCTATC ACCCGGTACA CATTTTCACC CTCTCACCCG ACGATGAGCG GATCTACTTT CGTGCTTCTA CCGCCGATGG TGCCGATCTC GAACGGTTAC GTGCGCTTTC ACTGCGTATC GGTCAGGGGT TGGTCGGTGA AGCAGTGCAA CGCGGCGAAC CGGTGTTGGT CGGTGATGTG CTGAACGATC ATCGTGCGAT TCGCGATACC TTGCAGACGC GCTCGGAATT GGCGGTACCG TTGCGTGTGG GTACGACGGT GATCGGTGTG CTTGATGTGC AGAGCGATGA ACCGGACGAT TTCGATGAAG ATGATCTGTT CGTGATCCGG ACGTTGGCCG ATCAGATTGC GATTGCGATT GAGAGTGCTA ATGCCTACAC TGCTCAACAA GAGGAGGCCT GGACGCTGAA TGCACTGCTC CAGATCGCCG AAAATATCGG TCGAGCGACG ACACTCTCCG ATCTACTGGC AACGGTAGTT CGCCTACCAT CACTCCTGAT GGGGTGTCCA CGCTGTTATG TTGCCTTGTG GGATCGCGAA CAGGGTGATT TTGTGGTACG TGCAGTGTAC GGTTTGCCGA CCACAGCACG TACCGGTGTT CTCAACCAGC CGACACGTTC ACCGTTTCTC TGGCGGTTAC GTGAACGGGC CGCCGAGACG GATCAATCAC GGCTCGAATT GCTCTGGCAG GCCCAAGACA ATGCCGATCA GTGGCCGACA CTGATAACCG CAGCGCGCAG CGGTACTTTG GTTGGCTTAC CGATCAGTGC CCGTAACACA CTTCTCGGTG TTTTGGTGCT GGATTACAAC GATCCGTTCG TTTCACCGAG CACGCGCCAG CAGAACCTCT GCACCGGTGC TGCGGCTCAG ATTGCCGGTG CGCTGGAAAG TTTGCTGCTC GCTGCCGAAG CTGCCGAAGC TGCTCGTCTC GAACAAGAGT TACGTGTAGC GCGCGAGATT CAGCAATCGC TGCTACCCTC TCGTTTACCG AACGTTGCCG GTTGGCAGAT TGAAGCTACG TGGCAATCGG CCCGGCTGGT CGGCGGCGAT TTCTACGATT TTTGGTCATT GCCGACCGGA ACCGAACCAC CTCGCGAACT AGGGTTCGTC ATCGCCGATG TGAGTGATAA GGGCATACCG GCTGCCATGT TTATGACGAT GGCACGCTCG CTGGTACGAG CTGCGGCACT CGATGGCTCG GCGCCGGCAC GAGCGATGGA ACGCGCTAAC CGCTGGCTTT ACCGCGATTC CGAGTCAGGA ATGTTTGTTA CCCTCTTTTA CGCCCGCCTC GATCTTATCA CCGGTCAGCT TTGCTATACG TGTGCCGGAC ATAATCCTCC ACTGTTGTAC CGCGCAGCTA CCGGTGAGAT CGAAGAGCTA CGTACACCCG GTATTGCGTT AGGAGTGTTG CCGGAAGTGA CATTAGCGGA GGCGGAGACA CGGTTAGCAC CGGAGGATGT GTTGGTCTGT TACACAGATG GTGCAACTGA GACGATCAAT GAACTGCTTG TGCCTTTTGA TGTTGATGGT TTACGGGCTG TGATCAAGGC CTACGCCACA GGATCGGCAG CAACTATTAT GCAAGCAATT TTGGCTGCCG TTGCTCGTCA TAGTCACGGC CAACCACCGT TTGACGATAT TACGCTGATC GTGATTAAAC GTGCGTCAAC ATAA
|
Protein sequence | MSREHLLTWY GLIVGGVVWG WLLLLPTVPT VPALVVLFAV LAFAVDLLAF RTPPADVHSL APLVLVSASL ALGPIPAAWI AAVEGFVSGV TILLQTNRPR TLFSLLGRPL LRSGLRALGL LVGAWLATMS SGQPLTALPM SHVFGWTLLS FPFVTQLGRI VRELLQGGYS GLATWWRSAW PAILGAEIAP LPLAWLGAAI AHDLGMLHLI LAGGALVASA AILRRSSLNL QRQRRSMREL ARLNEVSRAI IRSELDVDAL CELIYREASR IVDTSSFHLG LFNGHSYTLV VRVQDRVRLP RLTVDLSENS GLIGWIRETG RAILVEDFTR EMDRLPARPR YQSERPPRSG IYVPLIAGET VIGSISVQSY EPSAFDANDL RLISLIADQA AVAIARARAF HEARQRANQL QAIREVSQQI TAILNLDRLL PSIVQLIRER FGYHPVHIFT LSPDDERIYF RASTADGADL ERLRALSLRI GQGLVGEAVQ RGEPVLVGDV LNDHRAIRDT LQTRSELAVP LRVGTTVIGV LDVQSDEPDD FDEDDLFVIR TLADQIAIAI ESANAYTAQQ EEAWTLNALL QIAENIGRAT TLSDLLATVV RLPSLLMGCP RCYVALWDRE QGDFVVRAVY GLPTTARTGV LNQPTRSPFL WRLRERAAET DQSRLELLWQ AQDNADQWPT LITAARSGTL VGLPISARNT LLGVLVLDYN DPFVSPSTRQ QNLCTGAAAQ IAGALESLLL AAEAAEAARL EQELRVAREI QQSLLPSRLP NVAGWQIEAT WQSARLVGGD FYDFWSLPTG TEPPRELGFV IADVSDKGIP AAMFMTMARS LVRAAALDGS APARAMERAN RWLYRDSESG MFVTLFYARL DLITGQLCYT CAGHNPPLLY RAATGEIEEL RTPGIALGVL PEVTLAEAET RLAPEDVLVC YTDGATETIN ELLVPFDVDG LRAVIKAYAT GSAATIMQAI LAAVARHSHG QPPFDDITLI VIKRAST
|
| |