Gene Cagg_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2554 
Symbol 
ID7269400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3108974 
End bp3111967 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content57% 
IMG OID643567378 
Productprotein serine phosphatase with GAF(s) sensor(s) 
Protein accessionYP_002463859 
Protein GI219849426 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1956] GAF domain-containing protein
[COG2208] Serine phosphatase RsbU, regulator of sigma subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCGCG AACACCTGCT GACATGGTAC GGTTTGATCG TCGGCGGGGT GGTATGGGGC 
TGGCTGCTAC TGTTACCAAC CGTACCGACA GTACCGGCGT TGGTGGTGTT GTTCGCCGTA
CTGGCTTTCG CTGTCGATCT GCTCGCGTTC CGCACCCCGC CTGCCGATGT GCATAGTCTT
GCCCCGTTGG TGCTGGTCAG CGCCAGTCTC GCGCTAGGAC CTATTCCCGC GGCATGGATC
GCAGCCGTCG AAGGATTTGT CTCCGGCGTG ACGATTCTGT TGCAGACCAA CCGTCCGCGT
ACTCTGTTTT CGCTCTTGGG ACGACCGTTG TTACGGAGCG GTTTACGTGC GCTTGGCTTG
CTTGTCGGCG CATGGCTCGC CACGATGTCA AGCGGTCAAC CACTGACGGC ATTACCGATG
TCGCATGTAT TTGGCTGGAC ACTGCTCAGC TTTCCCTTCG TGACTCAGCT TGGGCGGATT
GTGCGTGAGC TGTTGCAGGG TGGCTACAGC GGGCTGGCAA CGTGGTGGCG CTCGGCGTGG
CCGGCCATCC TCGGTGCAGA GATAGCCCCG CTGCCACTGG CATGGCTGGG AGCAGCGATT
GCCCACGATC TCGGAATGCT CCATCTGATA TTGGCCGGAG GAGCATTAGT TGCATCAGCA
GCGATTTTGC GTCGCTCATC GCTCAACCTC CAACGCCAGC GTCGCTCGAT GCGCGAGTTG
GCACGGCTTA ACGAAGTGAG CCGGGCGATC ATTCGTAGCG AACTTGATGT CGATGCCCTT
TGTGAGTTGA TCTACCGCGA GGCGAGCCGA ATAGTTGACA CCTCGTCGTT TCACCTCGGT
CTCTTCAACG GTCATTCCTA CACTCTCGTG GTGCGGGTCC AAGATCGGGT ACGGTTGCCA
CGGCTCACGG TCGATCTGTC GGAAAATAGC GGTCTGATCG GCTGGATTCG CGAAACCGGA
CGGGCGATTC TGGTCGAGGA TTTTACACGC GAGATGGATC GCCTCCCCGC CCGTCCACGT
TATCAGAGCG AACGACCACC ACGTTCGGGT ATCTATGTGC CGCTCATTGC CGGTGAAACC
GTGATCGGTT CGATTTCGGT GCAAAGCTAC GAACCGTCGG CATTTGACGC TAACGATCTG
CGGCTGATCT CACTCATCGC CGATCAGGCC GCTGTGGCAA TCGCACGGGC GCGGGCCTTT
CACGAAGCAC GTCAGCGGGC CAATCAGTTG CAAGCGATTC GTGAGGTAAG CCAGCAAATT
ACCGCTATTC TCAATCTCGA TCGGTTGTTA CCCTCGATCG TGCAACTTAT TCGTGAGCGT
TTCGGCTATC ACCCGGTACA CATTTTCACC CTCTCACCCG ACGATGAGCG GATCTACTTT
CGTGCTTCTA CCGCCGATGG TGCCGATCTC GAACGGTTAC GTGCGCTTTC ACTGCGTATC
GGTCAGGGGT TGGTCGGTGA AGCAGTGCAA CGCGGCGAAC CGGTGTTGGT CGGTGATGTG
CTGAACGATC ATCGTGCGAT TCGCGATACC TTGCAGACGC GCTCGGAATT GGCGGTACCG
TTGCGTGTGG GTACGACGGT GATCGGTGTG CTTGATGTGC AGAGCGATGA ACCGGACGAT
TTCGATGAAG ATGATCTGTT CGTGATCCGG ACGTTGGCCG ATCAGATTGC GATTGCGATT
GAGAGTGCTA ATGCCTACAC TGCTCAACAA GAGGAGGCCT GGACGCTGAA TGCACTGCTC
CAGATCGCCG AAAATATCGG TCGAGCGACG ACACTCTCCG ATCTACTGGC AACGGTAGTT
CGCCTACCAT CACTCCTGAT GGGGTGTCCA CGCTGTTATG TTGCCTTGTG GGATCGCGAA
CAGGGTGATT TTGTGGTACG TGCAGTGTAC GGTTTGCCGA CCACAGCACG TACCGGTGTT
CTCAACCAGC CGACACGTTC ACCGTTTCTC TGGCGGTTAC GTGAACGGGC CGCCGAGACG
GATCAATCAC GGCTCGAATT GCTCTGGCAG GCCCAAGACA ATGCCGATCA GTGGCCGACA
CTGATAACCG CAGCGCGCAG CGGTACTTTG GTTGGCTTAC CGATCAGTGC CCGTAACACA
CTTCTCGGTG TTTTGGTGCT GGATTACAAC GATCCGTTCG TTTCACCGAG CACGCGCCAG
CAGAACCTCT GCACCGGTGC TGCGGCTCAG ATTGCCGGTG CGCTGGAAAG TTTGCTGCTC
GCTGCCGAAG CTGCCGAAGC TGCTCGTCTC GAACAAGAGT TACGTGTAGC GCGCGAGATT
CAGCAATCGC TGCTACCCTC TCGTTTACCG AACGTTGCCG GTTGGCAGAT TGAAGCTACG
TGGCAATCGG CCCGGCTGGT CGGCGGCGAT TTCTACGATT TTTGGTCATT GCCGACCGGA
ACCGAACCAC CTCGCGAACT AGGGTTCGTC ATCGCCGATG TGAGTGATAA GGGCATACCG
GCTGCCATGT TTATGACGAT GGCACGCTCG CTGGTACGAG CTGCGGCACT CGATGGCTCG
GCGCCGGCAC GAGCGATGGA ACGCGCTAAC CGCTGGCTTT ACCGCGATTC CGAGTCAGGA
ATGTTTGTTA CCCTCTTTTA CGCCCGCCTC GATCTTATCA CCGGTCAGCT TTGCTATACG
TGTGCCGGAC ATAATCCTCC ACTGTTGTAC CGCGCAGCTA CCGGTGAGAT CGAAGAGCTA
CGTACACCCG GTATTGCGTT AGGAGTGTTG CCGGAAGTGA CATTAGCGGA GGCGGAGACA
CGGTTAGCAC CGGAGGATGT GTTGGTCTGT TACACAGATG GTGCAACTGA GACGATCAAT
GAACTGCTTG TGCCTTTTGA TGTTGATGGT TTACGGGCTG TGATCAAGGC CTACGCCACA
GGATCGGCAG CAACTATTAT GCAAGCAATT TTGGCTGCCG TTGCTCGTCA TAGTCACGGC
CAACCACCGT TTGACGATAT TACGCTGATC GTGATTAAAC GTGCGTCAAC ATAA
 
Protein sequence
MSREHLLTWY GLIVGGVVWG WLLLLPTVPT VPALVVLFAV LAFAVDLLAF RTPPADVHSL 
APLVLVSASL ALGPIPAAWI AAVEGFVSGV TILLQTNRPR TLFSLLGRPL LRSGLRALGL
LVGAWLATMS SGQPLTALPM SHVFGWTLLS FPFVTQLGRI VRELLQGGYS GLATWWRSAW
PAILGAEIAP LPLAWLGAAI AHDLGMLHLI LAGGALVASA AILRRSSLNL QRQRRSMREL
ARLNEVSRAI IRSELDVDAL CELIYREASR IVDTSSFHLG LFNGHSYTLV VRVQDRVRLP
RLTVDLSENS GLIGWIRETG RAILVEDFTR EMDRLPARPR YQSERPPRSG IYVPLIAGET
VIGSISVQSY EPSAFDANDL RLISLIADQA AVAIARARAF HEARQRANQL QAIREVSQQI
TAILNLDRLL PSIVQLIRER FGYHPVHIFT LSPDDERIYF RASTADGADL ERLRALSLRI
GQGLVGEAVQ RGEPVLVGDV LNDHRAIRDT LQTRSELAVP LRVGTTVIGV LDVQSDEPDD
FDEDDLFVIR TLADQIAIAI ESANAYTAQQ EEAWTLNALL QIAENIGRAT TLSDLLATVV
RLPSLLMGCP RCYVALWDRE QGDFVVRAVY GLPTTARTGV LNQPTRSPFL WRLRERAAET
DQSRLELLWQ AQDNADQWPT LITAARSGTL VGLPISARNT LLGVLVLDYN DPFVSPSTRQ
QNLCTGAAAQ IAGALESLLL AAEAAEAARL EQELRVAREI QQSLLPSRLP NVAGWQIEAT
WQSARLVGGD FYDFWSLPTG TEPPRELGFV IADVSDKGIP AAMFMTMARS LVRAAALDGS
APARAMERAN RWLYRDSESG MFVTLFYARL DLITGQLCYT CAGHNPPLLY RAATGEIEEL
RTPGIALGVL PEVTLAEAET RLAPEDVLVC YTDGATETIN ELLVPFDVDG LRAVIKAYAT
GSAATIMQAI LAAVARHSHG QPPFDDITLI VIKRAST