Gene Cagg_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1139 
Symbol 
ID7267887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1406671 
End bp1408602 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content53% 
IMG OID643565982 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_002462485 
Protein GI219848052 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains
[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.535539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0122943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGGCG ATACGATCTT GCTGTTACTC ACCAACTCCA CCAACCGGCA ACTGTTAGAA 
ACGTGGTTGG GCCACCACTA TTCGATCATC GTCGGCAATG ACAAAACGGC TTTGCAACAA
CCGTTTCAAC TGTGTATTAT CGATGGCCCC GCCCTCAACC GGTACGAAGC CGAGATCCAC
GCCCGTCGCG CAACTGCCCA CCCTATCTTT CTCCCGTTCT TACTTGTTAC GACACGACGC
GATGTTCACC TGTACACCCG TCATCTTTGG CAGACAGTTG ACGAGTTGGT GACCAGTCCA
ATTGAGAAAG CAGAATTGCT CGCACGGATC GAGATCCTCT TGCGCGCACG CCGGTCGGCG
CTCGAATTGA ACCGTCTTCA GCAGGCGATG CTATCGAGTA CCCAAACGTG GTTACAGTTA
GCCGTCAGCG CATCCCAAAT CGGCTTGTGG GAATGGGATC TTATCACGAA CCGTGTCTTC
TTTTCACCAG AATGGAAAGC ACAGCTTGGA TATGCCGCTG ATGAGCTGAA CGATTCCTTC
GCTGTCTGGG AAGAACGCCT CCACCCCGAT GATCGTGAGC GCTGCCTCCG CACCTTGCAG
CAATATCTTG AGCGGCCATG GCCGAACTTT TCCCTCGAAT TCCGCTTGCG CCATAAAGAC
GGGAGCTACC GTTGGATCCG CTCACAAGCT GCCCTTATCT ACGATGAATC TGGTCGCCCA
ACTCATATGC TCGGTGCCCA CCTTGATCTA ACCGAACGTA AACAGCTTGA AGAGGAACGG
CAACACCTTA CCGAACAACT CTTTCAGTCG CAGAAGCTAG AGGCAATTGG GGCGCTGACC
AGTGGCATTG CACACGACCT CAACAATTTG TTGGTGCCGA TTATTGGTTT CGCCGAACTC
GGTATGTTAC AGCTCTCTCC GTCGAGTGAG CTGTACACCG ATTTCGACCA GATCCGGGCG
GCAGGTATTC GGGCTACTGC CCTGACTCGG CAGATTTTAG CCTTTAGCCG TCGGCAACGA
CTCGAACCAA AACCGGTGAA TCTCAATCAG GTAATTAGCG ACTTCATCGG TATTTTGCGT
CGTTTGATCG GTGAACGCAT CACCATCCAT CTACAGCTCG CACCCTCTTT GCCATTGACA
CATGCCGATC CCGGGCAACT CGAACAGGTA TTACTTAACC TAGTTATTAA CGCTCGTGAT
GCAATCGAAG GCTACGGTAC CATCACCATC TCGACGGCAT CGGTTACCAT TCCACCTTCG
CCCTCAGACC AAAGAGCCAG ACTACTTCCC GGCGAGTATA TTATCCTCCA GGTACACGAT
ACCGGCTGTG GAATTGAGCC AGATATTTTA CCTCGGATTT TCGAGCCATT TTTTACCACA
AAACCACCGG GAAAAGGAAC CGGTCTTGGT CTGGCTACTG TCTTTGGCAT TATCAAGCAA
TATCGCGGCC ATATCGATGT CCAAAGTGTG CCTCAGCAGG GGACAACGTT TACTATGTAC
CTGCCGACAC TCACCGATTC AACACCGGCC GGTGAAGCAC CATTGCCATC CTCAGATACA
GCATTGACCG GGCACGAAAC GGTGCTCGTC GTAGATGACG AACCTTCCGT ACTCTACCTC
ATCACCAGTG CGTTACGGTT GTACGGGTAC CGGGTACTAG AAGCTACCGA TCCGCAGCAC
GGGATCAGTC TCGCCGTAGA TCGTGAGCAG CCTATCGACC TCTTAATTGC TGACGTGATG
TTACCGGGAA TAACCGGCGA CGAGCTATAC CGCTCACTTG CCGCCCAACA ACCTAACCTG
CACGTCCTGT TCATCTCGGC ACAGACGGAT GATTCATATG ATCTACCACC CAATGCCCCG
CTGTTGATGA AACCGTTTAC CCTGAACCAA TTGCTGCAAA AAGTACGTGC CGCACTTATG
GTAACGGTGT AA
 
Protein sequence
MVGDTILLLL TNSTNRQLLE TWLGHHYSII VGNDKTALQQ PFQLCIIDGP ALNRYEAEIH 
ARRATAHPIF LPFLLVTTRR DVHLYTRHLW QTVDELVTSP IEKAELLARI EILLRARRSA
LELNRLQQAM LSSTQTWLQL AVSASQIGLW EWDLITNRVF FSPEWKAQLG YAADELNDSF
AVWEERLHPD DRERCLRTLQ QYLERPWPNF SLEFRLRHKD GSYRWIRSQA ALIYDESGRP
THMLGAHLDL TERKQLEEER QHLTEQLFQS QKLEAIGALT SGIAHDLNNL LVPIIGFAEL
GMLQLSPSSE LYTDFDQIRA AGIRATALTR QILAFSRRQR LEPKPVNLNQ VISDFIGILR
RLIGERITIH LQLAPSLPLT HADPGQLEQV LLNLVINARD AIEGYGTITI STASVTIPPS
PSDQRARLLP GEYIILQVHD TGCGIEPDIL PRIFEPFFTT KPPGKGTGLG LATVFGIIKQ
YRGHIDVQSV PQQGTTFTMY LPTLTDSTPA GEAPLPSSDT ALTGHETVLV VDDEPSVLYL
ITSALRLYGY RVLEATDPQH GISLAVDREQ PIDLLIADVM LPGITGDELY RSLAAQQPNL
HVLFISAQTD DSYDLPPNAP LLMKPFTLNQ LLQKVRAALM VTV