Gene Cagg_0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0485 
Symbol 
ID7266653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp599956 
End bp602136 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content54% 
IMG OID643565348 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_002461862 
Protein GI219847429 
COG category[T] Signal transduction mechanisms 
COG ID[COG2203] FOG: GAF domain
[COG2205] Osmosensitive K+ channel histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0662755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC CAGTGTCCTC TTCAAGACAA TCTGAGCCGA ACACGCAAGG GCTACTGCGC 
ATTTCCACTG CAATCGAGAC GGCAGCTACC CTTGATGAAC TGTTATTGCT GGCACTGAAC
GAATTTGTGC AGGGGTTAGG CGTTGCGCTC TGTGGTGTGT TGTTGCTCGA TGCCGGGGGC
GAATCGATCT CATTGGTAAG CACCTTTCCT CCTCGTATCT CGTTCCCCCC GCCCATCCCA
ATGAGCGATT TACCGATCAT GCAACGCGCC TTGCAGCAAC GACAGGCGTA TCAGATCCAC
GATATTACCG AGTTGCCACG CGAGCGATAC CGCTCGCCAA CGTCGTTGCA GATACTGACG
ATGTTGACCG AAGCGCAGGT GCGCTCACTC CTCATTGTTC CGCTGGTGGC ACAGGACCGA
GTGATCGGCG CATTAGTAGT TGCTTCTATT GAGCAGCCAC GCCATTTTGA CGATCAAGAA
ATATCGAACA TTCGCCTCAT GGCCAGTCAA TTGGCAGCGG CGATTACCGC GTTTCGCACC
ATTGAAGAAG CAGAATATCG TAACGCCGAA TTGATGACGT TGAACGATAT TGCTGCCGCT
GTTAATTCGA TGCTCGATCC GCGCGATATT TACCATCTGG TGATGGAAAA GATCAATCAG
TTCTTTCGAG TGGAGGCCGG TTCGTTGCTG ATGCTTGATG AAGAAACCGG AGATATGGTC
TTTGTCATCA CCTTAGCAGG TGGGCAAGAG AAACTGATCG GCCAACGTGT GCCACCGGGC
GTAGGGATTG TCGGCGATGT CGTGCGTACA CAGCAGTATG CGATTGTGCA TGATCCTGAG
CGTGATCCGC GATTCTATCG GAACGTGAGT GAGGGGATCG GCTATAACGT CCATTCAATC
CTCTGCGTGC CGATTGTGGT CAAGGGTCGC ACGATTGGAG CAATTGAGTT ACTCAATAAA
CGAACCGGCC GTTTTACCGA AGAAGAGGCG CTCCGCCTCA CCCGGATGGC AGCGACTATT
GGTATCGCGA TTGAGAACGC CCACCTGTTT CAACAGGTGA GTACTGTGCG CGATCGCTTT
GAGGCCATTG TGAATTCGAC CAATGATGGT ATCTTAATGG CCGATATGCG TGGTGTGGTG
GTAGCGAGTA ATGTCGTAGC GGCGCGGCTC TTTCACCGTT CCCGCGAAGC GTTAATCGGG
CTGCGGCTCG ATGACTTGAT TGCCGAGTTG ATGGAGCGAG CCCAGGTGGT TGAAGAACCG
GCATGGTTGA ATGATGGGGA ATCGCATCGA GTGTTCGAGA TCGAACTAAC TGAAGGACCG
GCCCGCTACC TGCGCCATTC CATCTTGCCG GTGCTCGATA CTCACGGTAT GCAGATAGGT
CGTCTTGCCC TGTTCGAGAA CATTGACAAG GAGCGCGAAC TCGCCCGGCT GCGCGAAGAT
TACACCGGTA TGCTCATTCA CGATCTGCGC GCCCCGCTAA CGGCAATTAT GAACGGTATC
ATGATGGTGC AGCGTGGTTT GGGCGGGCCG ATTTCGCCAC AGCAACAAGA GATGTTGAAC
ATCGCCTATC AGGGAAGCCA GGCGATGCTC GAGATGATCA ACACGCTGCT CGACATCAGC
AAGATGGAGC AGGGGCGAAT GATCCTGAAC ATTGAACCAC TCTCGCCGTA TGCTGTGGTT
GATGAAGCTA TTGAACGCTT GCAAGTCTAT GCCCAACAAC GAAAAGTGCA GTTGGCGCAG
GATCTTGCGG TCGGATTGCC GCTATTCGAA GCTGACCGCG AAAAGATCGT TCGGGTCTTG
CAGAATCTGA TCGACAATGC TATTAAGTTC TCGCCCGAAA ATGGCATCAT AACGATCGGC
GCCCGTCAGA TCGAGTTGGA CACCGATACA CTCGGTGGTT CTCATCCAGA CCTGCCGATG
CCGTTACCGT CATTACCGGC AGGGAGATGG CTGATATACT GGGTGGCCGA CCAAGGTCCC
GGTATTCCTC CTCAATACCA CGCACGGATA TTCGAGAAAT TTGGTCAGGT GCAGCAGCAG
AAGGGACGTG GCACCGGCCT TGGTCTGACG TTCTGCAAGT TAGCGGTTGA AGCACACCGT
GGTCACATCT GGTTACGTAG CCGCGAAGGA GCCGGCAGTA CGTTTGCCTT TGCCTTGCCG
GTAGCCGGTG ATGGCGATTG A
 
Protein sequence
MSDPVSSSRQ SEPNTQGLLR ISTAIETAAT LDELLLLALN EFVQGLGVAL CGVLLLDAGG 
ESISLVSTFP PRISFPPPIP MSDLPIMQRA LQQRQAYQIH DITELPRERY RSPTSLQILT
MLTEAQVRSL LIVPLVAQDR VIGALVVASI EQPRHFDDQE ISNIRLMASQ LAAAITAFRT
IEEAEYRNAE LMTLNDIAAA VNSMLDPRDI YHLVMEKINQ FFRVEAGSLL MLDEETGDMV
FVITLAGGQE KLIGQRVPPG VGIVGDVVRT QQYAIVHDPE RDPRFYRNVS EGIGYNVHSI
LCVPIVVKGR TIGAIELLNK RTGRFTEEEA LRLTRMAATI GIAIENAHLF QQVSTVRDRF
EAIVNSTNDG ILMADMRGVV VASNVVAARL FHRSREALIG LRLDDLIAEL MERAQVVEEP
AWLNDGESHR VFEIELTEGP ARYLRHSILP VLDTHGMQIG RLALFENIDK ERELARLRED
YTGMLIHDLR APLTAIMNGI MMVQRGLGGP ISPQQQEMLN IAYQGSQAML EMINTLLDIS
KMEQGRMILN IEPLSPYAVV DEAIERLQVY AQQRKVQLAQ DLAVGLPLFE ADREKIVRVL
QNLIDNAIKF SPENGIITIG ARQIELDTDT LGGSHPDLPM PLPSLPAGRW LIYWVADQGP
GIPPQYHARI FEKFGQVQQQ KGRGTGLGLT FCKLAVEAHR GHIWLRSREG AGSTFAFALP
VAGDGD