Gene Cagg_3631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3631 
Symbol 
ID7269775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4411951 
End bp4413951 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content55% 
IMG OID643568437 
Productsignal transduction histidine kinase 
Protein accessionYP_002464903 
Protein GI219850470 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0281267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAATT CAACCCTACC GATGTATGAG GTGTTGACAT CGCAGCCGCC CGATCTGATC 
GGTGCAGTTC GTGAGACGAT CCCATCGATC GGGATGTTGA TTGATGCGCT GGAGCAGCAA
TCCGACCCAG CAGTTCGGCT AAGTCTCCTC CGCCAGATCG GAGCTGAATG GCATGCATGT
GGTGGTCTCG CCGAGGTATT GACCGAGTTA GGGTTACATT TGGCCGCGCG GCTACCGATG
TCGCAAGGGG TTGTGGTCAG TGAATTACTG CTGGCGTTTG GCGCTGCTCA GGAAGAACAA
CGCACGCGCG AGTGGGAAGC TCGGCTGGTA GCGCGGGCCG CCGAGCTAGA CGGTCTACAT
CGCATTATCT CTGCCGCCAA TTCCACCCTT GATCTCGATA CATCCTTACA GACGGTGGTA
GAGACGGTCG CACAGGTAAT GAATGTTGAG GTCTGTTCGA TTTATCTCTA CGATAAGCAC
CGTGATGATC TCGTATTGCG TGCAACTCAC GGCCTCAATC GAGCAGCCGT CGGCCAGGTT
GTGGCGCGTT TGGGCGAAGG GGTTACCGGT TGGGCCGCTC AATTGGGGTA TCCGGTGGCC
GTGAGTGATG TTTATCAAGA CCCACGCTAT CATCGTGAGC CGCAGTTGGG CGAAGAGATT
TTTCGCTCGA TGCTTGCCGT CCCGATTGTG CTCTTTTCGG CAGAACGCTT TCAGTTTAGT
GCCGACAAGT TGCAAGGCGT TATTACCGTG CAAACGATTG CACCCCGCGA CTTCACCCAA
GAAGAGATTT CATTTGTCGA GATGGCTGCC GGCGAGTTGG CCTTCTTTAT TGCCAATGCG
CAACTGTACC AGCAGACCGA CGAGCGCCTC CATCAAAAAT TGCGTGAGTT GACGACCCTC
CAACAGGTGT CGAAATCAAT TGCCGAGCAG ATCGGTTTGC ACGATGTCCT GAACCTGATC
GTCGAAAAAG CCGTCGATCT GTCGAAAGTG GATCGGGCGG CGATCTTTCA GGTCGGTGAA
GACGGCAATT TGAAGCTGGT TGCCAGTTAT GGCGGTGAAG GAGATGGCGT GCGCGATTTG
ATCATTCAAA CCGTGCGTGA TGGTCGGCCA CTGGCCGTGA TGAATGCGTA TCAAGACGCA
CGTTTTCCGC AGTTGCCAGA TGTAGCACGG CGCGAAGGTT TTCACTCGCT CTTCTGTATG
CCGCTCCGCG CCCGTGGCCG TACTATTGGC GGTATCTGTC TCTACCAACG CGAGCCGCAC
CTGTTTGATT ACGAGCAGGT GCGCTTACTC AATACGTTTG CCGATGAAGC GGCAATCGCA
ATCGAAAATG CTCGCCTGTA CGAAGAGAGC CTCCGCGCAT TGCGCGTCAA ATCGGCTCTG
CTTCAAGAGA TGCATCATCG TGTGCGCAAT AATCTCCAGA CCATCGCCGC TTTGTTGGCA
ATGCAATTGC GTCGCCTCGA TCCGGCTAGT CCGGGCGCCA AGGCATTGCG TGAGAGCGCT
GCACGGATTC AGGCGATTGC CGCCATCCAT AATCTGCTGT CGCGCGATGA TATTGGCGTA
ACGACGGTCA GTGCCGTTGC GCGGCAAGTG ATCGAGAGTG TGCAAAGTAC GCTCTTCGAG
AGCGATATTC GCGTCGAATT CACCATTCTC GGCGATGAGG TGCGAATCGG CTCACGCGAT
GCTACTGTCC TGGCACTCGT GATTAACGAG TTGGTAGATA ATGCACTGAC CCACGGTTTG
GCCGCGGAAG GTGGGCGCAT CGAGGTGGAA GCGGTGCTTG AGCAAGGTTG GGTTGTTCTC
GAGTTGCGCG ATGATGGCCC ACGTCATCCA CCACCACCGC CACGGCAAAG TAGTGGTTTG
GGGTTGCAGA TTATTGAAAC GTTAGTAATC GGCGATTTAG GGGGGACCTT TAGCCTCATT
CGTGATGAGG AAGCCGGTTG GATGCGCGCA CAGGTTCGTT TCCCCCAACG GATTATCGAA
GAGGACCGGC TGGAAGTGTA G
 
Protein sequence
MDNSTLPMYE VLTSQPPDLI GAVRETIPSI GMLIDALEQQ SDPAVRLSLL RQIGAEWHAC 
GGLAEVLTEL GLHLAARLPM SQGVVVSELL LAFGAAQEEQ RTREWEARLV ARAAELDGLH
RIISAANSTL DLDTSLQTVV ETVAQVMNVE VCSIYLYDKH RDDLVLRATH GLNRAAVGQV
VARLGEGVTG WAAQLGYPVA VSDVYQDPRY HREPQLGEEI FRSMLAVPIV LFSAERFQFS
ADKLQGVITV QTIAPRDFTQ EEISFVEMAA GELAFFIANA QLYQQTDERL HQKLRELTTL
QQVSKSIAEQ IGLHDVLNLI VEKAVDLSKV DRAAIFQVGE DGNLKLVASY GGEGDGVRDL
IIQTVRDGRP LAVMNAYQDA RFPQLPDVAR REGFHSLFCM PLRARGRTIG GICLYQREPH
LFDYEQVRLL NTFADEAAIA IENARLYEES LRALRVKSAL LQEMHHRVRN NLQTIAALLA
MQLRRLDPAS PGAKALRESA ARIQAIAAIH NLLSRDDIGV TTVSAVARQV IESVQSTLFE
SDIRVEFTIL GDEVRIGSRD ATVLALVINE LVDNALTHGL AAEGGRIEVE AVLEQGWVVL
ELRDDGPRHP PPPPRQSSGL GLQIIETLVI GDLGGTFSLI RDEEAGWMRA QVRFPQRIIE
EDRLEV