Gene Cagg_2683 details

Gene Information

Locus tag: Cagg_2683
Symbol:
ID: 7269590
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3285146
End bp: 3286786
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 55%
IMG OID: 643567509
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_002463987
Protein GI: 219849554
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
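
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3285146, 3286786)
print(length, protein_length(length))
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields of the record.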


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000116014
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACTATGG GCATCGGTGT AAAAGGTTAT ACTGAAGAGA AAGGGACGAG TGTATCGTTC 
TATGGTACAA TGGCAACGAT GACCGACCTC CGCTTACGTC AACGCGAATA CCTGTTGCAG
ATCAGTCGCG CCCTGACGGC TCAGCTCGAT CTGGGTAGTG TACTGAATCT GGTCATTACC
TACGCCGTCG AATTACTCGC CGGGACGCTC GGTTTTATTG CGTTGTTCGA CGAAGACGAT
GGCGTGTTGC GTATTCGCGC TTCGGCCCAA TTACCGCGTG AGAGCTGGCC GGCCTTCACA
CCACTGTTGC GAATCCCGAT CAGCGATCTT AGTCGCTATG GACCGCAAGT GTTGCGTGAA
ATATCGTTGG ATACCGGTAT TCCACTGCGC CAAATGATCG CCCTTCCCCT CACCGTGCGT
GAAACGCCAC TCGGTGTGAT TTACGTTTTT CGCGCCGCGC TCAATGTGGC CTTTTCCGCC
GATGATCGGC AGATGTTGCA AGATTTCGCC GACCAAGCGG CGATTGCTGT CGGGAATGCG
CGGTTGTTTC AAAGTGTTCT GCGTGAGAAG CAGCATCTGA ATGCGCTCAT CGAACAGTCG
GCGGATGGGG TGATGATTAT CGATGGCCGC TGGCGCATTA CCACGTTCAA CCATACGATG
GAGCAACTGA CCGGTTGGTC ACGGGAAGAG GCGATTGGGC GACCCTGTGC TGAAGTGTTG
GGGATCCGCG ATGCGCAGGG GGTGAATATC TGTTTGAATG ATTGTCCGTT GCAGCGCCAT
CCCGAACTCG CTAATCCGGT GGTTGAGGGA CGAATTACCA CGCGCGATGG CCGTGAGTTG
TTTATCCAAA ATCGCTACGC GCCGCAGCGG AATGCGCTTG GTACGTTGTT GAGCGCCATT
GCCAATGTGC GCGATATTAC GGCGCAGAAA GCTGAAGAAG AGCGCCAGAA TACCTTTATT
TCGGTGATTT CGCACGAATT GCGCACGCCG GTGAGTATTA TCAAAGGCTT TGCCGAGACG
ATGCTTCGTC CTGATGGACA GTTTACCGTC GAGCAGTACC GTGAGGCGTT GCAGGTGATC
GGTGAAGAGG CCGACCGGTT AGCGCGTCAG ATTCAAGATT TACTCGATGT CTCGCGAATC
GCTGCCGGTG GTTTGCGCCT CGAATATAGT GATGTGTCGC TCCAATTGCT GGTGAAAGAG
GTTGTGCGAC GGTTTGCGGC TCAGGTTGGT GACCGGATCG AGTTCGAGAT CCGTGTTCCT
GACGATATGC CCCCGGTCTA TGCCGATTAC GAGCGGTTAC GGCAGGTGTT TACGAATCTG
ATCGAAAATG CGGTGAAGTA TAGTCCCAAC GGTGGGACGA TCCGAATCGG GGCGCGCGCT
GAAGGGGAGA TGGCGATAGT TTACGTCGCC GATCAAGGTA TTGGTATTCC TCCCGAAGAG
CAGGATCTGA TCTTTGAGCG CTTCTACCGG GTTGATAACC GGTTACGCCG CGATCGGCCC
GGTAGTGGGC TTGGACTTTA TATTACCCGC GCGATTGTCG AGGCCCATGG CGGTCGGATT
TGGGTTGAAA GCCAGGTTGG GCGTGGGTCT CGTTTCTTGT TTACACTCCC GTTGAGCCGG
CGTCGGTTAC CGGGAGAATA G
 
Protein sequence
MTMGIGVKGY TEEKGTSVSF YGTMATMTDL RLRQREYLLQ ISRALTAQLD LGSVLNLVIT 
YAVELLAGTL GFIALFDEDD GVLRIRASAQ LPRESWPAFT PLLRIPISDL SRYGPQVLRE
ISLDTGIPLR QMIALPLTVR ETPLGVIYVF RAALNVAFSA DDRQMLQDFA DQAAIAVGNA
RLFQSVLREK QHLNALIEQS ADGVMIIDGR WRITTFNHTM EQLTGWSREE AIGRPCAEVL
GIRDAQGVNI CLNDCPLQRH PELANPVVEG RITTRDGREL FIQNRYAPQR NALGTLLSAI
ANVRDITAQK AEEERQNTFI SVISHELRTP VSIIKGFAET MLRPDGQFTV EQYREALQVI
GEEADRLARQ IQDLLDVSRI AAGGLRLEYS DVSLQLLVKE VVRRFAAQVG DRIEFEIRVP
DDMPPVYADY ERLRQVFTNL IENAVKYSPN GGTIRIGARA EGEMAIVYVA DQGIGIPPEE
QDLIFERFYR VDNRLRRDRP GSGLGLYITR AIVEAHGGRI WVESQVGRGS RFLFTLPLSR
RRLPGE
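
The gene and protein sequences are printed in blocks of 10 characters, so the spaces and line breaks have to be stripped before any analysis. The sketch below shows one way to do that, then to compute GC content and translate the coding sequence. It is illustrative only: the function names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for sense and stop codons (the two tables differ only in which codons may act as starts).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_block = clean_sequence("ATGACTATGG GCATCGGTGT AAAAGGTTAT")
print(translate(first_block))
```

Passing the full cleaned gene sequence to `gc_content` and `translate` gives values that can be compared with the GC content field and the protein sequence listed in the record.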