Gene Cagg_0496 details

Gene Information

Locus tag: Cagg_0496
Symbol:
ID: 7266992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 612743
End bp: 614407
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 53%
IMG OID: 643565358
Product: histidine kinase
Protein accession: YP_002461871
Protein GI: 219847438
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
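
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 612743
end_bp = 614407
gene_length_bp = 1665
protein_length_aa = 554


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)  # prints: True True
```

Under these assumptions, 614407 - 612743 + 1 = 1665 bp, and 1665 / 3 = 555 codons, or 554 residues once the stop codon is excluded, which matches the listed values.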


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000434752
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGTTTC CCATTCGTCT CAAATTACTA GGTGCATTAG CCATTGACCT CGTCTTGATG 
ATGGTCTTAG GCTACTTTGC TACGCACCAA ATGGCAACGA TGAATGAACG TGCCGTCTTT
ATCGAGCGCC ATACCATTCC TTCACTCGAT ACGGTTGGTG ATATGGTAGC AGCGATCAAT
CGTTACCGCA CCCGCCAGCT CGAGTTCTTG ATTTATACCA ACCATGGCGA CCGCGAGCGG
TTGCTGGAGC ACATGCATGC TCTCGAGTAC GAGATGGCCA CTTATTTTAC CACCTATCGC
CCTTTGATTA GCTCTGCCCG CGAGCAAGAG CACTTGGCGG CAGTCGAAGC AGCATGGCAC
GAGGTCGTGC AGGCCAACCA CGGGCGATTT ATTCCGGCAG TACGCTTGGT GAGTGACGGT
AGCGTTCAAC CGTTCTATTC GCGGATGAAC CCGTTCTACG AAAAGCTCGA TCAGGCGATG
ACTGCATTGG TACAGGAGAA TAAACAACAG GCCCGCGCCT CACTCGATGT GGTTGCCGCT
AGTTACGAAG CTGCCCGTAC CTTTATCTTG TTCGATACCG TGATGGCAAT CATCATCTCG
GCTGTGATCG GGCTGTTTCT TTCAGCGCGG ATTGCCCGCC GGTTACAACG ACTAGCACAG
GCGGCTAATC GAGTTGCTGC CGGTAATTTC GTCGGTGCGA TTAATGAGCG TATCCGCGAT
GAAATCGGTG ATCTGGCACA AGCGTTCGAT CAGATGTTGG CAAGTCTGCG CTCCCAACGT
GCTGAATTGG AAGAACGTAA CTGTGCCTTG CAAGATAGCC TTGCCCGTCA AGAACAGTTG
ATGGCGGAGG TGATCCGCGG AAAACAGGCA GAAGCTGAAG CTGAACGAGC ACGTGCAGCG
GCTGAGGCAG CCAGCCAAGC CAAGAGTGCA TTCCTTGCGA CTATGAGCCA CGAGCTACGC
ACACCATTGA ATGCGATCCT CGGGTATGCT CAATTACTCC ATATCCAAAA GGTTGTCCCC
GACAGTCATC AGCCATACCT CGAACGCATT CTAACTTCTG GCCGTCATCT GTTGTCACTT
ATTAGTAACG TGCTCGATTT CGCCCGTATC GAACAAGGTG CGCTCGATCT TGATTATCGA
CCGGTGGAGG TGGCAGCTCT GGTTGAAGAG GTTGTGTCAA TGACGTTGCC TTTGGCTCAA
CGTCATCACA ATCGAATAGA AACTGATTGC CCACCCGAGA TCGGCGTCAT CGAAACCGAT
GGCCGCCGAT TACGGCAAGT GTTGATTAAT CTGTTGAGTA ATGCCGCAAA GTTTACCGAA
GACGGTCTTA TCCGGCTCGA AGTAACCGCC ATCCAGCACA ACGGTCGAGA CGGTCTGCGT
TTTGCCGTCC ATGATACCGG GATCGGTATT CCGCCCGACA AACAACACAA ACTCTTCCAG
CCCTTTAGCC AAGTTGACGA TTCGGTCACC CGCCGCTATG AAGGTACCGG CCTCGGCTTG
GCCCTCAGCA AGCAGATCGT CGAAGCCCTC GGTGGCACCA TTACCGTGCA GAGTACGGTG
GGTGTTGGCT CGACCTTTTC CGTCTGGATC CCGGTAGTTC CGGTGCCAAC ATCTTCGCGC
GCATCGTTCT TAACTCCAAT GCAACTCGCA GGAGATGTCG CATGA
 
Protein sequence
MRFPIRLKLL GALAIDLVLM MVLGYFATHQ MATMNERAVF IERHTIPSLD TVGDMVAAIN 
RYRTRQLEFL IYTNHGDRER LLEHMHALEY EMATYFTTYR PLISSAREQE HLAAVEAAWH
EVVQANHGRF IPAVRLVSDG SVQPFYSRMN PFYEKLDQAM TALVQENKQQ ARASLDVVAA
SYEAARTFIL FDTVMAIIIS AVIGLFLSAR IARRLQRLAQ AANRVAAGNF VGAINERIRD
EIGDLAQAFD QMLASLRSQR AELEERNCAL QDSLARQEQL MAEVIRGKQA EAEAERARAA
AEAASQAKSA FLATMSHELR TPLNAILGYA QLLHIQKVVP DSHQPYLERI LTSGRHLLSL
ISNVLDFARI EQGALDLDYR PVEVAALVEE VVSMTLPLAQ RHHNRIETDC PPEIGVIETD
GRRLRQVLIN LLSNAAKFTE DGLIRLEVTA IQHNGRDGLR FAVHDTGIGI PPDKQHKLFQ
PFSQVDDSVT RRYEGTGLGL ALSKQIVEAL GGTITVQSTV GVGSTFSVWI PVVPVPTSSR
ASFLTPMQLA GDVA