Gene Cagg_3266 details

Gene Information

Locus tag: Cagg_3266
Symbol:
ID: 7267413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3957596
End bp: 3959026
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 55%
IMG OID: 643568087
Product: histidine kinase
Protein accession: YP_002464560
Protein GI: 219850127
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
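
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS of the given length, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3957596, 3959026)
print(length, protein_length(length))  # 1431 476
```

Both values agree with the Gene Length and Protein Length fields reported for Cagg_3266.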


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAT GGTGGCATAA CTTACCGTTG CGGCAGCGAC TAGCCGTTCT CTACAGCGGC 
TTGTTGGCAT TATTGCTTGG GTTGTTGGGG TTTGGATTCT ACATCGACAT CCAACAATTC
CTGTTTAGCA GTACCGAACT ACGGATTCGA GCACAAGCCA AGCCGGTCAT TGAGCAATAC
GTTTTTACAT CAGCCGATCC GGTCATTCAT CTACCTCGCA TCGCGCAGCA GTTGAGTCGT
GATCTCACCT CACGCGATAC AACGGCGCTG GTGTTTGATC GTGACGGAAG GCTATTGGCC
GATGGGCGTA AGTTGCCGGA AGAACCGGTC GCGATACCGG TAAGTGCCGA TTATCTAGCG
CTGGCGCTGT CCGGGGACAA CAACGTTACG ATCATCCAGC CGGTCGGTGA GCAGCGGATG
CTCAGCCTCC TCATCCCGTT ACGAACAGCA CCGGCATCAT CGGAAATATT GGGTGTGGTG
CAGATGACGA CGCCACTGAC GATGATTGAA ATAACGCTAC AGCGACAGGG TTTTACCATT
TTGTTTGGCG TCATAATAAT GTTGGTCGTG GGGGTTATTG TTGGGTATTG GCTAACCAGT
TCAACATTGC GACCACTAAA CGATCTGATC GTTGCTTGTC GGAACATCGC GCAGGGGAAT
CTCCGCCAGC GCGTGCCGGT GGTTGCCCCG TATGATGAGG TAGGGCAATT AACGGCAGCC
TTTGCCGAGA TGGTTGAACA GTTGGAAAAG AACTTCCAAG CGCAACAACG GTTTATTGCC
GATGCAGCGC ATGAGATGCG CACCCCGCTG ACGGCATTAC GGAGTGGGCT AGAGGTGTTG
TTGCGCGGTG CGCAGGATGA TCCGCAGACA GCCTTTCGCT TGATCCAGAG CATGCACTGC
GATGTGGTGC GCTTATGTGG GGTAAGTGAG CAGCTTCTTG ATCGAGCACG CTACGAATCG
GGCCGAGCGT TGATGTTGCG GTTAGTAGCC ATTGCCGAGA TGATGGACGA GTTTGCCGCG
CAAGCTCGTC TATTGGTCGG TGAGCGGACG CTGATCGTAG CGCATGGCCC GGCGGTGTCG
GTCTTGATCG ATAGTGATGG GATCAAGCAA GCATTATTTC ATCTGATCGA TAATGCAATT
CAGCACACCA CGCCGCAGGG CGAAATCAGG CTGGGCTGGT CGGTTGAGCG AGGGATGGTG
CAGTTTTGGG TGGCTGACAA CGGTGAAGGG ATTGCCGAAG CCGATCTGCC ACACGTATTC
ACTCCCTTTT ACCGCGGCAG CCGATCACGT TCGCGACGAA CAGGCCGGGC CGGCTTAGGA
TTGACGTTAG TGCAGAGTGT GGCCCGTGCG CATAGTGGTG AGGTAACGAT AACGAGTCGG
TTAGGGGAGG GGACGCAGGT CGTCATTGCG ATACCCTACC GGGTGGAGTA G
 
Protein sequence
MKQWWHNLPL RQRLAVLYSG LLALLLGLLG FGFYIDIQQF LFSSTELRIR AQAKPVIEQY 
VFTSADPVIH LPRIAQQLSR DLTSRDTTAL VFDRDGRLLA DGRKLPEEPV AIPVSADYLA
LALSGDNNVT IIQPVGEQRM LSLLIPLRTA PASSEILGVV QMTTPLTMIE ITLQRQGFTI
LFGVIIMLVV GVIVGYWLTS STLRPLNDLI VACRNIAQGN LRQRVPVVAP YDEVGQLTAA
FAEMVEQLEK NFQAQQRFIA DAAHEMRTPL TALRSGLEVL LRGAQDDPQT AFRLIQSMHC
DVVRLCGVSE QLLDRARYES GRALMLRLVA IAEMMDEFAA QARLLVGERT LIVAHGPAVS
VLIDSDGIKQ ALFHLIDNAI QHTTPQGEIR LGWSVERGMV QFWVADNGEG IAEADLPHVF
TPFYRGSRSR SRRTGRAGLG LTLVQSVARA HSGEVTITSR LGEGTQVVIA IPYRVE
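
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal illustration, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in which start codons are permitted), and it assumes the gene begins with ATG, as this one does. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAACAAT GGTGGCATAA CTTACCGTTG"))  # MKQWWHNLPL
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` applied to the same input can be compared with the GC content field.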