Gene Cagg_3669 details

Gene Information

Locus tag: Cagg_3669
Symbol:
ID: 7268204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4462537
End bp: 4464006
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 55%
IMG OID: 643568475
Product: histidine kinase
Protein accession: YP_002464941
Protein GI: 219850508
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
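
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4462537, 4464006)
print(length)                  # 1470
print(protein_length(length))  # 489
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.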


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATGAAC AACACTGGCT GAAACGATGG GCTGCGGTCA TCTGGCAAGT GATTGGCGCA 
GTCAACATTC GAGCAAAGAT TCTTGGCATT GTGCTAGGGT TGGTGGTGCT GATGGGAGTA
GCAGCAACGA TTGAAGTGCG GCTGTTACTC GAACAAACGC TTACCAACCA GGCCTACGAA
CACTCGGTTG CAGTTGCGCG TGATATGGCA GCCCGCGCGA CCGATTTCAC CTTGATGCGT
GATTACTACG GCTTGTTTCG TCTCTTGCGC GACACGCAAG CTAATAACCC CGGTCTCCGC
TATGCCTTCG TTGTCGACTC TGAAGGTACG ATTGTTGCTC ACACATTCGG CACCGGCTTT
CCCGTAGGCT TGCGCGATGC CAACACGGTG ACAGCGAATG AACATCACCG CAGTGTGCTA
TTAACCACCG ATGAAGGCGA TATTTGGGAT ATTGCCGTGC CGATCTTCGA TGGGCGGGCC
GGTATCGCTC GTGTTGGTTT ATCGTTAGCA ACCCGTGAGC AAACAGTTGC CGCCGTGACC
GGTCAGTTGC TCATCACAAC CATTATGGCA GCGGCGGTTG GGATCACGGC TGCCGCCTTG
CTAACGTGGA TCCTTACCCG TCCAATTTTA CAATTAGTGG AGTTGACCAA AGCGGTAGCC
AGTGGCGATT TTAGTCGGCG CGCCCAACGT TGGGCGAACG ATGAAATTGG CAAGCTGACC
GATGCGTTCA ATGCGATGAG TGAAGCGCTA GCACAGGCCG AACGTGAACG CGCCGAACGC
GAGCAGATGC GAGCACAGTA TGTGACCCAG ATCATTACCG CCCAAGAAGA GGAGCGAAAG
CGAATTGCCC GCGAACTCCA CGACAGCACA AGCCAAGCCC TCACTTCTCT CCTGATTGGT
TTGCGTTCAC TTGCCGACCG TCATCATTCA CCTGAACTGC ATCGGCAGGT TGACGAGCTG
CGTGGGATTG TCGGGCAGGT GTTACATGAC TTGCACGCAC TCGCCCGCCA ATTACGACCT
AGCGTGCTCG ATGATTTAGG GTTAGCTGCC GCCATTCAAC GCTACGTTGC CGATTGCCGC
GCTCGGAGCG GATTGACGAT TGATTTGGCT ATGCCCGACT TGACCGATGA ACGGCTCGAT
CCGGCGCTCG AAACTGCACT CTACCGGATC GTGCAAGAAG CTCTGACGAA CGTAATCCGT
CACGCTCATG CTACAACTGC GAGCGTCGTG ATCGAGCGGC AGAATGGCCA CTTACGTGCC
ATTATTGAGG ATAATGGCTG TGGCTTTGAT CCGGCTAGCC TCAGCGGTGA TGGTCATCTC
GGCTTGAATG GAATCCGCGA GCGGGCAGCA TTGTTGAACG GTCAGTTGAT CATTGAGTCA
GCGCCCGGTA GTGGTACAAC TCTTTATGTC GAATTTCCCC TGCCAGCAGC AAATGAGGAG
CATCATGAGC GGCATTCTGT TGGTCGATGA
 
Protein sequence
MNEQHWLKRW AAVIWQVIGA VNIRAKILGI VLGLVVLMGV AATIEVRLLL EQTLTNQAYE 
HSVAVARDMA ARATDFTLMR DYYGLFRLLR DTQANNPGLR YAFVVDSEGT IVAHTFGTGF
PVGLRDANTV TANEHHRSVL LTTDEGDIWD IAVPIFDGRA GIARVGLSLA TREQTVAAVT
GQLLITTIMA AAVGITAAAL LTWILTRPIL QLVELTKAVA SGDFSRRAQR WANDEIGKLT
DAFNAMSEAL AQAERERAER EQMRAQYVTQ IITAQEEERK RIARELHDST SQALTSLLIG
LRSLADRHHS PELHRQVDEL RGIVGQVLHD LHALARQLRP SVLDDLGLAA AIQRYVADCR
ARSGLTIDLA MPDLTDERLD PALETALYRI VQEALTNVIR HAHATTASVV IERQNGHLRA
IIEDNGCGFD PASLSGDGHL GLNGIRERAA LLNGQLIIES APGSGTTLYV EFPLPAANEE
HHERHSVGR
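
The protein sequence above should follow from the gene sequence under translation table 11, in which an alternative start codon such as GTG is read as methionine when it initiates translation. The following is a minimal sketch of that translation, not the pipeline used to produce this record. The `translate` helper is hypothetical, and it assumes the input is the coding strand in frame.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI "TCAG" order; table 11 uses
# the same assignments and differs mainly in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A table 11 start codon in the first position is translated as 'M'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence above.
print(translate("GTGAATGAAC AACACTGG"))  # MNEQHW
```

Passing the full gene sequence to `translate` gives a result that can be compared against the listed protein sequence once the spaces are removed from both.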