Gene Cagg_3195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3195 
Symbol 
ID7267342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3878835 
End bp3879899 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content53% 
IMG OID643568016 
Producthistidine kinase 
Protein accessionYP_002464489 
Protein GI219850056 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.107163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.232991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCG CAATACTACT CCGGCCGATC GGCAATCCTG TCACCGGTGC GCTACGACTG 
GCAGGTTGGT ATATCGTTCT CGCCGGGACA TGGGTTGCGT TCTCTGATCG GTTGCTCACA
CTGATGGTGA GCGATATTGA ACAACTGACA GTCTGGCAAA CGGTAAAAGG TTGGCTCTTC
ATCCTGGTCA TGGCAGGTTG GCTTGGGTAC GAACGATGGC GGACCCTTAG CCGGCAGCAA
CAGATCAATG AAGAACTCCG GGCGGCCAAC ACCGAGTTAC AAGAACTAAA CGCCTCGCTC
GAACGACGCA TCGCCGAACG GACGGCGCCG CTGCAAGCGG CCAACGCTGA ATTGAGTCAC
ATCAATCAAG AGTTGGAAGA GTTTACCTAT GCTGTATCAC ACGATCTCAA AGCACCACTG
CGCGCAATCG ACGGTTACAG CCAAATCCTG TTGCACTTCC ACATCCAGCG TCTCGATGCC
GACGGACAGC AGTGCCTGCA CAACATCCGC ACTGCGGTTA CACAGATGTA TCAGCTCATT
GACGATTTGC TGACCTACAC CCACATCGAG CGCAAATCAC TTGAATACGC CGAAATCCAT
CTTATCAATC TGGTAGAAGA GATTCTCGCA ATTTATGCCG ATCAGATCGC GGAACGCAAT
GTGATCATCG ATCGCGACCT GCGCTGCCTG ACGATTCGCG CCGACCTGAT CGGGTTGCGC
CTCGCATTGC GCAACCTGAT CGATAACGCC CTGAAGTTTA GCGCGCACGT GCCACAACCA
CGACTGGCAA TCGGCTCCGA AGTACAGGAT CATACAGTAC GACTGTGGGT GAAAGATAAC
GGGATCGGGT TCGATATGCG TGATTACGAT CGCATATTTG CTATTTTTCA ACGTTTACAT
CCACAAGAAT CATATCCCGG TTCTGGGGTT GGCTTGGCAA TTGTGCGTAA AGCTATCGAA
CGGATGGGCG GCCGTGTATG GGCCGAGAGT ACACCCGGCG CCGGTGCAAT CTTCTACGTG
GAGATACCAC ATGGCATCAC ACCCATATCG GCTGTTGTTG ATTGA
 
Protein sequence
MSSAILLRPI GNPVTGALRL AGWYIVLAGT WVAFSDRLLT LMVSDIEQLT VWQTVKGWLF 
ILVMAGWLGY ERWRTLSRQQ QINEELRAAN TELQELNASL ERRIAERTAP LQAANAELSH
INQELEEFTY AVSHDLKAPL RAIDGYSQIL LHFHIQRLDA DGQQCLHNIR TAVTQMYQLI
DDLLTYTHIE RKSLEYAEIH LINLVEEILA IYADQIAERN VIIDRDLRCL TIRADLIGLR
LALRNLIDNA LKFSAHVPQP RLAIGSEVQD HTVRLWVKDN GIGFDMRDYD RIFAIFQRLH
PQESYPGSGV GLAIVRKAIE RMGGRVWAES TPGAGAIFYV EIPHGITPIS AVVD