Gene Cagg_2803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2803 
Symbol 
ID7267508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3441517 
End bp3442785 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content57% 
IMG OID643567624 
Producthistidine kinase 
Protein accessionYP_002464102 
Protein GI219849669 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.367455 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCGG TTGAGCAATT GGTGCAACGC TGGCAGCAGA TGATACACTA TTTCGGTAGT 
GGGCAGCGCA CGCTCATGCT GGCCGCGTAT GCGATGATCA GTACTGCGCT GGTCGAGTTT
ATCCTCTTTC ATCGCCAGTT GCCACCAGAG CGATTTTATG TGGTGATCGT GCTGTTGAGT
GTCTTGCTCT CCCTCAACGC CGTCTGGGAG CGATTGCAGC AGCGTTGGGG TAACGCAATT
GCCGATAGAG TCTTTTTTAG TACCAGCTCC ATCATCTTTC TAGCTGCTAA TTATATTGGA
TTGGACGCCG GTTGGACGTT TTTACCGTTT TTGTTGTTTG TGATTGCATC GCAGGCGATT
GTGGGGCTGG GTGTTTGGCG TGGGTTAGGT GTCAGTCTGC TCTTGTACCT CGGTTGGTGT
GGGGTGCTCT GGCTGCGTGG AGTGCCGCTC ATCCAGATTG TTGTCCAGGC CCCGTCGATT
GCCTTAGGAT TGATCTTTGT CCTGATCTTT TCTATCGTTG CTGCACGGCT CGTTGAACAA
ACAGCGCGTG CTGAGCGGTT GGCGGCTGAA TTGCAGTCCG TAAACGTGGC ATTGGCAGCA
GCACGTGAAC GGGAGGTAGA GCTTGCTGCT GCCGAGGAAC GGGTACGGCT GGCGCGCGAG
ATACACGACG GGTTGGGGCA CCATCTTACG GCGCTGAACG TACAATTGCA AGCCGCTGCG
CGCTTGCTCA ACCGTGATCC AGAGCGAGCG GCACAGGCAT TGGCGATCTG TCGCGAAGAG
GCGCAAGCGG CGTTGAATGA GGTGCGACAA AGCGTGGCAG TGATGCGTAA CGCACCGGTA
AACGGGCGTC CGTTGCCGGA GGTCATCGCG AAACTGGTGG CCGATTTTAA GCGTGTTTCG
CCGTTGCATG TGCAGTTTGT GGTTGAGGGA GAGATTGGTG AATTGCCGCT GACTGTTGCT
ATGGCGCTCT ACCGTGCGGT ACAAGAGGGC TTGACCAACG CGCAGAAGCA CGGCCAGGGT
ACGACGGTGA CGGTACGGCT GATCGGTGAA GTTGGGCAGG TGCGCTTGGA GGTGGTGAAC
GATGGCCCAC CGGCCCCGCC GGTGGCTGAA ACCGGCTTTG GCCTGGCCGG CTTGCGCGAA
CGGGCAGCTC GGTTAGGGGG AACGTTGCAC GCTGAACCGC TCCCAGCGGG CGGGTTCCGC
TTGGCGATGG TTGTGCCACA CGTACAAACA GAGGAGAAGC CGTATGATCC GCATTCTGTT
GGTCGATGA
 
Protein sequence
MKPVEQLVQR WQQMIHYFGS GQRTLMLAAY AMISTALVEF ILFHRQLPPE RFYVVIVLLS 
VLLSLNAVWE RLQQRWGNAI ADRVFFSTSS IIFLAANYIG LDAGWTFLPF LLFVIASQAI
VGLGVWRGLG VSLLLYLGWC GVLWLRGVPL IQIVVQAPSI ALGLIFVLIF SIVAARLVEQ
TARAERLAAE LQSVNVALAA AREREVELAA AEERVRLARE IHDGLGHHLT ALNVQLQAAA
RLLNRDPERA AQALAICREE AQAALNEVRQ SVAVMRNAPV NGRPLPEVIA KLVADFKRVS
PLHVQFVVEG EIGELPLTVA MALYRAVQEG LTNAQKHGQG TTVTVRLIGE VGQVRLEVVN
DGPPAPPVAE TGFGLAGLRE RAARLGGTLH AEPLPAGGFR LAMVVPHVQT EEKPYDPHSV
GR