Gene Cagg_0244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0244 
Symbol 
ID7267424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp305790 
End bp306965 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content55% 
IMG OID643565113 
Producthistidine kinase 
Protein accessionYP_002461628 
Protein GI219847195 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.300489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGGC TACACCAACT CCGCTGGAAG CTGTTCGTTT CCCACCTTAT TATTGTGCTA 
ATGGCGTATG TGGTCCTACT GGTCACTGCA AATGTGTTAG CCAATCTGGG TCTAACCGGT
TTTGCCCCGT TGACGCTTGG CGCTGCTGCC TCCGAAACGG GTCAGCTCGG TACAGATACG
GTCAGTACAA CCAATGCGTT GCAAGAACAA TTTCAGTCGG TAGTCCAGCA GTCGCTCTTG
ATTAGTGGCT TTGCCGCCCT TGCTGCTGCC GTGGTCGTTA GTCTGTTTGT CTCACGGCGG
ATTGTCGAAC CGATTCAGAC GTTGTCGCAG GTTAGTCGCC GGTTGGCGCA GGGATTTTAC
CGCGAACGAA CGATCATCTA TGCCGATGAT GAGATTGCAC AATTGGCGCA GAGTGTGAAT
CAGTTGGCCG ATGCGCTCGA TCAGACCGAG CGTCGCCGGT TGGCACTGCT CGCCGACGTG
ACGCACGAAT TGCGGACACC GCTCGCAACC ATCGGCGGCT ATATGGAAGG GTTGGTTGAT
GGGGTAGTGT CGGCAAATCC GGCAACGTTC AACCTGATCT TACGAGAAAC ACGCCGTCTC
CAACGCTTGA TCGAAGACCT TGAGTTGCTG TCACGGGTTG AAGCCGGACA GTTACCGGTA
ATTGCGCGCG CCATCGATCT ACGACCGGTG ATCGAGGAGC AGATTGCTCA GTTTGAGCCG
TTGTTCAGTA GTAATCAGGT GAACCTCATC CTTGATATGC CAGAGCAAGT ACCGCAGGTG
TGGGCCGATC CCGATCGGGT GGCGCAAGTG TTGATCAATA TTCTGGTCAA CGCTTGTCGC
TACACCCCAC CAGGTGGTAG TGTCACAGTA CAGGTGCGTG TCGATGACCA CGAAGTACGG
GTTGCCGTGA TCGATACCGG TATCGGGATC GCTGCCGAGC ATTTACCGCA TGTGTTTGAA
CGATTTTATC GCGTGGATAA ATCGCGTGCG CGGAATAGTG GTGGGAGCGG GATCGGGTTG
GCAATCGCCC GTCATCTTAT TTATGCGCAG GGTGGTGAGA TCTGGGCAGA AAGCGATGGT
CTTGGGAAGG GTGCGCGCTT TATTTTTACC CTGCCAATCG CGCCGCAGAT GGCGACGGTG
CCGGTTGAGC CTGTGGTCAT ATCAGAAACA GCATGA
 
Protein sequence
MKWLHQLRWK LFVSHLIIVL MAYVVLLVTA NVLANLGLTG FAPLTLGAAA SETGQLGTDT 
VSTTNALQEQ FQSVVQQSLL ISGFAALAAA VVVSLFVSRR IVEPIQTLSQ VSRRLAQGFY
RERTIIYADD EIAQLAQSVN QLADALDQTE RRRLALLADV THELRTPLAT IGGYMEGLVD
GVVSANPATF NLILRETRRL QRLIEDLELL SRVEAGQLPV IARAIDLRPV IEEQIAQFEP
LFSSNQVNLI LDMPEQVPQV WADPDRVAQV LINILVNACR YTPPGGSVTV QVRVDDHEVR
VAVIDTGIGI AAEHLPHVFE RFYRVDKSRA RNSGGSGIGL AIARHLIYAQ GGEIWAESDG
LGKGARFIFT LPIAPQMATV PVEPVVISET A