Gene Cagg_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3501 
Symbol 
ID7266429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4266956 
End bp4268365 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content54% 
IMG OID643568309 
Producthistidine kinase 
Protein accessionYP_002464776 
Protein GI219850343 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR01386] heavy metal sensor kinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCAAC GTAGTACGAC ATTACGCGTA CGCTTTGCCA TCTGGACGGG CGGGTTGCTC 
TGTGTCACAC TGTTGCTCTT TAGTGGGTTT GTTTATTGGC AAACCGCTCA AGGCTTGTCC
GAGTCACTTG ATACTGCTCT CCAAGCCAGC GCTCTCCAAA TCACGGCCGG TCTCAGTGGT
GAGCAGCTCA ATGTCGAAGA TGGTCAGATC GCACTCGATG AGCATCTGGC CGATCCGACG
TTATTGGCGC AATTGCAAAC CCAAGGGTTA ACGATCCGGG TTCTCGACCG CACCGGTGCG
ATTCGGCAAG CGGTCGGCGC TTATCACCAT GCACCGGTCG ATCCGGTTAG TTTGCAAGCA
CGCTCTGGGA CACCTGTGTG GCATACGCAG ATCCTCGCCG ATGGGACGAC AGTGCGGGTC
TATACTGTAC CGGTGTACGA ACACGACCAA TCGGTTGGCT TGATTCAGAT TATACAGTCG
CTTGAACCGG TAGTCGAAAC ATTGCAGCAA CTCCAAACCG CGTTCGCAAT TGGTATTCCT
GCGCTAACCT TACTGGCCGG ATTGGGGGGC TACTGGTTAG CTGCACGGGC ATTACAGCCT
ATCACCAACA TCATCCGTAC TGCCCAACAG ATTTCGGCCA CCGATTTACA TGCCCGCATC
ACCTTACCAC CAACTGACGA TGAAGTTGGT CGGCTGGCAG CTACGTTCAA CAGTATGCTC
GCCCGACTAG AAGATGCGTT TCGGCGTGAA CGGCAATTTA CCGCCGATGC GTCGCACGAA
TTGCGTACAC CGGTTGCCGC TATGGAAGCG ATTATTACCG TGACCCGTGA ACGTCCACGC
AGTGTCACCG AATATACCCA AGCGCTCGAT GATTTATTGG TGCAAACTCG CCGTCTTCGC
AGCTTGATCG ACGATCTGTT GCAACTGACA CGTAGTGAAA CCCGGTATCG CTCTACACAC
ACACCGGTCA ACCTATCACT CCTGCTCGAA GATGTTACCG AAACGATGCG TCCTCTTGCC
GAAGAGCGTG GGTTGGTCAT CGCTACCGAG ATTAGTCCCA ACTTAGTTGT GGTAGGTGAT
AGTGATAGTC TTATTCGCCT CTGGCTCAAT GTACTTGATA ACGCGATCAA ATATACGCCC
CGCGGTACGA TTACCCTCCG CGCCTCTCGT ACCGGTGATC AGGTCACGGT CACAGTAACC
GATAGCGGTA TCGGTATTGC GCCTGAACAT CTTCCCTTTA TCTTCGAGCG TTTTTATCGT
GTTGATCCTG CCCGCAGTGG TAATGGTAAC GGCCTTGGCC TCGCAATCGC CCGCGAGATC
GTGCGGCTCC ATCACGGTAC GATCACCGTT ACGAGTGAAT TGGGCCACGG CACGACCTTT
ACTGTGAACC TTCCGACCAG TTCCGGTTGA
 
Protein sequence
MIQRSTTLRV RFAIWTGGLL CVTLLLFSGF VYWQTAQGLS ESLDTALQAS ALQITAGLSG 
EQLNVEDGQI ALDEHLADPT LLAQLQTQGL TIRVLDRTGA IRQAVGAYHH APVDPVSLQA
RSGTPVWHTQ ILADGTTVRV YTVPVYEHDQ SVGLIQIIQS LEPVVETLQQ LQTAFAIGIP
ALTLLAGLGG YWLAARALQP ITNIIRTAQQ ISATDLHARI TLPPTDDEVG RLAATFNSML
ARLEDAFRRE RQFTADASHE LRTPVAAMEA IITVTRERPR SVTEYTQALD DLLVQTRRLR
SLIDDLLQLT RSETRYRSTH TPVNLSLLLE DVTETMRPLA EERGLVIATE ISPNLVVVGD
SDSLIRLWLN VLDNAIKYTP RGTITLRASR TGDQVTVTVT DSGIGIAPEH LPFIFERFYR
VDPARSGNGN GLGLAIAREI VRLHHGTITV TSELGHGTTF TVNLPTSSG