Gene Cagg_1768 details

Gene Information

Locus tag: Cagg_1768
Symbol:
ID: 7267680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2161354
End bp: 2163354
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 56%
IMG OID: 643566609
Product: histidine kinase
Protein accession: YP_002463104
Protein GI: 219848671
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
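
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that adds no residue to the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2161354
end_bp = 2163354

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "2001 666"
```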


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.686261
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0755696
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACGCT ACCAACGGAC TGCGCTCGCG TTTCTCGGTC TGCTCTCAAT AGTAACCCTG 
ATCGCTGCGC TCAGCACGGT CGGTGATGCA ACGTTGCGCA ACGACTGGCC ACGATTGGCC
GGGATGTTAG CGTGTCTTGG CGTAATGCTC GGTTTGGCGT GGCTGACACC GCTCCGACTG
CTGAAGGTTG GCGGGCGATT AACGCTGATC GTCATTGAAT TGGCCGCCGC TGCAGGTGCC
CAACTACTCA CTGCAGCACC GTTAATCGAC TATATCTATC TCGTACTCGT CCTGCAGGGA
ATCATCCTCT TCAGGCCATG GTTGTGGGCA TTGATGGCGG TGAGTGTGTG GATCATTTGG
GCGATTGTAC GCTATCAACT CTCGAACGAT CTATTGATCT GGCTACAAAG CAACCTTGCG
ATTGCCTTTC CCGCGGTTTG CGCCATTATT GCAGCGTGTA TCTATGCACG CCACGTTCAC
CGTAGCGAGC AGATGCAGCA GATGCTCCAA CAGATGCAGC AACGCTATAC GTCGCTATCG
ACATTATTAC GTGATGTACA GCAGCGTGTC GCGCGGGAGG AACGTCAGCG GCTGTTGAAC
CGGCTCATCA GTGAGGTACA GCAGACATTG GTCTATGCGG AGCAAGGGCT GACTACAGCC
TTGGCGATGG CTCAATCAAA CCTTAACCGT TTACAAACCG CACTTGATGT ACCACGAACG
GCAACGGCAA CGGCGATTGC CCGACTACGG GCGACCGTGC AGACGCTCCG CTACGTGCCA
AATGACCCAA AACCGACACC GTATGGAATG CTGGCCGGGG TGTTTGATGA AGGGCTGATC
TCGCCGTTGC CCAACAATAT ACTGGCTTGG TTGCTGCCAT CTCTCTTTGT AAGCCTATCT
TTAGGGCTGG TATTACTCCA ATCCTGGCCG CCCTCGCTAC CGATACTGCG CTGGCTCGTA
GTACTAGGCG GTCTATTGAT CGTTACCAGT GCCTGCACAC AGTACGTCCG TCGTTCCGTC
TTCATCCAGC TCGGCCTAGT AGTACAAACA ATTACGATAA CCCTCATGGC CGCCCTCACC
AACCTGCTTC CCCTGTTGTG GGGGCTGTTG TTGGTCGCCT GGCAGATGAC CAGTCGCCTC
TCACGGTGGC AACTGCTCCT TTTCAGCGGA ATATGGCTCC TGTTGCTAAC GGTGATCGTC
ATCATCCAGC CAATCTTTCT CGACCTTACG ACCATCCTCA GCTTATTGGT GGCGATGCTA
CTCGTCAGTG GCCCATTGTT GTTGGCTCGA CGCCAATTAC GTCGTCGCCA GCAGGTCGCA
CAGCAGGTTC AGTTGCTTGA AACCGAAATA AAACAGCAAA CCGACGAGGT GCAGCGGATT
ACCATCGCCG CCGAACGTTT GCGTCTTGCC CGCGAAGTAC ACGACGATCT CGGTTCCAAA
CTCGTGTTAA TGAATCTTGA ACTGCAACTC GCCACCGAAC TGGCCGGCGA AGATCCGGCT
AAGGCGCGTG ACCATTTAGC GAACAGTCGC GAATTGTTGC ATAGCGCATG GCGCAGTTTG
TTGGCAGTGG CCGACGCTGA GTTACCGTTT CAACCGGCAA CGCTAGTCCC AGCGCTACAC
CGGCTAACCC AACAATGCGC ACAGAGCACC CAGGCTACCG TGACTATCGA CATTGAAGGC
GATATGGCGC AATTACCGTC ACAGGTAGCG CATTGTATCT ACCGCACGGT GCAAGAGGGG
TTGACCAACG CCTGCAAACA CGCCCGCGCT GCCACCATGC ATGTCCAGGT GCGTGCAGCC
GACGGCTACG TGGTAGTTAC CGTCACCAAC GATAACCGTC CACACCAAGT ATTACCCCCT
GTTGATTTGG GAAATGGCAG TTTTGGTCTA TTAGGTCTGC GTGAACGGGC TGAGGCCCTC
GGTGGCGGGT TAGAGGCGGG ACCACTCGCC GAGGGTGGCT GGCGACTACG GCTGGTATTA
CCCTACGAAG GTGAGGAATA G
 
Protein sequence
MARYQRTALA FLGLLSIVTL IAALSTVGDA TLRNDWPRLA GMLACLGVML GLAWLTPLRL 
LKVGGRLTLI VIELAAAAGA QLLTAAPLID YIYLVLVLQG IILFRPWLWA LMAVSVWIIW
AIVRYQLSND LLIWLQSNLA IAFPAVCAII AACIYARHVH RSEQMQQMLQ QMQQRYTSLS
TLLRDVQQRV AREERQRLLN RLISEVQQTL VYAEQGLTTA LAMAQSNLNR LQTALDVPRT
ATATAIARLR ATVQTLRYVP NDPKPTPYGM LAGVFDEGLI SPLPNNILAW LLPSLFVSLS
LGLVLLQSWP PSLPILRWLV VLGGLLIVTS ACTQYVRRSV FIQLGLVVQT ITITLMAALT
NLLPLLWGLL LVAWQMTSRL SRWQLLLFSG IWLLLLTVIV IIQPIFLDLT TILSLLVAML
LVSGPLLLAR RQLRRRQQVA QQVQLLETEI KQQTDEVQRI TIAAERLRLA REVHDDLGSK
LVLMNLELQL ATELAGEDPA KARDHLANSR ELLHSAWRSL LAVADAELPF QPATLVPALH
RLTQQCAQST QATVTIDIEG DMAQLPSQVA HCIYRTVQEG LTNACKHARA ATMHVQVRAA
DGYVVVTVTN DNRPHQVLPP VDLGNGSFGL LGLRERAEAL GGGLEAGPLA EGGWRLRLVL
PYEGEE
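
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC percentage that could be compared with the GC content field; `clean_sequence` and `gc_percent` are hypothetical helper names, not functions provided by the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGCACGCT ACCAACGGAC TGCGCTCGCG TTTCTCGGTC TGCTCTCAAT AGTAACCCTG"
dna = clean_sequence(first_line)
print(len(dna))  # prints 60
```

To check the full gene, paste all displayed lines of the gene sequence into one string and pass it through the same two functions.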