Gene Cagg_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1678 
Symbol 
ID7268980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2048355 
End bp2050535 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content57% 
IMG OID643566520 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_002463015 
Protein GI219848582 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.62189 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAC CACAGCGTGA ATTGAGTGTT GGTGAACAGG TAAATATGCG CGTCATTCCT 
GCCGAGCGGG TGCGACGTTG GATGCGAGCG AGCGCGGCAT GGCTGTCGTT ACAAGAGCCG
GAGCGACTGT TTCCGGCATT GACCCAGAGT TTCATCGAAG TACTGCCGGA GATACGAGCG
GCAATTCTCT GGTTGGTGCG TGCAAGCAGT TTACAACCGG TTGCCCAAGC AGGTTTGACC
ATGCCGGCTG CCTTGACCGA GCAGTGGCTA AAGATCAAAC TGCGGCCCGG TGAGGGGGTG
GCCGGTCTGG CGTGGCAGCG CGAGACTACC GTCCAACAAC ACGGCGAGCA TGGCTATCGT
GAATTACAAG GATTGGCCCC GCCACACGTG CAGGCTATTT TTCAAGCAAT GAGCGACCTG
TTGCCGCGAA GTCTGACCGT CACTGCGACG CCATTGCGTG CCGGTCACGT GCTGGTTGGG
GTCTTGGAGT TGATCGGTTG CGATGCGAAA GCGCTCGATG TTGAGCCGGA CGATCTCGAT
ATGATCGCCA GCATCGTGGC CGCAGCGATC CGCAATGCCC AGTTGTACGA TGAGATTCGA
CGGAGCAACC AGCGGCTCAA GGCGTTCGAT GCGGTCGTGA CTTCGATTAG TACGGCTGCC
GACCTGCCCG ATCTCGTGCA AAGTGTGCTG ACGGTGGTGC TCGAACTCAC TCCGGCGCGC
AGCGGCGCGC TCTTGATTTT CGATCCGGCC CAAGAGTGCC TCCAGCTTAG TGCGTGGCGT
AACCTTGATC GAGCGGTCTT GAGCAGTTTC GATCAAGTGC CGGTCGATAC TAGCCCGTGC
GCCGAGGTGG TACGTTACGG ACAACCGGCG TTCCGTCCGC TGCTGATTGA GCGCGGCGAA
GAGGCGTTGC TCGCAGCCGG AATGGTGGAG GCCGCCTATT TACCACTCTT GGCGGGTGGT
ACGGTTACCG GAGTGCTGGC CCTGTTTGGC GAAGTCAATC TCAACCGTTC GCTCGATAAA
GATATGCTGA TGCCGATCTG TAATCAGGTT GGTTTTGCCA TTGCGAACGT GCGTCTGTAC
GAAGACAGCC AGCGCGAACG ACGTAAGTTG CATACTGTCG TGGAATCGAT CGCCGAAGGC
GTCTTGCTCT GTGATCGGCA TGGACGGTTG ACGCTCGCCA ATCAGGCAGC GCAAGAACTG
CTCGATGAGG CAGTGCTGAG TTTCGAGACA CCATTAGCTT CAATCCCTGA GCTATACGAT
CTCCGCGATC TCGACGGCAA TCCGCTCACG CCGGACGATC TGCCGTTTAC GCGCGCGCTT
CGCGGCGATA CCTTCTACGA CTACCGGCTG ATGCGGCGGA AGCCCGATGG GAGCGAACGA
TTTTTAAGCT TTACCGGCGC CCCGGCCATT AACGAGCAAG GTGAAGTTGA AGGGGCAGTA
ATCACGTTAC GTGATATTAC GGCCAACCAG AAGGTACAAC GAGCCAAAGA CGAATTTCTG
GCCGTGGCTG CCCACGAACT CCGTAGTCCG CTGGCGGCAG TTCGGAGTTA TGCCGATCTG
TTGTTACGGC GTGAACAGCA ACGCGAAGGC GATGCCCGTG ATCTTCACGG CTTGACGATT
CTGACCCAGC AAGTGTCCCA TATGTTACGA CTGGTTGATA ACTTGCTCGA TGTCTCTCGT
CTGGATGCCG GCCAATTCGA TCTGCAATAT CAGACGGTGA ACCTCGTTAC GCTTGCGCAG
CAAGTCATCG ATCAGCAACG ACCAAGCGCC GGCAACCGAG AATTGTTGCT CGAGACTGAG
GCGCCAGAAT TGTGGATCTC GTGCGATGCA GTGCGCATCC GACAAGTGTT GACCAATCTC
CTGAACAATG CGCTTAAGTA CAGTCCAGCC GGTAGCGTGG TCAGTGTACG TGTCCGCAGT
GCATCATTAC CCGCTAACAA TGCTCCGGCG GCGCTGATCA GTGTGAGCGA TCAGGGACCG
GGAATTCCGG CGAGTGAGCA AGAACGAGTC TTCCAACGCT ATTATCGGTC GCCGGGTCGG
CGCGGTGAAG GGCTGGGACT AGGGTTGTAT CTTAGTCGGG AGATTGTGCA GTTGCACGGC
GGGCAGATTT GGATCGAGAG TCGTGAAGGA CAGGGAAGTA CGTTTATGGT GCTCTTACCG
AGTGAGCGTC CGCAAGGGTA A
 
Protein sequence
MTEPQRELSV GEQVNMRVIP AERVRRWMRA SAAWLSLQEP ERLFPALTQS FIEVLPEIRA 
AILWLVRASS LQPVAQAGLT MPAALTEQWL KIKLRPGEGV AGLAWQRETT VQQHGEHGYR
ELQGLAPPHV QAIFQAMSDL LPRSLTVTAT PLRAGHVLVG VLELIGCDAK ALDVEPDDLD
MIASIVAAAI RNAQLYDEIR RSNQRLKAFD AVVTSISTAA DLPDLVQSVL TVVLELTPAR
SGALLIFDPA QECLQLSAWR NLDRAVLSSF DQVPVDTSPC AEVVRYGQPA FRPLLIERGE
EALLAAGMVE AAYLPLLAGG TVTGVLALFG EVNLNRSLDK DMLMPICNQV GFAIANVRLY
EDSQRERRKL HTVVESIAEG VLLCDRHGRL TLANQAAQEL LDEAVLSFET PLASIPELYD
LRDLDGNPLT PDDLPFTRAL RGDTFYDYRL MRRKPDGSER FLSFTGAPAI NEQGEVEGAV
ITLRDITANQ KVQRAKDEFL AVAAHELRSP LAAVRSYADL LLRREQQREG DARDLHGLTI
LTQQVSHMLR LVDNLLDVSR LDAGQFDLQY QTVNLVTLAQ QVIDQQRPSA GNRELLLETE
APELWISCDA VRIRQVLTNL LNNALKYSPA GSVVSVRVRS ASLPANNAPA ALISVSDQGP
GIPASEQERV FQRYYRSPGR RGEGLGLGLY LSREIVQLHG GQIWIESREG QGSTFMVLLP
SERPQG