Gene Cphamn1_1693 details

Gene Information

Locus tag: Cphamn1_1693
Symbol:
ID: 6375380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1832389
End bp: 1834215
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 49%
IMG OID: 642684187
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_001960093
Protein GI: 189500623
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
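
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1832389, 1834215)
print(length_bp)                  # 1827
print(protein_length(length_bp))  # 608
```

Both values agree with the Gene Length and Protein Length fields of this record.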


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.575555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000125186
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGAAAAACG CTAAGGTTTC GTTCCGGCTT GGGCTGGTTT TCGGAATTAT CATTTTTTTT 
ACGGTTGCCA TCAGCTACTT CTATCTTGGC GAACATCTGC GTTCGCTTTT CTACGATGGT
ATGAAAAACC GTTTATACAA GGAGCTGCAG CTGAATCGAA CGCTTCTTGA CGAGCGGCCG
GCACAATGGG ATGATGTCCG GCTTTCCGAT CTCTGGGCTG ATGATGTTGG TGAGGCTCTG
GAGCTCAGGG TGACGCTTGT CTCTCTCAAT GGTGAGGTTA TCGGAGATTC ATACATTGAT
GCGGATCGTC TCTCGCTGGT TGAAAATCAC AGTAACCGTC CTGAACTGCA ACAGGCACTT
GCTGAAGGCT CCGGTGAAAG TATACGGTAT AGTGATACTG TTAAGGAGGA GATGCTGTAT
ATAGCTGTTC CTCTTGGTAG AGAAGAACCC TTTGCCATTC TGCGTTTTGC CAAACCGCTG
AGTGACATCA GACTGCTTGA GGCTGAGCTT CGCAAAGGTA TCGAAGGCGC CTTGTTCTGG
TCTTTGCTGC TCTCTCTTGT TTTCGGTGTG CTTACCGCGC TTTTTCTCTC AAGGCCGCTT
CGCGTAATCG CTGACGCCGC TGACAACATC CTGCATGGAG ATTACGCTTC CAGGCTTGAA
ATCAATAGTG ATGATGAGAT CGGCCGGCTC GCCCGGGCCG TGAATTTCAT GTCGGATGAA
ATGAAGCGCA TGAACCAGCA GGAAGAGTGG TTCAAAGCAG TTTTTTCAAG TATTCGAGAG
GCTATTATCG TTACAGACGC AAATGGTAAT ATCAAGCTTG GAAATCCCGC GGCATGCAGG
CTGTTCAATC TTGATTGCGC TTTGATGATC AAGACCGGTG CAGGACACAA GGCGACTGAT
TCCCGGCTGT TTGAACTTCT TGAGCGGATG ACGGCCTCAG GTTCAACTGT TGAGAAAGAA
GAGGTTACCG TCATGACCGA TCGCGGTGAA AGAGTTTTGC AGGTCAGTGC CATGCCGATT
ATCAAGAGGA AAATCCCTGA GGGAACGGTT TTTGTGATGA ACGATATTAC CAAGCTGCTC
AACCTCGAGA GGATGAGGAG AGATTTTGTT TCAAGCGTTT CTCATGAACT CCGCACCCCG
CTTACAAGCA TAACGGGGTA TACTGAAACG CTGTTGGAGG GGGCGATTGA TGATGCTGAA
AACGCGAAGC ATTTTCTTCA GATAATTCTA CAGGAGAGTG AGCGGTTGAC CGCGCTGATC
AACGACGTTC TTGACCTCTC GAAAATAGAG TCCGGCAGGA TAGAATACAC GTTTGCTCCT
GTATGCCTTC GGGCTATCGT TGATCAGGTC GTGGATCTTT TCGGCCGGGC GATAGAGAAA
AAAGGTATTG TTCTTAGAGT TCAGATTCCT CAGGATCTTC CGTATGTTAA CGCTGACAAG
GGGTATCTTG AACTTGTGCT CAGAAACCTT CTGGACAATG CAATTAAATA TGTCGCTGAA
CAAAACGGCC AGATCATGAT CAGTGCCTCG ACGGTCGACG ATATGGTAAG AGTCGAGGTT
AAGGATAACG GCATCGGCAT TCCCAGGAAA GATCTTGGCA GAATATTCGA GCGATTTTAC
AGAGTTGACA AGGCAAGGTC GAGAGCAGTC AGCGGGACCG GTCTCGGGTT GTCTATCGTC
AAGCATATTG TGCTTGCCCA CAAGGGAAAG GTTGAGGTAC GTTCCAGGTT GAATCTCGGG
TCACAGTTCA GCTTTGTGAT TCCGATTGCC GGACAGAGCG GGAACAGTAT GGGCGCGACA
GTTGACAGGA GGGCGGCAGA GGCCTGA
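
The gene sequence above is printed in space-separated blocks of ten bases, so any recomputation of the GC content field has to strip that whitespace first. The following is a rough sketch under that assumption. The helper name `gc_percent` is hypothetical, and the example call uses only the first two blocks of the sequence rather than the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks between blocks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 8 of 20 bases are G or C.
print(gc_percent("GTGAAAAACG CTAAGGTTTC"))  # 40.0
```

Passing the complete gene sequence as one string, line breaks included, gives a value that can be compared with the GC content field of the record.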
 
Protein sequence
MKNAKVSFRL GLVFGIIIFF TVAISYFYLG EHLRSLFYDG MKNRLYKELQ LNRTLLDERP 
AQWDDVRLSD LWADDVGEAL ELRVTLVSLN GEVIGDSYID ADRLSLVENH SNRPELQQAL
AEGSGESIRY SDTVKEEMLY IAVPLGREEP FAILRFAKPL SDIRLLEAEL RKGIEGALFW
SLLLSLVFGV LTALFLSRPL RVIADAADNI LHGDYASRLE INSDDEIGRL ARAVNFMSDE
MKRMNQQEEW FKAVFSSIRE AIIVTDANGN IKLGNPAACR LFNLDCALMI KTGAGHKATD
SRLFELLERM TASGSTVEKE EVTVMTDRGE RVLQVSAMPI IKRKIPEGTV FVMNDITKLL
NLERMRRDFV SSVSHELRTP LTSITGYTET LLEGAIDDAE NAKHFLQIIL QESERLTALI
NDVLDLSKIE SGRIEYTFAP VCLRAIVDQV VDLFGRAIEK KGIVLRVQIP QDLPYVNADK
GYLELVLRNL LDNAIKYVAE QNGQIMISAS TVDDMVRVEV KDNGIGIPRK DLGRIFERFY
RVDKARSRAV SGTGLGLSIV KHIVLAHKGK VEVRSRLNLG SQFSFVIPIA GQSGNSMGAT
VDRRAAEA