Gene Cpha266_2687 details


Gene Information

Locus tag: Cpha266_2687
Symbol:
ID: 4568866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 3080957
End bp: 3082621
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 51%
IMG OID: 639767254
Product: PAS/PAC sensor signal transduction histidine kinase
Protein accession: YP_913095
Protein GI: 119358451
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
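
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3080957, 3082621)
print(gene_len, protein_len)  # prints: 1665 554
```

Under these assumptions the result matches the listed Gene Length (1665 bp) and Protein Length (554 aa).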


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA ACAGAACAGC AGATGCCGAC GACTTCTCCG AAAAGCGTCT CCAGGCAGAA 
GCGATCCTGC TGCATGAGCG AAAAAAAAAA GTTGATAGCC TTGAATCAGT CGAAGATCGT
CTTCGCATTA TCCATGAGCT ATCGGTCAAC CAGATTGAGC TTGAAATGCA GCAGGACGAA
CTGCTGCAGT CGAGAGCAGT TCTCGAAGCG GGGTTAAAGA GATATAACGA GCTCTACGAT
TTTGCGCCCC TTGGATACCT GACCATCGCC GCGGACAGCA CAATACGCAA GCTGAACCTG
ACGGCCGCAA CAATGCTTGG TCTCGATCGC TCTCTTTTGA AAGGCGACCG GTTGGGACGG
TTTATTGTCT ATGAAGATCT TCCGGTTTTC AATGCCCTCA TAAAGAGAGT CTTCGCCACT
CGGGAGAGCG GATATTGCGA GGTGATGCTG CTTGACCATG CCAGAGATAA ACAGGAGATC
GCAACCTCCG GCCCGCGTCA ACGACGCATG GTTCGTATCG ATGCCATAGT TAACAGCGAA
AAACAGGAGT GCTGGGCCTT TCTGACGGAC ATCACCATTC AAAAACAGCT TGAGGACTCG
TTATGGGAAA GCGACCGCCT CTATCGATGC CTGATTGAAA CGGTCAGCGA AGGCGTTCTT
GTTATTCACG GCGACCATTT GCGTTTTGTG AATCCGATCG TATCGGAAAT GACCGGTTAC
ACTGAAGCGG AGCTGCTCTC CTTTTCGTTT ACCGATATGA TGCATCCTGA TGACAGGGAG
CGGGTAAAAC ACCACCACCT CAAATGTCTC AAGAGCGACC TGCCCGACCT GAGAATTGAG
TTACGGATCA TCAAAAAAAA CGGAAGGATC CTCTGGATCG AAATGGGCGG GTTAAAAACC
GAATGGAACG GCAAGCCCGC AATGCTATAC GTTTTGATAG ACATCACCGA GCGAAAAGTC
CTTGAGGAGA AGCTGCAAAC CGAAAAGCAG CAGCTTCTTG ATACGCTCAG AACCACCGAT
CAATACCAGG CTCAACTGCA GGAGCTCAAC AGCAAAATCA AGGTCATGTC GGAAGTCGAG
GAGCGCTCCC TGCTCTACCG TGACCTGCAC GACGGAGCAG GCCAGTCGCT GCACGCGGTA
TGTCTGCATC TTAAAATGAT TGCGGATGGT CGCGGAGGAT ATGGAGACCT CAAGTCGCTC
GCATCTGAAC TTGCTGGTGA AATTGCCGAT ATCTCTGCTG AAATCCGTGA TATTGCCCAT
CACCTCCGTC CTGCCTATCT TCAGGAAATC ACCCTTGATC GAGCCATTAT CAAGCGCTGC
GAGATGCTCG GAAGACGAGG GGTTCCAATC AGCATCAGTT GTGTCGGCGA TTTCAGTTCC
CTCTCCTGTC AGGTCAGCGA AAACCTCTAC CGTATTTCCC AGGAGGCGAT AGCAAATGCC
GACCGCCACG CGGCGGCAAC CCTGATCACG GTACGCTTAA CCCGTGTTGA TAATGCGTTA
ACACTGCTCA TAGCCGATAA TGGTTGCGGA ATAAAAGACG TTTCGACAAA TAAAGGTGTT
GGACTGCGAA TCATAGAGGA ACGGGTTTCG CTCATAGGCG GAAAGCTCGA CATGGCATCC
ACCGCTTCGG GCACCACAAT TACCGTGACG CTGGAGTTGC CATGA
 
Protein sequence
MKKNRTADAD DFSEKRLQAE AILLHERKKK VDSLESVEDR LRIIHELSVN QIELEMQQDE 
LLQSRAVLEA GLKRYNELYD FAPLGYLTIA ADSTIRKLNL TAATMLGLDR SLLKGDRLGR
FIVYEDLPVF NALIKRVFAT RESGYCEVML LDHARDKQEI ATSGPRQRRM VRIDAIVNSE
KQECWAFLTD ITIQKQLEDS LWESDRLYRC LIETVSEGVL VIHGDHLRFV NPIVSEMTGY
TEAELLSFSF TDMMHPDDRE RVKHHHLKCL KSDLPDLRIE LRIIKKNGRI LWIEMGGLKT
EWNGKPAMLY VLIDITERKV LEEKLQTEKQ QLLDTLRTTD QYQAQLQELN SKIKVMSEVE
ERSLLYRDLH DGAGQSLHAV CLHLKMIADG RGGYGDLKSL ASELAGEIAD ISAEIRDIAH
HLRPAYLQEI TLDRAIIKRC EMLGRRGVPI SISCVGDFSS LSCQVSENLY RISQEAIANA
DRHAAATLIT VRLTRVDNAL TLLIADNGCG IKDVSTNKGV GLRIIEERVS LIGGKLDMAS
TASGTTITVT LELP
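
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation such as rechecking the GC content listed under Gene Information. The following is a minimal sketch of that cleanup and of a GC percentage calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Sample input: the first line of the gene sequence from this record.
first_line = "ATGAAAAAAA ACAGAACAGC AGATGCCGAC GACTTCTCCG AAAAGCGTCT CCAGGCAGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` would give the value to compare against the GC content field, which is reported rounded to a whole percent.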