Gene Cpha266_0079 details


Gene Information

Locus tag: Cpha266_0079
Symbol:
ID: 4570646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 91430
End bp: 94393
Gene Length: 2964 bp
Protein Length: 987 aa
Translation table: 11
GC content: 40%
IMG OID: 639764681
Product: histidine kinase
Protein accession: YP_910573
Protein GI: 119355929
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
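
The coordinates and lengths listed above agree with one another, and a little arithmetic confirms it. The following is a minimal Python sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(91430, 94393)
print(length_bp)                  # 2964
print(protein_length(length_bp))  # 987
```

Both results match the Gene Length and Protein Length fields, which suggests the assumptions hold for this record.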


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.217254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAAT TTCGAACGAA AGCAAGGGCT GTTGATATGC TCGGCAGACA GCAGATAGCA 
GATGCATCAA CCGCTATCAG TGAGCTTTTC AAGAATGCTC ATGACGCTTA TGCAGACAAT
GTGGAAGTTG ATTTGTTTAA ATCAGACTCT TTATTAGTAA TTCGCGATGA CGGAATTGGA
ATGTCACCAT CAGAGTTTGA GGCAAATTGG CTGGTTTTGG GAACCGATAG TAAATTTTCT
TCTGCGGGAA AGCTACACGC TTACCGTCCC TCGAACAAGC CTGAACGGGC GGTTATGGGA
GAAAAAGGTA TCGGACGTCT TGCTATTGCA TTTCTTGGGC CTCAGGTACT GGTTCTTACA
CGCTCAGAAA AATTAGATCA TAATGATACC CTCACCATGT GCTATTTGCA TTGGGGTTTA
TTCGAACAAC AAGCTTTAAA TCTTGATGAT ATAGATATTC CAGTCAAATC CATTTCTGGT
GGTGAGTTGC CATCACTGCA AGATGTATCC GAATTATTGC TGGAAAACAC CAGGAATGTT
GAGCAACTTC AGAAACGATT CCCACAATGC GATTTATGCT CTATTATTGA TGATCTTTCA
GACTTTCAAG TTGATCCAAG AGATTTCGAG AAATCCGTTC AAGGGCTTTC TCTGTCAGAT
CACAAATGCG GAACTCACTT CTATATTGCA ACAGCAAATG AGGTAATCAT AGCTGATATC
GTGGCAGAAA AATATACTCT TACAAAAGAA TTCACTAAAT GTCTCCTTGG TTTTTGTAAT
TCTACCTTTG CTGAAACTTC CCCTCCTCCA ATCCAAACAA AATTCAGGTA TTGGCCAACC
GATAACAGGT ATGAAGATCT TATTGCTCCC AATGAGTTTT TTACAACAGA CGACCTGGCT
TTATCAGATC ATTTCGTAAG TGGTGAAATT GATGAGTACG GTCAGTTTAA TGGTATTGTT
CGCGTCTATG ATCAAGAGTA TCCTGATCAT GTTATCTCTT GGAAAGAGGG AGGAGGAAAA
CCAACAGAAT GTGGTCCATT TCGTGTAGAA TTTGGCTATC TGCAAGGAGC GCATCGAGAG
AGCATGGCTG ACCCTGATGA TTGGTCTATA CTCGATAGCA AACTTAAGCA AATAGGCGGA
TTGTATGTTT ACCGTGATCG TATCAGAATT CTTCCATATG GGAATTCCAA TGTTGATTGG
CTTGATATTG AACTAAGGCG CAACAAGGGA ATGGGATATT ATTATTTCTC TTATAGGAGA
ATTTACGGCG CTATTTGCAC AACAAGGAAG GAAAACGCTA CTCTACGGGA GAAAGCTGGT
CGTGAAGGCT TTCAAAAAGA TAAAGCATAC AGACAACTCA AAAGTGTCTT GGAGAATTTA
TTCATACAAT TGGCTGCTGA TTTTTTTCGT AAAGACGCAA CGCATGGTGA TTATTTTCAG
GAGCGTAAAG ATGAATTGGA CCGGTTGGAG CGTGCACGTA GAAAGCGGGA ACAACAAATA
ACGACAAAAC GTACCAAGCT TTCAGATACG CTCGATGTAT TTTTTTCTAA AGTAAACGAA
AGAATTCCGG AATCTGAAAT TGCTACACTC AGTCACCATG TTCAAAGCAG GATGCAGAGT
GCGTCTTTCA TGATGGATTC TGATACAGCA TCACAGGAAC TACTCGATGC AGAACGTGAA
GCGAATGAAA AGCTTGCTTT ACTGAGAACG ACTTATACCT TGATTCGGCC TCGTGGCGTC
GGATTAACAA AACAATTAAC AAGAGATTGG GATGCATATC AGCGGGAACA TAACAGGCTT
GAAATAGAAG TATTTGAGCC GTTCGTTAAG GATGTTGGGA AAAAACTTGG CTCGATGGCA
GCACAAGCCA AAATTTACAT TGATCAGCGT CGCCGTCTTC AGGAGCTTAT AAAGAATGTT
GCTGACGAGC AAAATGCGAA TGTGAAAAAT GAATCAAAAG TACTTCAAGA AACATCAAGT
CAGACACGTC GAGCAGCTGT TAACACCGCT CGTTCAGCTA TGAAAGAACT TCGCGATACT
ATTGAAGCCG TCAATGCTGA TCTTGCCCAT AGAGATCTAA ACGACCTGTT ACCTGAGCAA
ATTGAAGAAA TTCGTTCAAC TTATGAAAAA AGAATTGATT CAGTAGCATC ACGAAATGCT
GAAACTTTGG GAAGTGTTCG AGAGTTGCTT GCCGGTATTG TGGAAAGTCT TGAAAATAAC
ATGCAAAATA GTCAACTCGA CATAGTCGAA GCAATGGATA CAGAGCTTGA GTCGCTTCGT
GAACAAGCGG AAACAGATGA AGAATTGGTG CAATTAGGTC TTGCCGTCGC AGTTATCAAT
CATGAATTTG TAGCAGCGAT AAAGATGATT AGAGGTCAAT TGCGTGAACT CCGTTCTTGG
GCTATGGTAA ACAAAGATTT ATTACCAGTC TACCAAGAGA TTCGAACAAA TTTTGATCAT
TTGGATGCAC ATCTCAATTT GTTTACACCA CTACAGAGAA GACTTTACAA AAAACGAGTC
AATATTGAAG GCAAAGAGAT TATTCATTAT GTTCGTGCAT TATTCAATGT TCGCTTTGAG
CGGCATAAAA TTCATTTGGA AGCAACACAG GCATTTCTTG ATAGCCATGT AACAAGCTAT
CCGTCAACAA TTTATCCTGT TTTTGTAAAC CTTATAGATA ACTTTATTTT TTGGCTTAAA
GATAAACAAG GTGATCGTTT GATCTCGTTA GACTGTACAG ATAATTCATA TCATATAAAG
AATAATGGCC CAGCAATAAA CCGACGAGAT GCTGAATCAA TTTTTGAGCA AGGATTTTCG
CGAAAACCTG GAGGCCGTGG GTTGGGATTA TATATATCGA AAAAGGTACT GGAAAAAGAA
GGCATGACAC TTGCTTTGGA CAAAACATTA ACTATGGATT CAGGTGTGAG CTTTAACTTA
TCGTGGAGTG ACAATAATGA GTGA
 
Protein sequence
MAKFRTKARA VDMLGRQQIA DASTAISELF KNAHDAYADN VEVDLFKSDS LLVIRDDGIG 
MSPSEFEANW LVLGTDSKFS SAGKLHAYRP SNKPERAVMG EKGIGRLAIA FLGPQVLVLT
RSEKLDHNDT LTMCYLHWGL FEQQALNLDD IDIPVKSISG GELPSLQDVS ELLLENTRNV
EQLQKRFPQC DLCSIIDDLS DFQVDPRDFE KSVQGLSLSD HKCGTHFYIA TANEVIIADI
VAEKYTLTKE FTKCLLGFCN STFAETSPPP IQTKFRYWPT DNRYEDLIAP NEFFTTDDLA
LSDHFVSGEI DEYGQFNGIV RVYDQEYPDH VISWKEGGGK PTECGPFRVE FGYLQGAHRE
SMADPDDWSI LDSKLKQIGG LYVYRDRIRI LPYGNSNVDW LDIELRRNKG MGYYYFSYRR
IYGAICTTRK ENATLREKAG REGFQKDKAY RQLKSVLENL FIQLAADFFR KDATHGDYFQ
ERKDELDRLE RARRKREQQI TTKRTKLSDT LDVFFSKVNE RIPESEIATL SHHVQSRMQS
ASFMMDSDTA SQELLDAERE ANEKLALLRT TYTLIRPRGV GLTKQLTRDW DAYQREHNRL
EIEVFEPFVK DVGKKLGSMA AQAKIYIDQR RRLQELIKNV ADEQNANVKN ESKVLQETSS
QTRRAAVNTA RSAMKELRDT IEAVNADLAH RDLNDLLPEQ IEEIRSTYEK RIDSVASRNA
ETLGSVRELL AGIVESLENN MQNSQLDIVE AMDTELESLR EQAETDEELV QLGLAVAVIN
HEFVAAIKMI RGQLRELRSW AMVNKDLLPV YQEIRTNFDH LDAHLNLFTP LQRRLYKKRV
NIEGKEIIHY VRALFNVRFE RHKIHLEATQ AFLDSHVTSY PSTIYPVFVN LIDNFIFWLK
DKQGDRLISL DCTDNSYHIK NNGPAINRRD AESIFEQGFS RKPGGRGLGL YISKKVLEKE
GMTLALDKTL TMDSGVSFNL SWSDNNE
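
Both sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any downstream use. The following is a minimal Python sketch of that step, together with a simple GC-fraction calculation. The helper names are hypothetical, and the demonstration uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGGCTAAAT TTCGAACGAA AGCAAGGGCT GTTGATATGC TCGGCAGACA GCAGATAGCA"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(f"GC fraction of first line: {gc_fraction(dna):.2f}")
```

Applying `clean_sequence` to the full gene sequence and then `gc_fraction` should give a value comparable to the GC content field, though the result for a single line can differ from the whole-gene figure.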