Gene Cpha266_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1787 
Symbol 
ID4571149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2033726 
End bp2036521 
Gene Length2796 bp 
Protein Length931 aa 
Translation table11 
GC content45% 
IMG OID639766370 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_912228 
Protein GI119357584 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAATA CTTCAGAAGA AGACCAGCGT CAACAATTGC TGAGGGAAAA TGCCCGATTA 
AAAGCGCTTC TGCGTTCCGG GGATGAACAG CCGAAAGCTC TCGATGGGCT GAATTCTTCG
GAAGATCAGT GCCTCCATCC CGGAAGCTTG CTGACAGGCG GTTCTGTTTA TCAGTTCAGT
TGGAAAAACA AGCTGCAAGG GCCAGTTACC TTTGTTTCTT CAAATATTCA GCAATTGCTC
GGGTATACTT CCGATGAGTT TACCAGTGGT CAAATCAGTT ATGGTTCACT GATTTATCCC
GATGATCTGG CGACTTTTGT TGAAGAGCTT CACAGGAGTA TTGAACGGAA TATCGATTCT
TTTGAGCAGG AATACCGACT CAGGAAAAAG GACGGCACGG TGTTCCGGGT TTGTGATTAT
ACAATTGTCT TACGCGATAA AAAAAGTAAT ACGCTTTGCT ATGAGGGATA TATCATAGAT
GCGTCAACGA AAACATGCTT TGAACCTCTG TTTGATACGA TTGACGATTT TCTGTTCATT
GTCGATAGGG ATGGTTTGGT TATTCACTCG AATGAGGCCG TAAAAAATCG GTTGGGATAC
TCTTTGGATG AGTTGGTTGG AAAAAATATA GAGTATTTTT TCGGTGACGA TCAACAGAAA
GAGATACATG ATAAAATCGA AGGTCTGCTT TTTGGCCGCA ATACCTCTTT TCGGGTTCCT
CTTTTGACAA GATCCGGAAC GGCAATTCCT GCCGAAACCA CAATCGCCAA AGGTAACTGG
AACAACAGAA CGGTTATATG TTGCAACAGC CGAGATATTT CTGATCAGAT CCGACAGGAA
CAGGCTTTGA TTGAAAGCGA GAGACGGTTC AGAGACTTGA CCGAAATGTT GCCGCTTCCA
TTGTTTGAAG CCGATGTAAA TGGTATGGTT ACCTATACCA ATAGTCAAGG TGTTGAGGCT
TTTGGATACA CCCCTGAAGA TTTGCATCGG GGTGTTTCGG TATTCAAATG CTGTATTCCT
GAAGAGTCGG GAATCGTTAG CGCCAATTTT GAGAGCATGA AAGCCGGAAG CCGGATGTCA
ACCGGTAACG AATATACTGC CCTCAGGAAA AACAACACTA CGTTTCCGGC TCTGCTTTAC
AGTACTCCGA TTATTCGGAA TGGTTTGTTT GCAGGCGCTC GCGCTATCGT TATTGACCTT
ACGAAGCTGA AAAAAGCAGA GTCAGTGCTT GGAAACAGTC GTTTGCAGGA GAGGATGGTC
AGGGAGTTGC AATCGCTGAT TGATAATATT CCCGGAGCTG TTTATCGCGT TAACAGCAGG
AACGAGACAA CGATGCTCTC CATGACAGGC GATTTTTTGC TGGATTATAC CCGGGAGGAG
TTTGAAAAAG AGCTGTTTCC TTCCATGGCC ATTATTTATC CGGAAGATCG AGATCTGGTG
TTAACATCAA ATCAGTCACT CAGATCGGTA AAACGATCCG AAGCCCTCGT CTATCGTATT
GTTACGAAAA ACGGTTCTGT CCGATGGGTT GAAGATCGAA AAACATCTGC ATTTTCCCCT
GACGGCATGT TTTTGGGGAT AGATGGTATT TTGTTTGATA TTACAGAACG AATCAAGGCA
GAGGAGAATA AACAACTCCT TGAATCACGA CTCCGGAAAA CGCAGCGTCT TGAAACTATC
GGGACGCTTG CCGGCGGAAT TGCCCATGAT TTTAATAACA TTCTTACCCC GCTTCTTGGC
TATGCCGAAA TGGGGTTGAG CAGTTTGTCG AGTGAAAGTC CGCTTTACGA CTATTTCAGC
GAAATCATTC AGGCATCTGA AAGGGCAAAG AATCTCATCG CTCAAATTCT GACGTTCAGC
AGGCCAGGAG AGAGCAATCC CGCAGTCGTG AGTGTTCAGG ATATTATTGC CGAGTCGTTG
AAGCTACTGC GTCCATCGAT CCCTTCGACA ATTACAATTG TACAGGATCT TGATTTTTCC
TGTCGTAATA TTCTTGCCGA TCCATCGCAG ATACATCAGG TGATCGTCAA TCTCTGCACC
AATGCGTTCC AGGCAATGGA GGAGTCCGGA GGCGTGATGA CGATAGGCCT CAGGGAGATA
ACGCCGGATA AAGCTCTGAT GGCGGAATTT CCCGAACTGC ATGAGCATGA AAGCTATCTG
CAGCTCAGTA TTTCAGATAC CGGAAAAGGT ATGGATGAAA AAACCATGGA GCGTATTTTC
GAGCCTTTTT TTACCACAAA ATCAGGCAGA AAGGGTACCG GGCTTGGGCT TTCTGTAGTT
CATGGCATTA TTTCAAGTTA TAATGGACAT ATAAGCGTAG TGAGCAGGCC TGAAAAAGGA
ACATCCTTCC GGGTTTATCT GCCGGTTTGT AATAAAAAGG CACTGACTGA CTCTGCCAGA
GCTGATGTAG CAAAAGGAAA AGGGTGTATT CTTTTTGTTG ACGACGAACT TGCAACCATC
CGGATCATGG AGAGAATGAT GACCAGGATA GGGTTTAAAA TACAATCATG CAGTTCACCG
TTACAAGCGC TTGAGCTTTT CAGAAAAAAT CCGGAAACCT TTGATCTGGT CATAACCGAT
CTTACCATGC CCGAAATGAC AGGGATTGCT CTTGCCGGCG AATTACGAAA AATCAGTTCC
CGATTGCCAA TCATTCTGAT GACGGGATAT GGAGAGGAAA TTGAAACGAT GAGTTCGCTC
AGCCTGGTTG GCATCTGTAA GTTATTGAAA AAACCGGTTA ACATGGCTGA GCTGATTTCA
GCAGTCAAAG AGGTGATTTT ACATAAAAAA GCATAA
 
Protein sequence
MDNTSEEDQR QQLLRENARL KALLRSGDEQ PKALDGLNSS EDQCLHPGSL LTGGSVYQFS 
WKNKLQGPVT FVSSNIQQLL GYTSDEFTSG QISYGSLIYP DDLATFVEEL HRSIERNIDS
FEQEYRLRKK DGTVFRVCDY TIVLRDKKSN TLCYEGYIID ASTKTCFEPL FDTIDDFLFI
VDRDGLVIHS NEAVKNRLGY SLDELVGKNI EYFFGDDQQK EIHDKIEGLL FGRNTSFRVP
LLTRSGTAIP AETTIAKGNW NNRTVICCNS RDISDQIRQE QALIESERRF RDLTEMLPLP
LFEADVNGMV TYTNSQGVEA FGYTPEDLHR GVSVFKCCIP EESGIVSANF ESMKAGSRMS
TGNEYTALRK NNTTFPALLY STPIIRNGLF AGARAIVIDL TKLKKAESVL GNSRLQERMV
RELQSLIDNI PGAVYRVNSR NETTMLSMTG DFLLDYTREE FEKELFPSMA IIYPEDRDLV
LTSNQSLRSV KRSEALVYRI VTKNGSVRWV EDRKTSAFSP DGMFLGIDGI LFDITERIKA
EENKQLLESR LRKTQRLETI GTLAGGIAHD FNNILTPLLG YAEMGLSSLS SESPLYDYFS
EIIQASERAK NLIAQILTFS RPGESNPAVV SVQDIIAESL KLLRPSIPST ITIVQDLDFS
CRNILADPSQ IHQVIVNLCT NAFQAMEESG GVMTIGLREI TPDKALMAEF PELHEHESYL
QLSISDTGKG MDEKTMERIF EPFFTTKSGR KGTGLGLSVV HGIISSYNGH ISVVSRPEKG
TSFRVYLPVC NKKALTDSAR ADVAKGKGCI LFVDDELATI RIMERMMTRI GFKIQSCSSP
LQALELFRKN PETFDLVITD LTMPEMTGIA LAGELRKISS RLPIILMTGY GEEIETMSSL
SLVGICKLLK KPVNMAELIS AVKEVILHKK A