Gene Information |
Locus tag | Cpha266_1787 |
Symbol | |
ID | 4571149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2033726 |
End bp | 2036521 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 639766370 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_912228 |
Protein GI | 119357584 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
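The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention explicitly. The helper names `span_length` and `expected_protein_length` are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2033726
end_bp = 2036521
gene_length_bp = 2796
protein_length_aa = 931


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one stop codon (assumed convention)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values with the ones listed in the record.
print(computed_length == gene_length_bp, computed_protein == protein_length_aa)
```

Under these assumptions both comparisons hold: the span covers 2796 bp, which corresponds to 932 codons, or 931 residues once the stop codon is excluded.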
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATA CTTCAGAAGA AGACCAGCGT CAACAATTGC TGAGGGAAAA TGCCCGATTA AAAGCGCTTC TGCGTTCCGG GGATGAACAG CCGAAAGCTC TCGATGGGCT GAATTCTTCG GAAGATCAGT GCCTCCATCC CGGAAGCTTG CTGACAGGCG GTTCTGTTTA TCAGTTCAGT TGGAAAAACA AGCTGCAAGG GCCAGTTACC TTTGTTTCTT CAAATATTCA GCAATTGCTC GGGTATACTT CCGATGAGTT TACCAGTGGT CAAATCAGTT ATGGTTCACT GATTTATCCC GATGATCTGG CGACTTTTGT TGAAGAGCTT CACAGGAGTA TTGAACGGAA TATCGATTCT TTTGAGCAGG AATACCGACT CAGGAAAAAG GACGGCACGG TGTTCCGGGT TTGTGATTAT ACAATTGTCT TACGCGATAA AAAAAGTAAT ACGCTTTGCT ATGAGGGATA TATCATAGAT GCGTCAACGA AAACATGCTT TGAACCTCTG TTTGATACGA TTGACGATTT TCTGTTCATT GTCGATAGGG ATGGTTTGGT TATTCACTCG AATGAGGCCG TAAAAAATCG GTTGGGATAC TCTTTGGATG AGTTGGTTGG AAAAAATATA GAGTATTTTT TCGGTGACGA TCAACAGAAA GAGATACATG ATAAAATCGA AGGTCTGCTT TTTGGCCGCA ATACCTCTTT TCGGGTTCCT CTTTTGACAA GATCCGGAAC GGCAATTCCT GCCGAAACCA CAATCGCCAA AGGTAACTGG AACAACAGAA CGGTTATATG TTGCAACAGC CGAGATATTT CTGATCAGAT CCGACAGGAA CAGGCTTTGA TTGAAAGCGA GAGACGGTTC AGAGACTTGA CCGAAATGTT GCCGCTTCCA TTGTTTGAAG CCGATGTAAA TGGTATGGTT ACCTATACCA ATAGTCAAGG TGTTGAGGCT TTTGGATACA CCCCTGAAGA TTTGCATCGG GGTGTTTCGG TATTCAAATG CTGTATTCCT GAAGAGTCGG GAATCGTTAG CGCCAATTTT GAGAGCATGA AAGCCGGAAG CCGGATGTCA ACCGGTAACG AATATACTGC CCTCAGGAAA AACAACACTA CGTTTCCGGC TCTGCTTTAC AGTACTCCGA TTATTCGGAA TGGTTTGTTT GCAGGCGCTC GCGCTATCGT TATTGACCTT ACGAAGCTGA AAAAAGCAGA GTCAGTGCTT GGAAACAGTC GTTTGCAGGA GAGGATGGTC AGGGAGTTGC AATCGCTGAT TGATAATATT CCCGGAGCTG TTTATCGCGT TAACAGCAGG AACGAGACAA CGATGCTCTC CATGACAGGC GATTTTTTGC TGGATTATAC CCGGGAGGAG TTTGAAAAAG AGCTGTTTCC TTCCATGGCC ATTATTTATC CGGAAGATCG AGATCTGGTG TTAACATCAA ATCAGTCACT CAGATCGGTA AAACGATCCG AAGCCCTCGT CTATCGTATT GTTACGAAAA ACGGTTCTGT CCGATGGGTT GAAGATCGAA AAACATCTGC ATTTTCCCCT GACGGCATGT TTTTGGGGAT AGATGGTATT TTGTTTGATA TTACAGAACG AATCAAGGCA GAGGAGAATA AACAACTCCT TGAATCACGA CTCCGGAAAA CGCAGCGTCT TGAAACTATC GGGACGCTTG CCGGCGGAAT TGCCCATGAT TTTAATAACA TTCTTACCCC GCTTCTTGGC TATGCCGAAA TGGGGTTGAG CAGTTTGTCG AGTGAAAGTC CGCTTTACGA CTATTTCAGC 
GAAATCATTC AGGCATCTGA AAGGGCAAAG AATCTCATCG CTCAAATTCT GACGTTCAGC AGGCCAGGAG AGAGCAATCC CGCAGTCGTG AGTGTTCAGG ATATTATTGC CGAGTCGTTG AAGCTACTGC GTCCATCGAT CCCTTCGACA ATTACAATTG TACAGGATCT TGATTTTTCC TGTCGTAATA TTCTTGCCGA TCCATCGCAG ATACATCAGG TGATCGTCAA TCTCTGCACC AATGCGTTCC AGGCAATGGA GGAGTCCGGA GGCGTGATGA CGATAGGCCT CAGGGAGATA ACGCCGGATA AAGCTCTGAT GGCGGAATTT CCCGAACTGC ATGAGCATGA AAGCTATCTG CAGCTCAGTA TTTCAGATAC CGGAAAAGGT ATGGATGAAA AAACCATGGA GCGTATTTTC GAGCCTTTTT TTACCACAAA ATCAGGCAGA AAGGGTACCG GGCTTGGGCT TTCTGTAGTT CATGGCATTA TTTCAAGTTA TAATGGACAT ATAAGCGTAG TGAGCAGGCC TGAAAAAGGA ACATCCTTCC GGGTTTATCT GCCGGTTTGT AATAAAAAGG CACTGACTGA CTCTGCCAGA GCTGATGTAG CAAAAGGAAA AGGGTGTATT CTTTTTGTTG ACGACGAACT TGCAACCATC CGGATCATGG AGAGAATGAT GACCAGGATA GGGTTTAAAA TACAATCATG CAGTTCACCG TTACAAGCGC TTGAGCTTTT CAGAAAAAAT CCGGAAACCT TTGATCTGGT CATAACCGAT CTTACCATGC CCGAAATGAC AGGGATTGCT CTTGCCGGCG AATTACGAAA AATCAGTTCC CGATTGCCAA TCATTCTGAT GACGGGATAT GGAGAGGAAA TTGAAACGAT GAGTTCGCTC AGCCTGGTTG GCATCTGTAA GTTATTGAAA AAACCGGTTA ACATGGCTGA GCTGATTTCA GCAGTCAAAG AGGTGATTTT ACATAAAAAA GCATAA
|
Protein sequence | MDNTSEEDQR QQLLRENARL KALLRSGDEQ PKALDGLNSS EDQCLHPGSL LTGGSVYQFS WKNKLQGPVT FVSSNIQQLL GYTSDEFTSG QISYGSLIYP DDLATFVEEL HRSIERNIDS FEQEYRLRKK DGTVFRVCDY TIVLRDKKSN TLCYEGYIID ASTKTCFEPL FDTIDDFLFI VDRDGLVIHS NEAVKNRLGY SLDELVGKNI EYFFGDDQQK EIHDKIEGLL FGRNTSFRVP LLTRSGTAIP AETTIAKGNW NNRTVICCNS RDISDQIRQE QALIESERRF RDLTEMLPLP LFEADVNGMV TYTNSQGVEA FGYTPEDLHR GVSVFKCCIP EESGIVSANF ESMKAGSRMS TGNEYTALRK NNTTFPALLY STPIIRNGLF AGARAIVIDL TKLKKAESVL GNSRLQERMV RELQSLIDNI PGAVYRVNSR NETTMLSMTG DFLLDYTREE FEKELFPSMA IIYPEDRDLV LTSNQSLRSV KRSEALVYRI VTKNGSVRWV EDRKTSAFSP DGMFLGIDGI LFDITERIKA EENKQLLESR LRKTQRLETI GTLAGGIAHD FNNILTPLLG YAEMGLSSLS SESPLYDYFS EIIQASERAK NLIAQILTFS RPGESNPAVV SVQDIIAESL KLLRPSIPST ITIVQDLDFS CRNILADPSQ IHQVIVNLCT NAFQAMEESG GVMTIGLREI TPDKALMAEF PELHEHESYL QLSISDTGKG MDEKTMERIF EPFFTTKSGR KGTGLGLSVV HGIISSYNGH ISVVSRPEKG TSFRVYLPVC NKKALTDSAR ADVAKGKGCI LFVDDELATI RIMERMMTRI GFKIQSCSSP LQALELFRKN PETFDLVITD LTMPEMTGIA LAGELRKISS RLPIILMTGY GEEIETMSSL SLVGICKLLK KPVNMAELIS AVKEVILHKK A
|