Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0079 |
Symbol | |
ID | 4570646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 91430 |
End bp | 94393 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 639764681 |
Product | histidine kinase |
Protein accession | YP_910573 |
Protein GI | 119355929 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.217254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAAAT TTCGAACGAA AGCAAGGGCT GTTGATATGC TCGGCAGACA GCAGATAGCA GATGCATCAA CCGCTATCAG TGAGCTTTTC AAGAATGCTC ATGACGCTTA TGCAGACAAT GTGGAAGTTG ATTTGTTTAA ATCAGACTCT TTATTAGTAA TTCGCGATGA CGGAATTGGA ATGTCACCAT CAGAGTTTGA GGCAAATTGG CTGGTTTTGG GAACCGATAG TAAATTTTCT TCTGCGGGAA AGCTACACGC TTACCGTCCC TCGAACAAGC CTGAACGGGC GGTTATGGGA GAAAAAGGTA TCGGACGTCT TGCTATTGCA TTTCTTGGGC CTCAGGTACT GGTTCTTACA CGCTCAGAAA AATTAGATCA TAATGATACC CTCACCATGT GCTATTTGCA TTGGGGTTTA TTCGAACAAC AAGCTTTAAA TCTTGATGAT ATAGATATTC CAGTCAAATC CATTTCTGGT GGTGAGTTGC CATCACTGCA AGATGTATCC GAATTATTGC TGGAAAACAC CAGGAATGTT GAGCAACTTC AGAAACGATT CCCACAATGC GATTTATGCT CTATTATTGA TGATCTTTCA GACTTTCAAG TTGATCCAAG AGATTTCGAG AAATCCGTTC AAGGGCTTTC TCTGTCAGAT CACAAATGCG GAACTCACTT CTATATTGCA ACAGCAAATG AGGTAATCAT AGCTGATATC GTGGCAGAAA AATATACTCT TACAAAAGAA TTCACTAAAT GTCTCCTTGG TTTTTGTAAT TCTACCTTTG CTGAAACTTC CCCTCCTCCA ATCCAAACAA AATTCAGGTA TTGGCCAACC GATAACAGGT ATGAAGATCT TATTGCTCCC AATGAGTTTT TTACAACAGA CGACCTGGCT TTATCAGATC ATTTCGTAAG TGGTGAAATT GATGAGTACG GTCAGTTTAA TGGTATTGTT CGCGTCTATG ATCAAGAGTA TCCTGATCAT GTTATCTCTT GGAAAGAGGG AGGAGGAAAA CCAACAGAAT GTGGTCCATT TCGTGTAGAA TTTGGCTATC TGCAAGGAGC GCATCGAGAG AGCATGGCTG ACCCTGATGA TTGGTCTATA CTCGATAGCA AACTTAAGCA AATAGGCGGA TTGTATGTTT ACCGTGATCG TATCAGAATT CTTCCATATG GGAATTCCAA TGTTGATTGG CTTGATATTG AACTAAGGCG CAACAAGGGA ATGGGATATT ATTATTTCTC TTATAGGAGA ATTTACGGCG CTATTTGCAC AACAAGGAAG GAAAACGCTA CTCTACGGGA GAAAGCTGGT CGTGAAGGCT TTCAAAAAGA TAAAGCATAC AGACAACTCA AAAGTGTCTT GGAGAATTTA TTCATACAAT TGGCTGCTGA TTTTTTTCGT AAAGACGCAA CGCATGGTGA TTATTTTCAG GAGCGTAAAG ATGAATTGGA CCGGTTGGAG CGTGCACGTA GAAAGCGGGA ACAACAAATA ACGACAAAAC GTACCAAGCT TTCAGATACG CTCGATGTAT TTTTTTCTAA AGTAAACGAA AGAATTCCGG AATCTGAAAT TGCTACACTC AGTCACCATG TTCAAAGCAG GATGCAGAGT GCGTCTTTCA TGATGGATTC TGATACAGCA TCACAGGAAC TACTCGATGC AGAACGTGAA GCGAATGAAA AGCTTGCTTT ACTGAGAACG ACTTATACCT TGATTCGGCC TCGTGGCGTC GGATTAACAA AACAATTAAC AAGAGATTGG GATGCATATC AGCGGGAACA TAACAGGCTT GAAATAGAAG TATTTGAGCC GTTCGTTAAG GATGTTGGGA AAAAACTTGG CTCGATGGCA GCACAAGCCA AAATTTACAT TGATCAGCGT CGCCGTCTTC AGGAGCTTAT AAAGAATGTT GCTGACGAGC AAAATGCGAA TGTGAAAAAT GAATCAAAAG TACTTCAAGA AACATCAAGT CAGACACGTC GAGCAGCTGT TAACACCGCT CGTTCAGCTA TGAAAGAACT TCGCGATACT ATTGAAGCCG TCAATGCTGA TCTTGCCCAT AGAGATCTAA ACGACCTGTT ACCTGAGCAA ATTGAAGAAA TTCGTTCAAC TTATGAAAAA AGAATTGATT CAGTAGCATC ACGAAATGCT GAAACTTTGG GAAGTGTTCG AGAGTTGCTT GCCGGTATTG TGGAAAGTCT TGAAAATAAC ATGCAAAATA GTCAACTCGA CATAGTCGAA GCAATGGATA CAGAGCTTGA GTCGCTTCGT GAACAAGCGG AAACAGATGA AGAATTGGTG CAATTAGGTC TTGCCGTCGC AGTTATCAAT CATGAATTTG TAGCAGCGAT AAAGATGATT AGAGGTCAAT TGCGTGAACT CCGTTCTTGG GCTATGGTAA ACAAAGATTT ATTACCAGTC TACCAAGAGA TTCGAACAAA TTTTGATCAT TTGGATGCAC ATCTCAATTT GTTTACACCA CTACAGAGAA GACTTTACAA AAAACGAGTC AATATTGAAG GCAAAGAGAT TATTCATTAT GTTCGTGCAT TATTCAATGT TCGCTTTGAG CGGCATAAAA TTCATTTGGA AGCAACACAG GCATTTCTTG ATAGCCATGT AACAAGCTAT CCGTCAACAA TTTATCCTGT TTTTGTAAAC CTTATAGATA ACTTTATTTT TTGGCTTAAA GATAAACAAG GTGATCGTTT GATCTCGTTA GACTGTACAG ATAATTCATA TCATATAAAG AATAATGGCC CAGCAATAAA CCGACGAGAT GCTGAATCAA TTTTTGAGCA AGGATTTTCG CGAAAACCTG GAGGCCGTGG GTTGGGATTA TATATATCGA AAAAGGTACT GGAAAAAGAA GGCATGACAC TTGCTTTGGA CAAAACATTA ACTATGGATT CAGGTGTGAG CTTTAACTTA TCGTGGAGTG ACAATAATGA GTGA
|
Protein sequence | MAKFRTKARA VDMLGRQQIA DASTAISELF KNAHDAYADN VEVDLFKSDS LLVIRDDGIG MSPSEFEANW LVLGTDSKFS SAGKLHAYRP SNKPERAVMG EKGIGRLAIA FLGPQVLVLT RSEKLDHNDT LTMCYLHWGL FEQQALNLDD IDIPVKSISG GELPSLQDVS ELLLENTRNV EQLQKRFPQC DLCSIIDDLS DFQVDPRDFE KSVQGLSLSD HKCGTHFYIA TANEVIIADI VAEKYTLTKE FTKCLLGFCN STFAETSPPP IQTKFRYWPT DNRYEDLIAP NEFFTTDDLA LSDHFVSGEI DEYGQFNGIV RVYDQEYPDH VISWKEGGGK PTECGPFRVE FGYLQGAHRE SMADPDDWSI LDSKLKQIGG LYVYRDRIRI LPYGNSNVDW LDIELRRNKG MGYYYFSYRR IYGAICTTRK ENATLREKAG REGFQKDKAY RQLKSVLENL FIQLAADFFR KDATHGDYFQ ERKDELDRLE RARRKREQQI TTKRTKLSDT LDVFFSKVNE RIPESEIATL SHHVQSRMQS ASFMMDSDTA SQELLDAERE ANEKLALLRT TYTLIRPRGV GLTKQLTRDW DAYQREHNRL EIEVFEPFVK DVGKKLGSMA AQAKIYIDQR RRLQELIKNV ADEQNANVKN ESKVLQETSS QTRRAAVNTA RSAMKELRDT IEAVNADLAH RDLNDLLPEQ IEEIRSTYEK RIDSVASRNA ETLGSVRELL AGIVESLENN MQNSQLDIVE AMDTELESLR EQAETDEELV QLGLAVAVIN HEFVAAIKMI RGQLRELRSW AMVNKDLLPV YQEIRTNFDH LDAHLNLFTP LQRRLYKKRV NIEGKEIIHY VRALFNVRFE RHKIHLEATQ AFLDSHVTSY PSTIYPVFVN LIDNFIFWLK DKQGDRLISL DCTDNSYHIK NNGPAINRRD AESIFEQGFS RKPGGRGLGL YISKKVLEKE GMTLALDKTL TMDSGVSFNL SWSDNNE
|
| |