Gene Information |
Locus tag | Cpha266_1788 |
Symbol | |
ID | 4571150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2037132 |
End bp | 2040128 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766371 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_912229 |
Protein GI | 119357585 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG1352] Methylase of chemotaxis methyl-accepting proteins |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
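The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2037132, 2040128)
    print(length, "bp")                 # Gene Length field
    print(protein_length(length), "aa") # Protein Length field
```

Under these assumptions the values recorded for Cpha266_1788 are mutually consistent: the coordinates span 2997 bp, which corresponds to 999 codons, or 998 residues plus the stop codon.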
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA CACCAAACGC GAAGCCGCAA AAAATCAGGC ATGAAGAAGC CGTATCCATG AAAGCCGAAA AAGCATTTTT TCCTATTGTC GGAATAGGCG CTTCAGCCGG CGGACTGGAA GCTCTTGAAA GTTTTCTGAA AAAAGTTCCT TTCCCGTGCG GCATCTCCTT TGTGATTGTT CAGCATCTTG ACCCTACACA CAAATGCATC CTGGTAGAAC TGCTTCAGCG AGTTACCAGC ATGCCGGTTG TCGAGGTCGC CGACCGCATG AAAATCGAAA TCAATCACGT CTATGCTATT CCGCCCAACA AGTCCATGAC AATACTGCAC GGAGTGCTGC ATTTATTTGA CCCGACAGAG CCTCGCGGCC TCCGATTGCC GATTGATCTC TTTTTCCGTT CGCTGGCTGA CGACCTTCAA CAGCACAGCA TAGGCGTGAT ACTCTCCGGT ATGGGTTCCG ACGGCACGCT CGGACTGCGG ACGATAAAGG AAAAAGGTGG CTGCGTTTTC GTTCAGGATC CGAAATCCGC CAAGTTTGAC GGCATGCCAC AGAGCGCTAT TGATGCCGGT CTGGCCGATA TCATTGCTCC GGTTGAAGAT CTGCCGTACA GAATTCTTGC CTATCTCAAG CATATTCCCT CCATACGACA AGATAACAGC CACCTTGAAG ATAAGACGCT CAGTGGTTTG GAAAAAATAG TGCTTCTCTT GCGAAGAGAT ACCGGTCAGG ATTTTTCCCT CTATAAAAAA AACACCCTCT ATCGCCGGAT AGAACGGCGC ATGGGCATTC ACCAGATTGA AAAAATTGCC GATTATGTCC GATTCCTTCA AGGAAATCCT CATGAAACAA CACTGCTCTT CAAGGAGCTC CTGATCGGCG TTACCGGTTT TTTTCGTGAC CCGGCAGCCT GGGAGACACT GAAAACCAGA GCAATCCCCA CTCTTCTCGC CTCACGACAA GCCGACAGTA CTCTTCGTGC CTGGGTGGCA GGCTGTTCAA CCGGAGAGGA GGCCTACTCA CTTGCCATAG CTTTCATCGA AGCCGTTGAG CTGATACGTC CCCGCAGTGA TTTCAGGCTC CGGATATTTG CAACCGATCT GGATAAAGAT GCCATCGAAA AAGCCCGTTC AGGTATCTAT CCGCCAAACA TCGCATCAGA CCTCTCCAAA GAGAGGCTGC AACGTTTTTT CGAACAAGAT GAACATGGGT TCAGAATATC GAAAGAGATA CGGGAGACAA TAGTCTTTGC GCCTCACAAT ATCATCATGG ACCCGCCCTT CACCAAACTC GATATCATTA CCTGCCGCAA CCTTCTGATC TACATGGAGC AGGAGATACA GAAAAAACTG CTCCCGCTGT TTCATTACAG TCTCAATCCC GGCGGTATTC TTTTTCTTGG AAACGCCGAA AGCATCGGAT CCTTCAGCGA TTTGTTTGAC CCTCTTGAGG TTAAAACGCG ACTTTTCCGC AAGCTTCACA AGGAGTCACA ACAAGACCCT GTTATTTTTC CTGCTTTTTT TACTCATTCC GAGAACGAAA CCTCCGTTAT TATGAACGAC AGACAGAAAA AGCCAAATCC TGTCGTCAAC CTGCAGTCGC TTGCCGACCA GATCATCCTT CAACACTATG CTCCATCTGC GGTATTAACC AATGACAGGG GGGATATCAT CTATATCAGC GGACGCACAG GCAGGTACCT TGAGCCAGCG GCAGGCAAAG CCAACTGGAA CATTTTGGCA ATGGCTCGCG AAGGTCTTCG CTATGAACTG AATCTGCTTT TCAGCAGTGT GCTGCGCACG 
AAACAAACAT CAACAAAGAA GGGACTCTGT GTCGGCACAA ACGGCGGAAC GCAGATCGTG AATGTGACAA TCGAACCGCT TGAAAAACCG GAACTGCTCC GACGTTTGCT TCTTATTGTT TTTACGCCGG TCGAAAAATC CAAAAGCGAA ACCTCGAAGG ATAATCCCCT GCATATCAGC AGCGGAAACA ATATCCTTGC ATCACTTGAA GAGGATCTCA GGGTGGCTCG CGACGAGATC ATGACCATTC GGGAAGAGAT GCAGACATCG CAGGAAGAAC TTAAATCGAC AAATGAAGAG ATGCAGTCCG CCAACGAAGA GCTGCAGAGC ACGAACGAAG AGCTGACCAC ATCCAAAGAG GAGATGCAGT CACTCAACGA AGAGCTGCAG ACGGTTAACC ACGAGCTGCA GTCAAAGGTA AGTGAGCTGT CCGAGGCAAA CAACGACATG AAAAACCTCC TGAACAGCAC AGATATTGCG ACACTGTTTC TTGACGATTC ACTCAACATC CGAAGGTTTA CCACCAGAAC CGCAAGCATC ATCAAACTGA TTGCAAGCGA TATAGGGCGC CCGATTACCG ACATAGTAAC CGACCTGCAC TATCCAGCCC TTGCCGATGA CGCCCAAGAG GTACTGCGTA CCCTTATTTT CCGTGAAAAG CAGGTGTCAG CAAATAACGA CCGGTGGTTT TCCGTAAAAA TCATGCCCTA CCGGACACAG GAAAACAAGA TCGTCGGGTT GGTAATAACC TTCAGTGACA TCACTACCTC GAAAAAACTC GAAGCCTGTT TGCGTGAAAG TGAAGAACGG TTCCGATTTC TGTTTGAAAC AATGCCTGAA GGAGCACTGC TCCAGGATTC TGAAGGAAAA ATTCTGATGG CCAATCACGA GGCGGAACGC ATTTTCGGAC TCAGCAGTGA AGCAATGAAA AACAAAAAGA CAGAAGAACT GCAGAGAGCG TTCGTTCAGA AAGACGGATC GGCTTTTCCT CCCGAAAAGT ATCCATACCT CGTTGCATTG GATTCAGGAA AAACATGCAG CGGTGTAGTC ATGGGAATTA TGCTGCCGGC AAGCCAAACC TGCCGATGGA TCAAGGTTAG TGCTCTGCCT CGTTTCCATG AAAACACAGA AAAACCCTAT CAGGTGTACA CAACGTTTGT CGAAATCACC TTGCCCAAAG GGAATCACTC CGAATAA
|
Protein sequence | MKKTPNAKPQ KIRHEEAVSM KAEKAFFPIV GIGASAGGLE ALESFLKKVP FPCGISFVIV QHLDPTHKCI LVELLQRVTS MPVVEVADRM KIEINHVYAI PPNKSMTILH GVLHLFDPTE PRGLRLPIDL FFRSLADDLQ QHSIGVILSG MGSDGTLGLR TIKEKGGCVF VQDPKSAKFD GMPQSAIDAG LADIIAPVED LPYRILAYLK HIPSIRQDNS HLEDKTLSGL EKIVLLLRRD TGQDFSLYKK NTLYRRIERR MGIHQIEKIA DYVRFLQGNP HETTLLFKEL LIGVTGFFRD PAAWETLKTR AIPTLLASRQ ADSTLRAWVA GCSTGEEAYS LAIAFIEAVE LIRPRSDFRL RIFATDLDKD AIEKARSGIY PPNIASDLSK ERLQRFFEQD EHGFRISKEI RETIVFAPHN IIMDPPFTKL DIITCRNLLI YMEQEIQKKL LPLFHYSLNP GGILFLGNAE SIGSFSDLFD PLEVKTRLFR KLHKESQQDP VIFPAFFTHS ENETSVIMND RQKKPNPVVN LQSLADQIIL QHYAPSAVLT NDRGDIIYIS GRTGRYLEPA AGKANWNILA MAREGLRYEL NLLFSSVLRT KQTSTKKGLC VGTNGGTQIV NVTIEPLEKP ELLRRLLLIV FTPVEKSKSE TSKDNPLHIS SGNNILASLE EDLRVARDEI MTIREEMQTS QEELKSTNEE MQSANEELQS TNEELTTSKE EMQSLNEELQ TVNHELQSKV SELSEANNDM KNLLNSTDIA TLFLDDSLNI RRFTTRTASI IKLIASDIGR PITDIVTDLH YPALADDAQE VLRTLIFREK QVSANNDRWF SVKIMPYRTQ ENKIVGLVIT FSDITTSKKL EACLRESEER FRFLFETMPE GALLQDSEGK ILMANHEAER IFGLSSEAMK NKKTEELQRA FVQKDGSAFP PEKYPYLVAL DSGKTCSGVV MGIMLPASQT CRWIKVSALP RFHENTEKPY QVYTTFVEIT LPKGNHSE
|
| |
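The protein sequence can be derived from the gene sequence by reading codons in frame from the first base. The sketch below is a minimal illustration rather than the database's own pipeline. It uses only the Python standard library, builds the codon table from the standard amino-acid string (translation table 11 assigns the same amino acids to codons as the standard table and differs in the permitted start codons), ignores the spacing used to group the sequence into blocks of ten, and stops at the first stop codon. The name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a plus-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence, spaced as in the record.
    print(translate("ATGAAGAAAA CACCAAACGC G"))
    # Last 21 bases of the gene sequence, including the TAA stop codon.
    print(translate("AAAGGGAATCACTCCGAATAA"))
```

Applied to the first seven codons of the gene sequence this yields `MKKTPNA`, and applied to the last seven it yields `KGNHSE`, matching the start and end of the protein sequence in the record. A sequence containing ambiguity codes such as `N` would raise a `KeyError` in this sketch.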