Gene Cpha266_1788 details

Gene Information

Locus tag: Cpha266_1788
Symbol:
ID: 4571150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 2037132
End bp: 2040128
Gene length: 2997 bp
Protein length: 998 aa
Translation table: 11
GC content: 49%
IMG OID: 639766371
Product: putative PAS/PAC sensor protein
Protein accession: YP_912229
Protein GI: 119357585
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG1352] Methylase of chemotaxis methyl-accepting proteins
TIGRFAM ID: [TIGR00229] PAS domain S-box
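
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 2037132
END_BP = 2040128
GENE_LENGTH_BP = 2997
PROTEIN_LENGTH_AA = 998


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2040128 - 2037132 + 1 = 2997 bp, and 2997 / 3 - 1 = 998 aa.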


Plasmid Coverage information

Number of covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Number of covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAGAAAA CACCAAACGC GAAGCCGCAA AAAATCAGGC ATGAAGAAGC CGTATCCATG 
AAAGCCGAAA AAGCATTTTT TCCTATTGTC GGAATAGGCG CTTCAGCCGG CGGACTGGAA
GCTCTTGAAA GTTTTCTGAA AAAAGTTCCT TTCCCGTGCG GCATCTCCTT TGTGATTGTT
CAGCATCTTG ACCCTACACA CAAATGCATC CTGGTAGAAC TGCTTCAGCG AGTTACCAGC
ATGCCGGTTG TCGAGGTCGC CGACCGCATG AAAATCGAAA TCAATCACGT CTATGCTATT
CCGCCCAACA AGTCCATGAC AATACTGCAC GGAGTGCTGC ATTTATTTGA CCCGACAGAG
CCTCGCGGCC TCCGATTGCC GATTGATCTC TTTTTCCGTT CGCTGGCTGA CGACCTTCAA
CAGCACAGCA TAGGCGTGAT ACTCTCCGGT ATGGGTTCCG ACGGCACGCT CGGACTGCGG
ACGATAAAGG AAAAAGGTGG CTGCGTTTTC GTTCAGGATC CGAAATCCGC CAAGTTTGAC
GGCATGCCAC AGAGCGCTAT TGATGCCGGT CTGGCCGATA TCATTGCTCC GGTTGAAGAT
CTGCCGTACA GAATTCTTGC CTATCTCAAG CATATTCCCT CCATACGACA AGATAACAGC
CACCTTGAAG ATAAGACGCT CAGTGGTTTG GAAAAAATAG TGCTTCTCTT GCGAAGAGAT
ACCGGTCAGG ATTTTTCCCT CTATAAAAAA AACACCCTCT ATCGCCGGAT AGAACGGCGC
ATGGGCATTC ACCAGATTGA AAAAATTGCC GATTATGTCC GATTCCTTCA AGGAAATCCT
CATGAAACAA CACTGCTCTT CAAGGAGCTC CTGATCGGCG TTACCGGTTT TTTTCGTGAC
CCGGCAGCCT GGGAGACACT GAAAACCAGA GCAATCCCCA CTCTTCTCGC CTCACGACAA
GCCGACAGTA CTCTTCGTGC CTGGGTGGCA GGCTGTTCAA CCGGAGAGGA GGCCTACTCA
CTTGCCATAG CTTTCATCGA AGCCGTTGAG CTGATACGTC CCCGCAGTGA TTTCAGGCTC
CGGATATTTG CAACCGATCT GGATAAAGAT GCCATCGAAA AAGCCCGTTC AGGTATCTAT
CCGCCAAACA TCGCATCAGA CCTCTCCAAA GAGAGGCTGC AACGTTTTTT CGAACAAGAT
GAACATGGGT TCAGAATATC GAAAGAGATA CGGGAGACAA TAGTCTTTGC GCCTCACAAT
ATCATCATGG ACCCGCCCTT CACCAAACTC GATATCATTA CCTGCCGCAA CCTTCTGATC
TACATGGAGC AGGAGATACA GAAAAAACTG CTCCCGCTGT TTCATTACAG TCTCAATCCC
GGCGGTATTC TTTTTCTTGG AAACGCCGAA AGCATCGGAT CCTTCAGCGA TTTGTTTGAC
CCTCTTGAGG TTAAAACGCG ACTTTTCCGC AAGCTTCACA AGGAGTCACA ACAAGACCCT
GTTATTTTTC CTGCTTTTTT TACTCATTCC GAGAACGAAA CCTCCGTTAT TATGAACGAC
AGACAGAAAA AGCCAAATCC TGTCGTCAAC CTGCAGTCGC TTGCCGACCA GATCATCCTT
CAACACTATG CTCCATCTGC GGTATTAACC AATGACAGGG GGGATATCAT CTATATCAGC
GGACGCACAG GCAGGTACCT TGAGCCAGCG GCAGGCAAAG CCAACTGGAA CATTTTGGCA
ATGGCTCGCG AAGGTCTTCG CTATGAACTG AATCTGCTTT TCAGCAGTGT GCTGCGCACG
AAACAAACAT CAACAAAGAA GGGACTCTGT GTCGGCACAA ACGGCGGAAC GCAGATCGTG
AATGTGACAA TCGAACCGCT TGAAAAACCG GAACTGCTCC GACGTTTGCT TCTTATTGTT
TTTACGCCGG TCGAAAAATC CAAAAGCGAA ACCTCGAAGG ATAATCCCCT GCATATCAGC
AGCGGAAACA ATATCCTTGC ATCACTTGAA GAGGATCTCA GGGTGGCTCG CGACGAGATC
ATGACCATTC GGGAAGAGAT GCAGACATCG CAGGAAGAAC TTAAATCGAC AAATGAAGAG
ATGCAGTCCG CCAACGAAGA GCTGCAGAGC ACGAACGAAG AGCTGACCAC ATCCAAAGAG
GAGATGCAGT CACTCAACGA AGAGCTGCAG ACGGTTAACC ACGAGCTGCA GTCAAAGGTA
AGTGAGCTGT CCGAGGCAAA CAACGACATG AAAAACCTCC TGAACAGCAC AGATATTGCG
ACACTGTTTC TTGACGATTC ACTCAACATC CGAAGGTTTA CCACCAGAAC CGCAAGCATC
ATCAAACTGA TTGCAAGCGA TATAGGGCGC CCGATTACCG ACATAGTAAC CGACCTGCAC
TATCCAGCCC TTGCCGATGA CGCCCAAGAG GTACTGCGTA CCCTTATTTT CCGTGAAAAG
CAGGTGTCAG CAAATAACGA CCGGTGGTTT TCCGTAAAAA TCATGCCCTA CCGGACACAG
GAAAACAAGA TCGTCGGGTT GGTAATAACC TTCAGTGACA TCACTACCTC GAAAAAACTC
GAAGCCTGTT TGCGTGAAAG TGAAGAACGG TTCCGATTTC TGTTTGAAAC AATGCCTGAA
GGAGCACTGC TCCAGGATTC TGAAGGAAAA ATTCTGATGG CCAATCACGA GGCGGAACGC
ATTTTCGGAC TCAGCAGTGA AGCAATGAAA AACAAAAAGA CAGAAGAACT GCAGAGAGCG
TTCGTTCAGA AAGACGGATC GGCTTTTCCT CCCGAAAAGT ATCCATACCT CGTTGCATTG
GATTCAGGAA AAACATGCAG CGGTGTAGTC ATGGGAATTA TGCTGCCGGC AAGCCAAACC
TGCCGATGGA TCAAGGTTAG TGCTCTGCCT CGTTTCCATG AAAACACAGA AAAACCCTAT
CAGGTGTACA CAACGTTTGT CGAAATCACC TTGCCCAAAG GGAATCACTC CGAATAA
 
Protein sequence
MKKTPNAKPQ KIRHEEAVSM KAEKAFFPIV GIGASAGGLE ALESFLKKVP FPCGISFVIV 
QHLDPTHKCI LVELLQRVTS MPVVEVADRM KIEINHVYAI PPNKSMTILH GVLHLFDPTE
PRGLRLPIDL FFRSLADDLQ QHSIGVILSG MGSDGTLGLR TIKEKGGCVF VQDPKSAKFD
GMPQSAIDAG LADIIAPVED LPYRILAYLK HIPSIRQDNS HLEDKTLSGL EKIVLLLRRD
TGQDFSLYKK NTLYRRIERR MGIHQIEKIA DYVRFLQGNP HETTLLFKEL LIGVTGFFRD
PAAWETLKTR AIPTLLASRQ ADSTLRAWVA GCSTGEEAYS LAIAFIEAVE LIRPRSDFRL
RIFATDLDKD AIEKARSGIY PPNIASDLSK ERLQRFFEQD EHGFRISKEI RETIVFAPHN
IIMDPPFTKL DIITCRNLLI YMEQEIQKKL LPLFHYSLNP GGILFLGNAE SIGSFSDLFD
PLEVKTRLFR KLHKESQQDP VIFPAFFTHS ENETSVIMND RQKKPNPVVN LQSLADQIIL
QHYAPSAVLT NDRGDIIYIS GRTGRYLEPA AGKANWNILA MAREGLRYEL NLLFSSVLRT
KQTSTKKGLC VGTNGGTQIV NVTIEPLEKP ELLRRLLLIV FTPVEKSKSE TSKDNPLHIS
SGNNILASLE EDLRVARDEI MTIREEMQTS QEELKSTNEE MQSANEELQS TNEELTTSKE
EMQSLNEELQ TVNHELQSKV SELSEANNDM KNLLNSTDIA TLFLDDSLNI RRFTTRTASI
IKLIASDIGR PITDIVTDLH YPALADDAQE VLRTLIFREK QVSANNDRWF SVKIMPYRTQ
ENKIVGLVIT FSDITTSKKL EACLRESEER FRFLFETMPE GALLQDSEGK ILMANHEAER
IFGLSSEAMK NKKTEELQRA FVQKDGSAFP PEKYPYLVAL DSGKTCSGVV MGIMLPASQT
CRWIKVSALP RFHENTEKPY QVYTTFVEIT LPKGNHSE
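
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The sketch below shows one way to translate the nucleotide sequence and compute its GC fraction in plain Python. It is an illustration, not the database's own method: the helper names are hypothetical, and the codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to internal codons and differs only in which codons may act as starts.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon.

    Codons containing unexpected characters are reported as 'X'.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First display line of the gene sequence above (60 nt).
FIRST_LINE = "ATGAAGAAAA CACCAAACGC GAAGCCGCAA AAAATCAGGC ATGAAGAAGC CGTATCCATG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))
```

Applied to the first 60 nucleotides, this gives MKKTPNAKPQKIRHEEAVSM, which matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein and the reported GC content.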