Gene CHU_1901 details


Gene Information

Locus tag: CHU_1901
Symbol: arcB
ID: 4187119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cytophaga hutchinsonii ATCC 33406
Kingdom: Bacteria
Replicon accession: NC_008255
Strand:
Start bp: 2220281
End bp: 2223076
Gene Length: 2796 bp
Protein Length: 931 aa
Translation table: 11
GC content: 39%
IMG OID: 638071899
Product: two-component sensor histidine kinase
Protein accession: YP_678509
Protein GI: 110638300
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
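
The coordinate and length fields in this record can be cross-checked with a short Python sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values copied from the record.
length_bp = gene_length(2220281, 2223076)
print(length_bp)                  # 2796
print(protein_length(length_bp))  # 931
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.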


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0313083
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0871787
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAACT TACAACTTGA CTTTGAACAA GCCAGGACAC GGCATCTATT ATTTAAAACA 
AAGCTAAGGT CTGTTTTATA TGGATCCGAA ATTGACACAA AGTCTGTTAT CTCCTATGCT
GACTGTGCAG TAAGTAAATG GATCTATGAG CATGCCTTAA AAGCTTACGG ACATATACCT
GAGATGCATC AGCTGAAACT TGTGCATATA AACCTTCATG AATCTGCGCA AGAGCTTGTA
CGTCTTTATA AAGATGGTAA AATTGATAAA GCACGCAGAG GCTTGTCAAA CCTGGAATTG
ATCGCAGATC ATTTTGCTGC TTTGCTCACA GTTGTTGAAT TAAAATTACA ACTCAGCAAT
TATCCGGCTG CTGCTATCCG CGACAACGAA GCTGTATTAA ACAATACACA TAAAGAACTT
TTAGAACTTC ACACCAGCCT GCATGAGCTG GATGCCCGCA TCCGTAAACA AACAGATGAA
CTGGTCAGTA CACAACGCGG CGTTGAGAAC AAACTACGCA ACCATTTCAG TCACGCTCCT
GTAGCAATCT GTATCCTGCG GGGGGCTGAG TTTGTATTAG AACTTGCTAA TGATATGTAT
TTGCAGCACA TAGATAAAAC AGCGGATATT GTTGGTAAAC GTTTATTTGA AGAATTACCT
GAACTGGAAA AACAGGGAGT GCGTGAAATG CTGAATAGTG TGCTGGATAC GGGAAATCCC
TTTGTAGGCA CAGATGTAGA AATACGTATG AAACGCAATG GCATCAATGA AACATTGTAT
TTTAATTTTG TATATAAACC GCTGGACGAT CATGAAACCA TACCTGGTAT TATGATTGTA
TGCTCTGAAG TTACGGATCA GGTAATTGCT AAAAAGGCAA TGATTGAAAA TCAGCAACGC
CTGAATATTG CTATTGAAGC TACAGCTTTA GGCACATATG AAGTAAATCT GAAGACAGCC
GAGCTTGTCT ATTCAGACAG ATACCTGGAA ATTTTTGGAT ACGCACCACA CGAAAAACCT
GGTCATAAAG AATTAGTTAA ATGTATACAT GAGGATGATG TACACATACG TGAGCAGGCC
TTTGCAATAG CATTTGAAAC TTCTAAGCTG TTTTATGAAG TACGTATCAT ACTCAGAGAC
CAGTCTGTCC GATGGTTAAG GGCTTTCGGG AAAGTTTTTT TTGATGATAA GGGAACTCCT
GAAAAAATGC TTGGAACTGT AATGGACATT ACGAAAGAAA AAGCAGCAGA AGAAAAAATA
CGCAAGGCTA ATGAAGTGCT TGAAATTGCA CTGCAATCTG CCAAACTAGG AACATATGAA
TTAAACATCA AAAGCGGGAA AGCAAATTTC AGTTTACTTT GTAAAGAAAT TTTTGGTTTT
GAGCCGGATA AAGATGTTAC CCTGGAAGAT GTGCGTAGTT CAACGCATCC TGACGATAAA
GAAATCACCC GCTCCTACAT GTTAGAAGCA CTTGCCAATA AACAGAATTA CGATGCTGAA
TATCGCATTA TAACAGCCGA CAAGTCTATA CGCTGGATCT CTATTTCAGG AAAAGGCGTC
TATGATGCAG AAGGCAATGT CAAAAAACTG ATTGGAGTAA TCGAAAATAT TACCGACCGC
AAACATGCGG AAGATGAATT AAATATAAGC GTGCAGAAAT TCAGATTACT TGCTGACTCC
ATGCCGCAAT TTGTATGGAC CGGTGATACA GCGGGTAATC TGAATTACTT TAACCAATCT
GTTTTTGATT ATACGGGGCT CACACCCGCG CAGATATATT TAGAGGGCTG GCTCCAGATC
GTTCATCCCG ATGACCGTGA AGAGAATATC GAAAAATGGA TGCAGGCTAT ACAAACCGGT
AACGATTTTA TATTCGAACA CCGCTTTAAA AAAAATACAG GTGAATACCG CTGGCAATTA
AGCCGTGCTA TTGCACAACG CAATAAAACC GGTGAAATAC AGATGTGGGT CGGAACCAGC
ACAGATATAC AGGATCAAAA AACATCGGCA CAGAAATTAG AAGAGTTGAT CAAGGAACGT
ACAAAAGAGT TAAAGAATGC AAATATTGAA CTGGAAAGTA TGAATCAGGA ACTACGCTCC
TTTACATACA TTTCAAGCCA TGATTTGCAG GAACCTTTAC GAAAGATTCA AACATTCATC
AGCAGAATCC AGAATAGCGA TAGTGGAACG CTATCGAAAG AAGGCGCAAA TTATTTTGCC
CGCATTCAGC AATCAGCAAA TAAAATGAAA ACACTGATCA ACGATCTGTT AACATACTCA
AGAACCAGCG CAACTGAAAA AGTATTTGAA AAGACCAATC TTAATCTGTT GCTGCATGAA
ATTAAAACGG AATTTACAGA AGTATTGAAA GAGAAAAACG GCTCCCTGGA AATTTCAAAC
CTTCCCGAAA TTAATGCAAT TCCATTTCAG TTAAGACAAT TATTTATAAA CTTAATTTCA
AACGCCATTA AATTTTCCAG GCCATCTGTC CCTCCGGTTA TAAAAATCAG TTCTGCTATA
CTGCTGGGCA GGGACACAGA TAATGCCAAT GCGCACGAAA ATGAATTGTA TCATCAGATC
ATAGTAAGCG ATAACGGTAT CGGTTTTGAC CCTTCCTATA AGGATAAAAT TTTCGAGGTA
TTTCAACGCC TGCATCCCAA AACCGAATAC GAAGGTACAG GTATCGGGCT TTCTATTTGC
ACTAAAATTG CCCAAAACCA CAATGGCTTT ATCAGCGCTT CGGGAGAATT AAATAAAGGT
GCTGCGTTTA CAATCTACTT ACCGTTATTG AAGTAA
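
The gene sequence is printed in blocks of 10 bases, 60 per line, so it has to be joined before its length or base composition can be checked against the record. The Python sketch below shows one way to do that. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTATAACT TACAACTTGA CTTTGAACAA GCCAGGACAC GGCATCTATT ATTTAAAACA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60 bp window only
```

Passing the full gene sequence to clean_sequence gives a string whose length can be compared with the Gene Length field, and gc_fraction of that string can be compared with the reported GC content.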
 
Protein sequence
MYNLQLDFEQ ARTRHLLFKT KLRSVLYGSE IDTKSVISYA DCAVSKWIYE HALKAYGHIP 
EMHQLKLVHI NLHESAQELV RLYKDGKIDK ARRGLSNLEL IADHFAALLT VVELKLQLSN
YPAAAIRDNE AVLNNTHKEL LELHTSLHEL DARIRKQTDE LVSTQRGVEN KLRNHFSHAP
VAICILRGAE FVLELANDMY LQHIDKTADI VGKRLFEELP ELEKQGVREM LNSVLDTGNP
FVGTDVEIRM KRNGINETLY FNFVYKPLDD HETIPGIMIV CSEVTDQVIA KKAMIENQQR
LNIAIEATAL GTYEVNLKTA ELVYSDRYLE IFGYAPHEKP GHKELVKCIH EDDVHIREQA
FAIAFETSKL FYEVRIILRD QSVRWLRAFG KVFFDDKGTP EKMLGTVMDI TKEKAAEEKI
RKANEVLEIA LQSAKLGTYE LNIKSGKANF SLLCKEIFGF EPDKDVTLED VRSSTHPDDK
EITRSYMLEA LANKQNYDAE YRIITADKSI RWISISGKGV YDAEGNVKKL IGVIENITDR
KHAEDELNIS VQKFRLLADS MPQFVWTGDT AGNLNYFNQS VFDYTGLTPA QIYLEGWLQI
VHPDDREENI EKWMQAIQTG NDFIFEHRFK KNTGEYRWQL SRAIAQRNKT GEIQMWVGTS
TDIQDQKTSA QKLEELIKER TKELKNANIE LESMNQELRS FTYISSHDLQ EPLRKIQTFI
SRIQNSDSGT LSKEGANYFA RIQQSANKMK TLINDLLTYS RTSATEKVFE KTNLNLLLHE
IKTEFTEVLK EKNGSLEISN LPEINAIPFQ LRQLFINLIS NAIKFSRPSV PPVIKISSAI
LLGRDTDNAN AHENELYHQI IVSDNGIGFD PSYKDKIFEV FQRLHPKTEY EGTGIGLSIC
TKIAQNHNGF ISASGELNKG AAFTIYLPLL K