Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bpro_3898 |
Symbol | |
ID | 4013422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas sp. JS666 |
Kingdom | Bacteria |
Replicon accession | NC_007948 |
Strand | + |
Start bp | 4091078 |
End bp | 4094089 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637943549 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_550692 |
Protein GI | 91789740 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.504038 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0804481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGACA AAAACGTAGC GGACACCAAC TCCAACGCCG TCAACGTGCC GGGCGCCGCC GCCCCTGTCG GCGCGCCTCC CAGGGACCTG TTTACCGGCG GCGGCGAAAT GGGTGCGATC ATGCGCGCGA CGGACTGGTC CAAGACGAAG CTCGGCCCGA TTGAAGCCTG GCCGAACAGT CTCAGAACCA TGCTGGGCGT CGTACTCGGC AGCCGGTTTC CCATGCTGCT CTGGTGGGGC CCCGATCTCC TGCACCTCTA CAACGACGCA TACCGCCCCA TTCTGAGGGA CAAGCACCCC GCGTCGCTCG CCGCTCCAGC CGCGGAGATG TGGGCCGAGG TCTGGGACGT TGCCGGGCCC ATGGCAAGGA GCGTCCAGGA AGGAGGCCCG GCGACATGGA CGGAGGACCT TCAGCTGTTC ATTAACAGCG GGGCCATGGT CGAGGAAACG TACTTCACGT TTTCGTACAG CCCCGTCCCA GGCGATGACG GGCGCGTCGG CGGCCTGCTC AACACGGTGC AGGAAACGAC AGTGAAGGTG CAGGGCGAAC GCCAGATCCG GATGCTGCAC GAATTGGCCG CGCGGGCGGC CGAAGCAAAA TCCGAGAACG AAGCGTACCG GATTGCCGCG GAGGTCCTTT CGGCCAACGA ACTGGATCTC CCGTTTGTGC TGCTCTATGT TCTGAACGAG ACGGCTTCCG ACGCTGAACT CGTCGCCGTG AGCGGCTGGA AGGAGTATGA AGGCCGCGCC AGGCCCGCGC ATGTGCCGGT CAAGGAGGGT GCAAGTACAG CTTCCTGGCC CTTTGCTGAA GTCATTCGAA CCGCTCAGGA ACTCGTCGTC GACGACCTCT CGTCGCGCTT CGGGCCGCTG CCGGCGGGTC GCTGGAACGC ACGGCCCGAA CGAGCTATCG TCCTGCCGCT TTCGCGGACA GGCCAGTCCG AGCCTTATGG GTTTCTCGTT GCCGGCATCA GCCCGCACCG TGCATTCGAA GACCGGTATC GAAGGTTCTT TCGGGCTACC GCGGATCAGG TGGCGACCGT CATCGCCAAT GCCCGTGCCT ACGAGGCAGA AAAGACGCGG GCCGAGGCCC TGACCGAGAC CGATCGGGCC AAGACCGCCT TCTTCAGCAA CGTCAGCCAC GAGTTCAGGA CCCCCCTCAC GCTCCTGCTC GGACCGCTGG AGACGCTGCT TGCCGAGCGC GAGCTTGGTG CCGAGGCGCG GGATCGGCTG CTCCAGATGC AGCGCAACGC GTTGCGCCTG CTGCGCCTGG TGAATGCGCT GCTCGATTTC TCGCGCATGG AGGCCGGGCG GCACTCCGCG CGGTTCGCGC CGACCGACCT CGCACGCTAC ACCGCGGATC TGGCGAGCGC GTTTCGCTCG GCCATGGAAA AAGCCGGGCT CTCCTTTGCG GTGGAATGCC CGCCGTTGCC GGCGCCGATC TACGTCGACC GCGACATGTG GGAGAAGATC GTCATGAATC TCCTCTCCAA TGCGCTCAAG TTCACTTTCG AGGGCGGGGT GAGCCTGCGC CTTGTTGCGG CGGATGGCGG CGCGCACCTG ACGGTGAAGG ACACGGGCTC GGGCATCCCC AGGAGCGAAC TGCCGAAACT CTTTCAGCGG TTCCACCGGG TGGAGGGCGC GCGCGCTCGC TCGCACGAAG GCACGGGCAT CGGGCTTGCG CTGGTCCATG AGCTCGTCAA GCTGCACGGC GGCGAGATCC GCGTCGCGAG CGACGAAGGC CAGGGTGCGG AATTCACGAT CACGCTGCGC GGCGGGCGCA ACCACCTGCC GCCGGAGCAT GTCGTAGCAG AGTCCCCGGA GGCCGCATCG GTGGCGAGAA GCGCGTCGGC CTATCTCGAC GAGGCGCTGC AATGGCTGCC TTATGAGTCG CAGCCGATCG CCTCCGCTCC GGCAAGGAGC GGTGCTCGCG TGCTCGTCGC CGACGACAAC CGTGACCTGC GGACCTTCCT CTCGAGCCTA CTTGCGCCGC ATTACGACGT GCAGGTGGTG GCCGACGGCC GCGAAGCGCT CGCCGCCATT CAGCAGAGAA AACCGGACCT CGTGCTGAGC GACGTGATGA TGCCGAACCT CGACGGCCTG GGCCTGGTGC GGGCGCTGCG CGAGGATCCC GAAACGCGCA CGCTGCCGGT GATTTTGCTG TCGGCGCGCG CGGGTCAGGA AGCCTCGCTC GAAGGCCTTT CCGCCGGCGC GGACGACTAT CTCGCCAAGC CGTTCACGTC CCAGGAACTG CTCGCGCGGG TGCGCACGCA CCTCACCATG GCACGAGCGC GCGATGAGCT GAACACGGAG CTCATGCACG CCAACGAGGA ACTGGAAGCG TTCAGCTACT CCGTGTCCCA TGACTTGCGG GCGCCGCTGC GTGCAGTCAA CGGATACACG CATCTGCTCG AGGAGGGCTA CGCGACGCAG CTCGACGACG AAGGACGGCG GTTTCTTCGC GTGGTGCGGG AGGAAGCCGG CCGGATGGGG CAGCTGATCG ACGACCTGCT CAATCTCTCG CGGATCGCGA GAGCACAGCT CAGCCGGGAC CCGGTCGATC TCTCCGCGCT TGCCCTGGCC GCCGGCGAAG AGCTGCAGCG CAAGGAACCC GAACGGCGGG TCAGCCTGCT GATCCAGGAA GGCCTCGTCG CCGGGGTGGA TCGCCAGCTA TTGCGCATTC TGTTTGACAA TCTTCTCGGC AACGCGTGGA AATTCACGGT CAAGACCGGT GAACCGAGGA TCGCGTTCGG AACGGAACAG CGCAATGGCA GCGAGGTATT TGTGGTGCGC GACAACGGAG CCGGCTTTGA CATGGCTTAC GCCAGCAAGC TGTTTCGGCC CTTTCAGCGA CTCCACACCG AGAGCGAATT CCCTGGAACG GGGATCGGGC TCGCAACCGT TCGCCGGATC GTCGAACGCC ACGGCGGCCG CGTGTGGGCA GAGGGCGCGC CAGGGCGCGG CGCGGCCGTC TTCTTCACGC TCCCGCCCGC GGCAATGGAA GGCCGGAGGT GA
|
Protein sequence | MTDKNVADTN SNAVNVPGAA APVGAPPRDL FTGGGEMGAI MRATDWSKTK LGPIEAWPNS LRTMLGVVLG SRFPMLLWWG PDLLHLYNDA YRPILRDKHP ASLAAPAAEM WAEVWDVAGP MARSVQEGGP ATWTEDLQLF INSGAMVEET YFTFSYSPVP GDDGRVGGLL NTVQETTVKV QGERQIRMLH ELAARAAEAK SENEAYRIAA EVLSANELDL PFVLLYVLNE TASDAELVAV SGWKEYEGRA RPAHVPVKEG ASTASWPFAE VIRTAQELVV DDLSSRFGPL PAGRWNARPE RAIVLPLSRT GQSEPYGFLV AGISPHRAFE DRYRRFFRAT ADQVATVIAN ARAYEAEKTR AEALTETDRA KTAFFSNVSH EFRTPLTLLL GPLETLLAER ELGAEARDRL LQMQRNALRL LRLVNALLDF SRMEAGRHSA RFAPTDLARY TADLASAFRS AMEKAGLSFA VECPPLPAPI YVDRDMWEKI VMNLLSNALK FTFEGGVSLR LVAADGGAHL TVKDTGSGIP RSELPKLFQR FHRVEGARAR SHEGTGIGLA LVHELVKLHG GEIRVASDEG QGAEFTITLR GGRNHLPPEH VVAESPEAAS VARSASAYLD EALQWLPYES QPIASAPARS GARVLVADDN RDLRTFLSSL LAPHYDVQVV ADGREALAAI QQRKPDLVLS DVMMPNLDGL GLVRALREDP ETRTLPVILL SARAGQEASL EGLSAGADDY LAKPFTSQEL LARVRTHLTM ARARDELNTE LMHANEELEA FSYSVSHDLR APLRAVNGYT HLLEEGYATQ LDDEGRRFLR VVREEAGRMG QLIDDLLNLS RIARAQLSRD PVDLSALALA AGEELQRKEP ERRVSLLIQE GLVAGVDRQL LRILFDNLLG NAWKFTVKTG EPRIAFGTEQ RNGSEVFVVR DNGAGFDMAY ASKLFRPFQR LHTESEFPGT GIGLATVRRI VERHGGRVWA EGAPGRGAAV FFTLPPAAME GRR
|
| |