Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3788 |
Symbol | |
ID | 4024304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4227536 |
End bp | 4229854 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963992 |
Product | phytochrome |
Protein accession | YP_570910 |
Protein GI | 91978251 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0317187 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTCGG GCGTTGACAA TTTAGCCGAG TTGAAATTCA GCGCAATCGG GCCGACCGCG TGCGACCGCG AGGCGATTCA TCTCGCCGGC TCGATCCAGC CGCACGGCGC TTTGCTCGCG GTGACGCCAG AGGATCTGCA GATCGTCCAT GCCGGCGGCG AAACTGCAGC GCTGTTGGGC GTTTCCATCG AATCGCTGGC TGGAGTCTCT GCGTTCACGA TTTTCTCGGG CGACCAGATC GCGCGGCTGC GCGCCCTGTT GGGCCCCGAT CGCAAGATTG AACGGCCTCT GCACGCCTTC ACGCTGAAAG CCGCCGATGC GACACCGGTC GACGTCGTGG TTCACCAGGC CTCCGGCCTG CTCGTGCTGG AGTTCGAGCC ACGCCGCGAG CCAGCTCCAA ACAATTCGCT CGCGCTCGTG CAATCGATGA TCCGTCACGT GCAGCGCGCC GGAACCGTGC AGGCGTTTTG CGACGCGGTG GTCGCCGAGG TTCGCGCCGT CACCGGCTTC GATCGGGTGA TGATCTATCG CTTCATGCCG GATTGCAGTG GCGAGGTGAA AGCGGAGTCG CGCGCGTCCG GAATGGAGAG CTTCCTCGGG CTGCGGTATC CGGAGTCCGA CATCCCGAAG CAGGCCCGCG CGCTGTATCT CGCGAACTGG ATCCGCGCGA TCCCCGACGC CCGCTACGCG CCGGCGCGGA TCGTGCCTAT GATCGATCCT CGCACCGGAC TGCCGCTCGA TCTAAGCCAG AGCGTGATCC GCAGCGTATC GCCGGCGCAC CGGCTGTATC TCGCCCATAT GGGCGTGGTG GCTTCAATGT CGCTTTCGAT CATTCAGCAT GGCAGGCTGT GGGGCCTGAT TGCCTGCCAT CACAGCTCGC CGCGCTATCT ACCCTATCGG ATGCGGGAGG CGTGCGAGCT TTTCGCCGAG ATGGCGTCAT CGCAGCTCGA GGCCAAGGTC GCCGCCGAAC AACTCGAGGC ACGGCTGCGC AGCACGCGGA TTCACGAAGA GCTGGTGACG CGGATGAGCC AGGAATCCGA CCTCGCCGAG GGCCTGATCA GGTTTCATCC CAATCTGCTC GACTTCATTC CGGCGACCGG CGTCGGATTG TGGGTCGACG GTCAATTCAC CGGCCTCGGC GTCACGCCCG ATGCCGCGCA AACCGAAGCA CTGATCGGCT GGCTGACGGC CACTGCGAAT GACGGGGTAT TTCAAACCGA CGCCCTGCCG CTGATCTATC CGCCGGCAAA AGCCTTTGCC GATTGCGCCA GCGGCCTGAT GGCGCTGTCG CTGTCGAAAT CGCCACGCGA CTACGTGCTG TGGTTCCGGC CCGAAGTGGT GCGCACCGTC ACCTGGGCGG GCAATCCAAA CAAGCCGGTC GGCGTCGGTC CCGATGGCGG CTTCACGAAC CCGCGCCGCA GCTTCGCCGC ATGGCAGGAA TCGGTGCGGC TGCATTCCGA GCCGTGGCGC GCCTCCGACA TCGAAGCCGC CCACCGGCTG AGGCTGTCGC TGCTGGAAGT GGTGCTGCGG CGGATCGACG GCATCGCCCG CGAGCGCAAA TCGGCGCGGC TGCTGCAGGA GCAACTGATG CGGCAGGTCG AGATCGGCCT GCGCCGGTCG CAGGACGTCG CCAAGACGCT GCGGGAAGAG ACCCGCCGGC GGGTCTCGGT CGAGGCCGAC CTGTCGCAGG TGCTGCGCCG GACGGTCGAG GATCAGGAAG CCGAGCGGCT GCGAATCGCG CGCGAACTGC ATGACACGCT CGGCCAGTCG CTGACGCTGC TGCAGCTCGG CTTCGAGAAT CTCGGGCAGG TCGCACCCGA CAATGGCGAA TTGCAGCGCC GCATCGCCGG CATCAAGACC CTCACGGCCG AGATCGGCCA ACAGGTCAAC CGGCTGGCCT GGGAAATCCG GCCGACCGCG CTCGACGATC TCGGGATCCA GACCGCGGTC CAGCATCTGC TCGACGCATG GAGCGAGAAG TCGCAAGTGC AGTTCGATCT GCACATGACG CTCGGCGACC GCCGCCTGCC GCCCGCGATC GAGACCACTC TTTATCGCGT GCTGCAGGAA GCGCTGACCA ACATCGTCCG CCACGCCGCC GCGAGCCATG TCAGCGTCAT TTTGCGGCTG TCGGATCGGC AGGTGACGAT GGTGGTCGAG GACGACGGCC GTGGCTTCGT CAATCCCGAC GCCGCCCGCC CGCCGGAGCG GCTCGGCCTG CTCGGCATTC GTGAGCGGCT GACGCTGGTC GGCGGCTCGC TCGAAATCGA ATCGGCGCCC GGCAGGGGCA CCGCTTTGTT CGCTCGAATT CCGTTGTAA
|
Protein sequence | MHSGVDNLAE LKFSAIGPTA CDREAIHLAG SIQPHGALLA VTPEDLQIVH AGGETAALLG VSIESLAGVS AFTIFSGDQI ARLRALLGPD RKIERPLHAF TLKAADATPV DVVVHQASGL LVLEFEPRRE PAPNNSLALV QSMIRHVQRA GTVQAFCDAV VAEVRAVTGF DRVMIYRFMP DCSGEVKAES RASGMESFLG LRYPESDIPK QARALYLANW IRAIPDARYA PARIVPMIDP RTGLPLDLSQ SVIRSVSPAH RLYLAHMGVV ASMSLSIIQH GRLWGLIACH HSSPRYLPYR MREACELFAE MASSQLEAKV AAEQLEARLR STRIHEELVT RMSQESDLAE GLIRFHPNLL DFIPATGVGL WVDGQFTGLG VTPDAAQTEA LIGWLTATAN DGVFQTDALP LIYPPAKAFA DCASGLMALS LSKSPRDYVL WFRPEVVRTV TWAGNPNKPV GVGPDGGFTN PRRSFAAWQE SVRLHSEPWR ASDIEAAHRL RLSLLEVVLR RIDGIARERK SARLLQEQLM RQVEIGLRRS QDVAKTLREE TRRRVSVEAD LSQVLRRTVE DQEAERLRIA RELHDTLGQS LTLLQLGFEN LGQVAPDNGE LQRRIAGIKT LTAEIGQQVN RLAWEIRPTA LDDLGIQTAV QHLLDAWSEK SQVQFDLHMT LGDRRLPPAI ETTLYRVLQE ALTNIVRHAA ASHVSVILRL SDRQVTMVVE DDGRGFVNPD AARPPERLGL LGIRERLTLV GGSLEIESAP GRGTALFARI PL
|
| |