Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3606 |
Symbol | |
ID | 3971633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4009994 |
End bp | 4011937 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637926714 |
Product | sigma-54 dependent trancsriptional regulator |
Protein accession | YP_533461 |
Protein GI | 90425091 |
COG category | [K] Transcription [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3284] Transcriptional activator of acetoin/glycerol metabolism |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.833415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.824204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACAAC GCACGATCCG CATCGCGTGG GAGAAGTTTG TCGCCCATGG CCGGCTGTCC ACGGGCGTTC CTGCGGCTGT GCTGGCGTCA TGGGCGCGAT CCAAGAGCTT TGGCGTCGGC GCCGGCGTCG CCGCGGCGCC ACTCGCCGGC GAGCCCGAAT TGTTCCGGCG CCGGACCACC AGTTCGGACC TGCTCGGCGC GGCCCGCCCA GCGCTGGAGC GATCCGCCCG TCTTCTCGCC GACGCTGGAT CGATGATGAT CTTGACTGAC GCCAGCGGCT TTATCGTCGA GACCGGGGGG GATCCGCGGG TGGTCGATGC CGGACGCAGC AACCACCTGG AGCCGGGCGG TCGCTGGGAG GAGAATGCGA TCGGTACCAA TGCGATCGGC ACCGCGCTCG CCGATGGGCG CCCGACGCTG ATCCGTGGAT CGGAACATTT CTGCGAAGAC GTTCAGCGTT GGACCTGCGC CGCGGTGCCG GTGCGACATC CGATCGGCGG CGATCTGCTC GGTGTCGTCG ATATCTCCGG TCCGGCGGGG ACGTTCAGCG CCCAGGATTT CGCGCTGGCG GCGGCGATCG GCCAGGAAAT CGAAGCTTCC TTGGGCAGCG CCGCCCGGCA GGAGCACGAG GCACTGCTCA AGCGCTTCTT GTCGAAACGC TCGATCTGGC TGAGCGAGGA GATCCTGGTC ATCGACCGGC GCGGCGTGCT GGTGCATGCG ACCGACCCGG CGACGAACAA ACTCGATGCG GCGAACCCGA ATACGCTCGC GCAGGATATT CGCCAGATGA TCGCGGGGGC TTCGCAGGAC GCCTGGCAGG AGAATTGCCG GCGGCGCTTT CCGAATGCCC GCCTTGAAAT CGTCAGCAAT GCCGACACCG CCGTTGGTTG CTTGATCGTC ATGCACCGCA GCCGCCGCGC GGCGCCGGCG CCGGCGGCCG GCAAACCGAG CCAAGAATCC AATGTGGATT TCGACCAGAT TATTGGCGAC AGCGTCGCCA TGCGGGACGC GCGCAGCCGG GCCCGCAAGC TGGCGATGAA CGCCTTGCCG ATCCTGGTCG AAGGCGAAAC TGGGGTCGGC AAGGAGCTGT TCGCTCGCGC CATCAAGGGG GCAGGGCCGG GGGCCGACGG GCCCTTCGTG CCGCTGAATT GTGGCGGCAT GCCGCGCGAC CTGATCGCGA GCGAGTTGTT CGGATACGTC AAGGGCGCCT TCACGGGGGC GGACGAGGGC GGCCGGGCCG GCAAGATCGA GCGTGCTGAT GGCGGCGTGC TTTGCCTTGA TGAAATCGGC GAGATGCCGC TCGATCTGCA ATCCTATCTG CTTCGGGTGC TGGAAGACGG CATCGTCTAT CGCGTCGGCG ATCATGTCGG CCGCCGGGTC AACGTCCGGA TCATTTCGAT GACGAACCGC GATCTCACCG CCGAGATCGA GGCCGGCCGA TTCCGACGCG ATCTCTACTA TCGCATCGCG GCGGCGCGCA TTCTGGTGCC GCCGCTGCGC GAGCGCGGCG ACGACGTCAT CGCTTTGGCG CAGCGCTTCG CGGCGGCCGC CGCCTTGCGG TTGCGGCAGC CCGCGCCAAG CTTTTCGCCG GAGCTACTCG CCCGTCTGCA AGCCCATGAT TGGCCCGGCA ACGTCCGCGA ATTGCGCAAT GTCATCGACG CCATGGTCGC GCTCGCGGAA TCCGAGCGGC TCGACCTCGC AGACTGGCCG GCGGAGTTTC CACGCGATAC GCGGCTTCGC GGGCAGGCGG CGGCTGCAAT GAAGCCGACA CAGCCGCCGC GCCCCAGCGA AAGCCTGCAA TCGGCCGAGC GCGCGGCGAT CCTGGCCCAG GTAGCGGCCT GCAACGGCAA TCTGACCCAA GCGGCCAAGC GCTTGGGCAT CGCGCGCTCC ACGCTCTATC TGCGGCTGAC TCAGTATCGC CGCGATGAGT CCTCGACCGA CTGA
|
Protein sequence | MEQRTIRIAW EKFVAHGRLS TGVPAAVLAS WARSKSFGVG AGVAAAPLAG EPELFRRRTT SSDLLGAARP ALERSARLLA DAGSMMILTD ASGFIVETGG DPRVVDAGRS NHLEPGGRWE ENAIGTNAIG TALADGRPTL IRGSEHFCED VQRWTCAAVP VRHPIGGDLL GVVDISGPAG TFSAQDFALA AAIGQEIEAS LGSAARQEHE ALLKRFLSKR SIWLSEEILV IDRRGVLVHA TDPATNKLDA ANPNTLAQDI RQMIAGASQD AWQENCRRRF PNARLEIVSN ADTAVGCLIV MHRSRRAAPA PAAGKPSQES NVDFDQIIGD SVAMRDARSR ARKLAMNALP ILVEGETGVG KELFARAIKG AGPGADGPFV PLNCGGMPRD LIASELFGYV KGAFTGADEG GRAGKIERAD GGVLCLDEIG EMPLDLQSYL LRVLEDGIVY RVGDHVGRRV NVRIISMTNR DLTAEIEAGR FRRDLYYRIA AARILVPPLR ERGDDVIALA QRFAAAAALR LRQPAPSFSP ELLARLQAHD WPGNVRELRN VIDAMVALAE SERLDLADWP AEFPRDTRLR GQAAAAMKPT QPPRPSESLQ SAERAAILAQ VAACNGNLTQ AAKRLGIARS TLYLRLTQYR RDESSTD
|
| |