Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3686 |
Symbol | |
ID | 6411362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3938638 |
End bp | 3942840 |
Gene Length | 4203 bp |
Protein Length | 1400 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642713566 |
Product | DNA-directed RNA polymerase subunit beta' |
Protein accession | YP_001992661 |
Protein GI | 192292056 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.152394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCAGG AAATTATGAA TCTGTTCAAT CCGACGACGC CGGCTCAGGT CTTCGACCAG ATCCGGATCT CGATCGCGTC GCCGGAGAAG ATTCTGTCGT GGTCGTACGG CGAGATCAAG AAGCCGGAAA CGATCAACTA CCGCACCTTC AAGCCCGAGC GCGACGGCCT GTTCTGCGCC CGCATCTTCG GGCCGATCAA GGACTACGAG TGCTTGTGCG GCAAGTACAA GCGGATGAAG TACAAGGGCA TCATCTGCGA GAAGTGCTCG GTCGAAGTGA CTCTGTCGCG CGTCCGTCGC GAGCGCATGG GCCACATCGA GCTGGCCGCG CCGGTCGCCC ACATCTGGTT CCTGAAGTCC TTGCCGTCGC GTATCGGGCA GCTGCTCGAT ATGACGCTGA AGGACCTCGA GCGCATCCTG TACTTCGAAT ACTACGTGGT GCTGGAGCCG GGCCTCACCG ACCTCAAGGA GCGTCAGCTC CTGTCGGAGG AGGAGTACCT GCGCGCCCAG GATCAGTACG GCCAGGATTC GTTCACCGCC ATGATCGGCG CCGAAGCGAT CCGTGAGCTG CTGAAGGGGC TCGAACTCGA AAAGATCGAC GCGCAGCTGC GCGCCGAGAT GGCCGAGACC GACTCTGACA TCAAGCACAA GAAGCTCGCC AAGCGCCTGA AGATCGTCGA AGCGTTCCGC TATTCCGGCA ACAAGCCGGA GTGGATGATC CTCACGGTCG TGCCGGTGAT CCCGCCGGAT CTGCGCCCGC TGGTGCCGCT CGACGGCGGC CGGTTCGCGA CCTCGGACCT CAACGACCTG TATCGCCGCG TCATCAACCG TAACAACCGC TTGAAGCGGC TGATGGAGCT GCGCGCGCCG GACATCATCA TCCGCAACGA AAAGCGCATG CTGCAGGAGG CTGTGGACGC GCTGTTCGAC AACGGTCGCC GCGGCCGCGT CATCACCGGC GCCAACAAGC GTCCGCTGAA GTCGCTCGCC GATATGCTGA AGGGCAAGCA GGGCCGGTTC CGTCAGAACC TGCTCGGCAA GCGCGTCGAC TATTCGGGCC GTTCGGTGAT CGTGGTCGGT CCCGAGCTCA AGCTGCATCA GTGCGGCCTG CCGAAGAAGA TGGCGCTCGA ACTGTTCAAG CCGTTCATCT ATTCGCGGCT CGACGCCAAG GGTCTGTCGA CCACCGTCAA GCAGGCCAAG AAGCTGGTCG AGAAGGAGCG GCCGGAGGTT TGGGACATCC TCGACGAGGT GATCCGCGAA CATCCGGTGC TGCTAAACCG CGCCCCGACG CTGCATCGTC TCGGCATTCA GGCGTTCGAG CCGGTGCTGA TCGAAGGCAA GGCGATCCAG CTGCATCCGC TGGTGTGCTC GGCGTTCAAC GCCGACTTCG ACGGCGACCA GATGGCCGTG CACGTTCCGC TGTCGCTCGA AGCGCAGCTG GAAGCGCGCG TCCTGATGAT GTCGACCAAC AACATCCTGC ATCCGGCGAA CGGCCAGCCG ATCATCGTGC CGTCGCAGGA CATCGTGCTC GGCCTGTACT ACCTGTCGAT CATGCGGGAA GGCCTGCCGG GCGAGGGCAA AGTGTTTGCC GACCTCGCCG AGCTCGAGCA CGCGCTGTAC TCCAAGGTCA TCCACCTCCA CACCAAGATC AAGTATCGCT GGCATTGGGT GAACGAGGAA GGCGAGAACA CCGTCCGTCT GCTGGAGACC ACCGCCGGCC GCATCCTGCT TGGGCAGGTG CTGCCGAAGT CGCCGAAGCT GCCGTTCGAC GTCATCAACA AGCTGATGAC CAAGCGCGAG ATCTCCGGCG TCATCGACCA AGTCTATCGC CACTGCGGTC AGAAGGAGAC GGTGATCTTC TGCGACCGGA TCATGGCGCT CGGCTTCTTC AACGCGTTCA AGGCCGGCAT CTCGTTCGGT AAGGACGACA TGGTCGTGCC GGGCTCGAAG TGGAAGATCG TCGACTCGAC CCGTACGCTG GCGAAGGACT TCGAGCAGCA GTACAACGAC GGCCTCATCA CCCACGGCGA GAAGTACAAC AAGGTGGTCG ACGCCTGGTC GAAGGCCACC GAAGAAATCG CCAAGGAGAT GATGAAGGAG ATCTCCGCGG TTCGGAAGGC GCCTGACGGC TCCGAACAGC AGGTCAACTC GATCTACATG ATGGCCCACT CCGGTGCGCG TGGTTCGCCC GCGCAGATGC GTCAGCTCGC CGGTATGCGC GGCCTGATGG CCAAGCCGTC GGGTGAAATC ATCGAGACGC CGATCATTTC CAACTTCAAG GAAGGTCTGT CGGTTCTCGA GTACTTCAAC TCGACCCACG GCGCCCGTAA GGGTCTGGCC GACACCGCGC TCAAGACCGC GAACTCGGGT TACCTGACCC GTCGTCTGGT CGACGTGGCG CAGGACTGCA TCATCACGCA GGCTGACTGC GGCACCTCGC TCGGCATCAA GATGCGGGCG ATCGTCGACG CCGGCACCGT GGTCGCCTCG CTGGGCAGCC GTATTCTCGG CCGCACCGCG GGCGAGGACG TGCGCGACCC GGCCACCAAC GAGATCATCG TCAAGCGCGG TGATCTGATG GAGGAGCGGG ACGTCGAGGC GATCCACCAG GCCGGCGTGC AGGAAGTGAA GATCCGCTCG GCGCTGACCT GCGAGCTGGT CAACGGCATC TGCGGCAAGT GCTACGGGCG CGATCTTGCC CGCGGTACTC CGGTCAACCA CGGCGAAGCG GTCGGCGTCA TCGCGGCGCA GTCGATCGGT GAGCCGGGCA CCCAGCTGAC GATGCGTACC TTCCACATCG GCGGTGCGGC GCAGATCAAC GAGCAGTCGG TGATCGAGTC GAACTTCGAC GGCAAGATCG TCATCAAGAA CCGCGCCATC GCCCGTAACG GCGAAGGCCA CAATGTTGCG ATGGTCCGCA ACATGGTGAT CGCGATCGTC GATCCGGACG GCACCGAGCG TGCGACCCAT CGCATCCAGT ACGGCGCGCG CGTGCACGTC GACGAGGGCG ATATGGTCAA GCGTGGCCAG CGTATCGCCG AGTGGGATCC GTACACTCGT CCGATCCTCA CCGAGGTCGA GGGTACCATC GACTTCGAAG ATCTGATCGA GGATCAGTCG ATCTCCGAAA CGCTCGACGA GTCGACCGGT ATTGCCAAGC GTATCGTCAT CGATTGGCGC TCGACCCGCG GCGGCGCGGA CCTGCGTCCG GCGATCGTGA TCAAGGGCAA GGATGGCAAG GTGCTGAAGC TGGCGCGTGG CGGCGACGCC CGCTACATGC TCTCGGTCGA CGCCATTCTG TCGGTCGACG TCGGCGCCCA GGTCAAGCCC GGCGACATCC TCGCGCGTAT CTCGACCGAG AGCGCCAAGA CCCGCGACAT CACCGGCGGT CTGCCGCGGG TGGCGGAGCT GTTCGAGGCG CGGCGGCCGA AGGATGCGGC GATCATCGCC GAGATCGCCG GCACTATCCG GTTCGGTCGC GACTACAAGA ACAAGCGCCG GCTCTCGATC GAGCCGCTCG ACAAGAACGA GGAAGCGCGC GAGTACCTGA TCCCGAAGGG CAAGCACATC CACTTGCAGG ACGGCGACGT CGTCGAAAAG GGCGACTTCA TCGTCGAGGG CAATCCGGCG CCGCACGACA TCCTGGCGAT CAAGGGCATC GAGGAACTCG CGGCCTATCT CGTCAACGAA ATCCAGGAGG TCTATCGACT CCAGGGCGTG TTGATCAACG ACAAGCACAT CGAGGTGATC GTTCGCCAGA TGCTGCAGAA GATCGAGATC ACCGACCAGG GCGATACCGA CATGATCTCG GGCGAGCAGG TCGACAAGAT CGAGTTCAAC GCGCTCAACG CCAAGGCGGT CGAGGAGGGC AAGAAGCCGG CAACCGGCAA TCCTGTGCTG CTCGGCATCA CCAAGGCCAG CTTGCAGACC CGCTCGTTCT TCTCGGCGGC GTCGTTCCAG GAGACCACCC GGGTGCTCAC CGAAGCCGCG GTCAACGGCA AGGTGGATCC GCTGGAAGGC CTCAAGGAAA ACGTCATCGT CGGCCGGCTG ATCCCGGCGG GCACCGGCGC CTCGATGGCC AAGATCCGCG AAGTGGCGGT GAAGCGCGAC CGGCTGATCC TCGACGAGCG CGAGAAGCAG GCGGCGATCG TTCCGGCCGC TGCGCCGGAA GCCGAACCGC TGTCGCTGCC GCCGGCAGAG TAA
|
Protein sequence | MNQEIMNLFN PTTPAQVFDQ IRISIASPEK ILSWSYGEIK KPETINYRTF KPERDGLFCA RIFGPIKDYE CLCGKYKRMK YKGIICEKCS VEVTLSRVRR ERMGHIELAA PVAHIWFLKS LPSRIGQLLD MTLKDLERIL YFEYYVVLEP GLTDLKERQL LSEEEYLRAQ DQYGQDSFTA MIGAEAIREL LKGLELEKID AQLRAEMAET DSDIKHKKLA KRLKIVEAFR YSGNKPEWMI LTVVPVIPPD LRPLVPLDGG RFATSDLNDL YRRVINRNNR LKRLMELRAP DIIIRNEKRM LQEAVDALFD NGRRGRVITG ANKRPLKSLA DMLKGKQGRF RQNLLGKRVD YSGRSVIVVG PELKLHQCGL PKKMALELFK PFIYSRLDAK GLSTTVKQAK KLVEKERPEV WDILDEVIRE HPVLLNRAPT LHRLGIQAFE PVLIEGKAIQ LHPLVCSAFN ADFDGDQMAV HVPLSLEAQL EARVLMMSTN NILHPANGQP IIVPSQDIVL GLYYLSIMRE GLPGEGKVFA DLAELEHALY SKVIHLHTKI KYRWHWVNEE GENTVRLLET TAGRILLGQV LPKSPKLPFD VINKLMTKRE ISGVIDQVYR HCGQKETVIF CDRIMALGFF NAFKAGISFG KDDMVVPGSK WKIVDSTRTL AKDFEQQYND GLITHGEKYN KVVDAWSKAT EEIAKEMMKE ISAVRKAPDG SEQQVNSIYM MAHSGARGSP AQMRQLAGMR GLMAKPSGEI IETPIISNFK EGLSVLEYFN STHGARKGLA DTALKTANSG YLTRRLVDVA QDCIITQADC GTSLGIKMRA IVDAGTVVAS LGSRILGRTA GEDVRDPATN EIIVKRGDLM EERDVEAIHQ AGVQEVKIRS ALTCELVNGI CGKCYGRDLA RGTPVNHGEA VGVIAAQSIG EPGTQLTMRT FHIGGAAQIN EQSVIESNFD GKIVIKNRAI ARNGEGHNVA MVRNMVIAIV DPDGTERATH RIQYGARVHV DEGDMVKRGQ RIAEWDPYTR PILTEVEGTI DFEDLIEDQS ISETLDESTG IAKRIVIDWR STRGGADLRP AIVIKGKDGK VLKLARGGDA RYMLSVDAIL SVDVGAQVKP GDILARISTE SAKTRDITGG LPRVAELFEA RRPKDAAIIA EIAGTIRFGR DYKNKRRLSI EPLDKNEEAR EYLIPKGKHI HLQDGDVVEK GDFIVEGNPA PHDILAIKGI EELAAYLVNE IQEVYRLQGV LINDKHIEVI VRQMLQKIEI TDQGDTDMIS GEQVDKIEFN ALNAKAVEEG KKPATGNPVL LGITKASLQT RSFFSAASFQ ETTRVLTEAA VNGKVDPLEG LKENVIVGRL IPAGTGASMA KIREVAVKRD RLILDEREKQ AAIVPAAAPE AEPLSLPPAE
|
| |