Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1793 |
Symbol | |
ID | 3908874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2052451 |
End bp | 2053926 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883687 |
Product | DEAD/DEAH box helicase |
Protein accession | YP_485412 |
Protein GI | 86748916 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.353288 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAGAA CCAATCTTTT GACCTCGTTT CAGGATTTCG GCCTCGCCGA TCCCATCTCC CGTGCGCTTC AGGAAGAAAA TTACACCGTC CCGACGCCGA TCCAGGCGCA GACCATTCCC CTCGCCCTCG CCGGTCGCGA CGTCGTCGGC ATCGCCCAGA CCGGCACCGG CAAGACCGCG TCCTTCGCGC TGCCGATCCT GCACCGCATC CTTGAAAACC GCATCAAGCC GCAGCCGAAG ACCTGCCGCG TCCTGGTGCT GAGCCCGACC CGCGAACTCT CCGGGCAGAT CCTCGACAGC TTCAACGCCT ATGGCCGCCA CATCCGCCTC AGCGCGACGC TGGCGATCGG CGGCGTGCCG ATGGGCCGCC AGGTCCGCTC GCTGATGCAG GGCGTGGAAG TGCTGGTCGC CACCCCGGGC CGCCTGCTCG ACCTCGTGCA GAGCAACGCG CTGCGCCTCG GCCAGGTCGA GTTCCTGGTG CTCGACGAAG CCGACCGCAT GCTCGACATG GGCTTCATCA ACGACATCCG GAAGATCGTC GCGAAACTCC CGATCAAGCG CCAGACGCTG TTCTTCTCGG CCACCATGCC GAAGGACATC GCCGACCTCG CCGAGCAGAT GCTGCGCGAT CCGGCCCGCG TCGCGGTGAC GCCGGTGGCC TCCACCGTCG AGCGCATCAA CCAGCGCGTG ATCCATCTCG ATCACTCCGC CAAGCCGGCG ATGCTCGCCA CCATCCTGCA GCAGGACGGC GTCAACCAGG CGCTGGTGTT CACGCGCACC AAGCACGGCG CCGACAAGGT GGTGAAGGGC CTGCAGCGCG CCGGCATCAC CGCCGACGCC ATCCACGGCA ACAAATCGCA GAACTATCGC GAGCGCGTGC TCGCGGCGTT CCGTACCGGC GAACTGCGCA CGCTGGTCGC CACCGATATC GCCGCCCGCG GCATCGATGT CGACGGCGTC TCCCACGTCG TCAATTTCGA CCTGCCGAAC ATTCCCGAGA CCTACGTCCA CCGGATCGGC CGCACCGCGC GCGCCGGCGC CGAGGGCACC GCGATCTCGC TGTGCGCCGG CGGCGAAGAG ACCGGCTATC TGCGCGACAT CGAGAAGCTG ATCCGGATCG CGCTGCCGAA GGAAGACCAT CGCACCCCGG GCGCGAAGCC CGCGCCGTCC ACCGCGCCGC AGCGTGGCGG CCAGCGCAAC GGCCAGCAGC GCAATGGCCA GCAGCGCAGC GGCGGCCAGC GCAACGGCGC CGCACCGCAC GCGGCCCGCG GCGACGCTGC TCCCGGATCG AACGACCCGC GCCGCCCGCG ACGCGCCGGC GGCCCGAATG CCGGACGCAA CCCGGACGCG GCCCGGCACG AACCGGCGGT ACGGCCCCAG CACGGCGGAC AGGCCGAAGG CCTGCAGGGC GTCGCATTTT TGCAGCGCAA GAATGATCGC CCCGCAAAGA CCACCAATCA ACGCCCGCAA CGCTGA
|
Protein sequence | MERTNLLTSF QDFGLADPIS RALQEENYTV PTPIQAQTIP LALAGRDVVG IAQTGTGKTA SFALPILHRI LENRIKPQPK TCRVLVLSPT RELSGQILDS FNAYGRHIRL SATLAIGGVP MGRQVRSLMQ GVEVLVATPG RLLDLVQSNA LRLGQVEFLV LDEADRMLDM GFINDIRKIV AKLPIKRQTL FFSATMPKDI ADLAEQMLRD PARVAVTPVA STVERINQRV IHLDHSAKPA MLATILQQDG VNQALVFTRT KHGADKVVKG LQRAGITADA IHGNKSQNYR ERVLAAFRTG ELRTLVATDI AARGIDVDGV SHVVNFDLPN IPETYVHRIG RTARAGAEGT AISLCAGGEE TGYLRDIEKL IRIALPKEDH RTPGAKPAPS TAPQRGGQRN GQQRNGQQRS GGQRNGAAPH AARGDAAPGS NDPRRPRRAG GPNAGRNPDA ARHEPAVRPQ HGGQAEGLQG VAFLQRKNDR PAKTTNQRPQ R
|
| |