Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0928 |
Symbol | |
ID | 3909782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1071097 |
End bp | 1072407 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637882821 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_484549 |
Protein GI | 86748053 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCGG ACGAGTTCAT CTCACGACGG CACTTCATGG GCCTCGTTTC GGGGGCGATC GCCGCCGCCG GCGTGGCCGG TTCGGCGGCG GCGCAGCCGA TCGCCACCAA GGCGCGCATC GTGATCATCG GTGCGGGCGC GGCCGGCACC GCGATCGCCA ACCGGCTGGC GATGCGGCTG GAGCAGGCCA GCATCTTGAT CATCGACGGC CGGCGCGACC ACATCTACCA GCCCGGGCTG TCATTGGTGG CGGCGGGACT GCGGCCCGCC TCCTATGTGG TGTCCCGCAC CAGCGACTGG CTGCAACCCG GCGTCAAGCT GATCGAAGAG CCGGCGGTGG CGATCGATCC GGTGGCCAAG ACGGTCGCGA CCGCCGCGCG CAACACCGTG CCGTACGATT ATCTGATCGT CGCACCGGGG CTGGTGCTGG ATCACGAGGC GATCGAGGGC TTCTCGCTCG ATCTGGTCGG CAGTAACGGC GTCGGCGCGT TGTATGCCGG CCCGGACTAT GCGGCGCGGA CCTGGGCGGC GGCGTCTCGC TTCACCGAGA CCGGAGGTGT GGGACTGTTC ACCCGGCCGG CCACCGAAAT GAAATGCGCC GGCGCGCCGC TCAAGCACAC CTTCCTGATC GACGACATCG CCAGCCGCAA GGTCGGCGCC GGCCGCTACA AGATCACCTA TGCGGCGCAT GCGGACTCGC TATTCAGCGT GCCGATCGTC TCCGAAAAGG TCCGGATGCT GTTCGAAGAG CGCGGCATCA ACGCCGTCTA CAGCCGCGTC TTGAAGGCGA TCGATCCGGG CCGCAGGATC GCCACCTTCC AGACGCCGAA GGGGACAGAA GAGCTCGGCT ACGACTACCT CCACGTGATT CCGCCGCAGC GCGCCCCGGC GTTGATCCGG CAATCAGACC TGTCGTGGGC CGACAAATGG ACCGACCAGG GCTGGGTCGA GGTCGATCAA TACACGCTGC GCCACCGCCG CTATCCGGAT GTCTTCGCGC TCGGCGACGT CGCCGGCGTT CCGAAGGGCA AGACGGCGGC GTCGGTCAAA TGGCAGGTCC CGGTCGTCGA GGATCATCTG ATCGCGGCGA TCAAGGGCAA GGAGGGGACC GAGCGCTTCA ACGGCTACAC CTCCTGCCCG CTGATGACGC GGGTCGGCCG CGCGATGCTG ATCGAGTTCG ACTATCGCAA CAATCTGGCG CCGTCGTTCC CCGGCCTGAT CTCACCGCTC GAAGAGCTGT GGATCAGCTG GCTGATGAAG GAGGTCGCGT TGCGCGCCAC CTACTATGCC ATGCTGCGCG GCAAAGCCTG A
|
Protein sequence | MASDEFISRR HFMGLVSGAI AAAGVAGSAA AQPIATKARI VIIGAGAAGT AIANRLAMRL EQASILIIDG RRDHIYQPGL SLVAAGLRPA SYVVSRTSDW LQPGVKLIEE PAVAIDPVAK TVATAARNTV PYDYLIVAPG LVLDHEAIEG FSLDLVGSNG VGALYAGPDY AARTWAAASR FTETGGVGLF TRPATEMKCA GAPLKHTFLI DDIASRKVGA GRYKITYAAH ADSLFSVPIV SEKVRMLFEE RGINAVYSRV LKAIDPGRRI ATFQTPKGTE ELGYDYLHVI PPQRAPALIR QSDLSWADKW TDQGWVEVDQ YTLRHRRYPD VFALGDVAGV PKGKTAASVK WQVPVVEDHL IAAIKGKEGT ERFNGYTSCP LMTRVGRAML IEFDYRNNLA PSFPGLISPL EELWISWLMK EVALRATYYA MLRGKA
|
| |