Gene Information |
Locus tag | RPB_0101 |
Symbol | |
ID | 3909687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 108459 |
End bp | 109907 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637881982 |
Product | PepSY-associated TM helix |
Protein accession | YP_483724 |
Protein GI | 86747228 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
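The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon
    (if present) does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (RPB_0101).
length = gene_length_bp(108459, 109907)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1449 482
```

Both results match the Gene Length (1449 bp) and Protein Length (482 aa) fields. The strand does not affect this calculation, since only the span of the feature matters.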
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCGCA TCACCAGGCG GCTGAAGCGG TGGCTGTATC TCGGCCACCG CTGGCTCGGG ATCGCGTGCT GTCTGTTGTT CGCGATCTGG TTCATCTCCG GCGTGGTGAT GATGTATGTG GCGTTCCCGC AATTCGACGC TAAGGAGCGC CGGGCAGCGC TGCCGGATCT CGCCTTCAGC GAAATCCGGC TCGCCCCCGA TCAGGCGATG GCGGCGGCCG GGCTGACGAC CTATCCGCGT GAACTCCGTC TTGCGATGCA GGACGGCGAG CCGGTGTACC GACTCGCCGG CACGAACAGA CGGCGGCAGG CGATCTCCGC CGTCGACGGC CGGGCGCTCG GCGACATATC GCCCGAACAG GCGCTGGCGG TGGCGCGGCA TCATCCCGCC GCTGTGGCGC CGGCGCTGCT CGACATCGTC GATCGCGACC AATGGAGCGT CACCGCGCGG TTCGATCCGT TGCGGCCGCT GTTTCTGATC GGGCTCGGCG ACGATGCCGG CACCGAGCTT TACGTCTCGC AGAAGACCGG CGAGATCGTG CTCGACACCA ATCGTCACGA GCGGGTCTGG AACTGGCTCG GCGCGATTCC GCACTGGATC TATCTCACCT TGCTGCGCCA GGACGCGCCG CTGTGGCGAC AGGTGGTGAT GTGGACCTCT GGGATCTGTT TGCTGGTCGC GATCAGCGGG ATCTGGATCG GGCTGCTGCG CGCCGGATTG CGGCGGCGCT ATGCGTCGGG CCGGATCACG CCCTATCGCG GCTGGATGGC GTGGCATCAC CTCACCGGCC TCGTCGCCGG CGTACTGGTG CTGACCTGGA TGGCCTCGGG CTGGCTGTCG GTCAATCCGT TTGAGCTGTT CGCGCGCCGC GGCGACAGTC GCGAGGCGCT GCAGCGCTAT GCCGGCCACG ACGCGCCGAC GATCGCGTCC ACACTCCCCG CACGCGACCG GCCCGGCGTC GTCGAAGCGC GCTTCATCTG GGTCGGCGGC GCGCCGCTGA TGCTGCTGGC GCATCGCGAC GGATCGCAGA GCGTGGCCGA TCCGGCGAGC GGCGCATCGC GGACGCTGTC ACCGGAGCGG ATCTTCGACG CGGCGGCGCG GCTGCTCCCT GACGCCACGA TGACGCTGCG GCAGCGGCTG GAGGAGCCCG ACGCCTATTG GTACTCGCAT CATCATCAGC GCGTGCTGCC GGTGCTGCGC GCCGGCTTCG ACGATGCCGC CGCTACCTGG TATCATCTCG ACCCGTCGAC CGGCGAAATT CTCGGGCGCA GCGATCGCAG CCGCCGGGTC TATCGCTGGC TGTTCAACGC GCTGCACAGC TTCGATTTCC CGGTGCTGAT CGCGCACCGG CCGGCGTGGG ACATCGTGGT GATTGTATTG TCGCTGGCGG GGCTGGTGAT TTCGGTCAGC GGCATCCTGC TCGGCTGGCG GCGGCTGCGC CGCGCGTGA
Protein sequence | MRRITRRLKR WLYLGHRWLG IACCLLFAIW FISGVVMMYV AFPQFDAKER RAALPDLAFS EIRLAPDQAM AAAGLTTYPR ELRLAMQDGE PVYRLAGTNR RRQAISAVDG RALGDISPEQ ALAVARHHPA AVAPALLDIV DRDQWSVTAR FDPLRPLFLI GLGDDAGTEL YVSQKTGEIV LDTNRHERVW NWLGAIPHWI YLTLLRQDAP LWRQVVMWTS GICLLVAISG IWIGLLRAGL RRRYASGRIT PYRGWMAWHH LTGLVAGVLV LTWMASGWLS VNPFELFARR GDSREALQRY AGHDAPTIAS TLPARDRPGV VEARFIWVGG APLMLLAHRD GSQSVADPAS GASRTLSPER IFDAAARLLP DATMTLRQRL EEPDAYWYSH HHQRVLPVLR AGFDDAAATW YHLDPSTGEI LGRSDRSRRV YRWLFNALHS FDFPVLIAHR PAWDIVVIVL SLAGLVISVS GILLGWRRLR RA
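The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as a length or GC-content check. The following is a minimal sketch, assuming the sequence field has been copied into a Python string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from the record above.
example = "ATGCGCCGCA TCACCAGGCG"
seq = clean_sequence(example)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the full gene sequence gives a way to compare against the reported Gene Length and GC content fields.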