Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0215 |
Symbol | |
ID | 3909457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 242500 |
End bp | 243780 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637882097 |
Product | hypothetical protein |
Protein accession | YP_483837 |
Protein GI | 86747341 |
COG category | [S] Function unknown |
COG ID | [COG4487] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAGC TGACCATCAT TTGCCCGAAC TGCGCCACCA GCGTCCCGCT GACGGAATCG CTGGCGGCGC CGCTGCTCAA GGATACGCAG AGCAAATACG AGCGGCTGAT CGCGCAGAAG GACAAGGACA TCGCCGGCCG CGAGGCCGCG CTGGAGGCGC AGCGGGCCGA TCTCGACAAC GCGAAAGCCG CCGTCGGCCA GCAGGTCGCC GAGCGGATCG CGGTCGAGCG CACCCGGATC GCCGCCGAGG AAGCCGCCAA AGCGAAGCGG CTCGCGGCCG ACGACCTCGA CGCCAAGGCG CGGCAACTCG CCGAACTCAC CGAGGCGATG CAGCAGAAGG ACGTCAAGCT CGCCGAAGCG CAGCGGGCGC AGGCGGCGTT CCTGCAGAAG CAGCGCCAGC TCGAAGACGA GAAGCGCGAG CTCGACCTCA CCATCGAAAA GCGCGTGCAG GCGTCGCTGG AGAGCGTGCG CAGCAAGGCC AAACAGGATG CCGAAGAGGG GCTGCGGCTC AAGGTCGCCG AGAAGGAAGA AACCATCGCG ACGATGCAGC GGCAGATCGA CAAGCTGAAA TCCGAGCAGG GCTCGCAGCA ATTGCAGGGC GAGGTGATGG AGCTCGAGCT CGAAGCCTCG CTGCGCGCAC GCTTCCCGCA GGATTCGATC GAGCCGGTGC CGAAGGGCGA GTTCGGCGGC GACGTGCTGC ACCGCGTGGT CAACGCCGCC AATCAGCCCT GCGGCACGAT CCTGTGGGAA TCCAAGCGCA CCAAGAACTG GACCGACGGC TGGCTGACCA AGCTGCGCGA CGACCAGCGC AAGGCCAAGG CCGAGCTGGC GCTGATCGTC TCCAACGCGC TGCCGAAGGG CGTGCACAGC TTCGATCACA TCGACGGCGT CTGGGTCGCC GAGGCGCGCT GCGCGATCCC GGTCGCGATC GCGCTGCGAC AGTCGCTGAT CGAGCTCGCC GCCGCGCGCC AGGCCGGCGA AGGCCAGCAG ACCAAGACCG AGCTGGTGTA TCACTATCTC ACCGGGCCGC GGTTCCGGCA GCGGGTCGAG GCGATCGTCG AGAAATTCAC CGAGATGCAG TCCGACCTCG ACAAGGAACG CCGCTCGATG ATGCGGATGT GGGCGAAGCG CGAGGCGCAG ATCCGCGGCG TGCTGGAAGC GACCGCCGGG ATGTATGGCG ACCTGCAGGG CATCGCCGGC AAGGCGCTGG GCGAGATCGA CGGCATGGCG CTGCCGATGC TGGAGGATTT CAGCGACGAC GAGGCCGATC AGGCGGCGTG A
|
Protein sequence | MTELTIICPN CATSVPLTES LAAPLLKDTQ SKYERLIAQK DKDIAGREAA LEAQRADLDN AKAAVGQQVA ERIAVERTRI AAEEAAKAKR LAADDLDAKA RQLAELTEAM QQKDVKLAEA QRAQAAFLQK QRQLEDEKRE LDLTIEKRVQ ASLESVRSKA KQDAEEGLRL KVAEKEETIA TMQRQIDKLK SEQGSQQLQG EVMELELEAS LRARFPQDSI EPVPKGEFGG DVLHRVVNAA NQPCGTILWE SKRTKNWTDG WLTKLRDDQR KAKAELALIV SNALPKGVHS FDHIDGVWVA EARCAIPVAI ALRQSLIELA AARQAGEGQQ TKTELVYHYL TGPRFRQRVE AIVEKFTEMQ SDLDKERRSM MRMWAKREAQ IRGVLEATAG MYGDLQGIAG KALGEIDGMA LPMLEDFSDD EADQAA
|
| |