Gene Information |
Locus tag | RPB_2133 |
Symbol | |
ID | 3908548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2425169 |
End bp | 2426359 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637884027 |
Product | hypothetical protein |
Protein accession | YP_485750 |
Protein GI | 86749254 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
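The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2425169
END_BP = 2426359
REPORTED_GENE_LENGTH = 1191    # bp
REPORTED_PROTEIN_LENGTH = 396  # aa


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    computed_gene = gene_length(START_BP, END_BP)
    computed_protein = protein_length(computed_gene)
    print(computed_gene == REPORTED_GENE_LENGTH)
    print(computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 2426359 - 2425169 + 1 gives 1191 bp, and 1191 / 3 - 1 gives 396 aa, which agrees with the reported values.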
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.695891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTGA AATTGGGCAT GGCCAGGAGT TATATGGCCA GGACTAATAT GGCCAAGGCG GGCATCGCAC TCTGCGCGGT CGCGCTGGTG CTGATGTGGC CGCATGCCCG ACAGAGCGCG GCGTTTCTCG CCGCACAGGA CGATCCGGCC CGGCTGTCGG ACCTGCAGAT CGGGGCCGCC TTGCAGCGCG ACCCGGACCT GATCACGCGC CATATCACCG ATGCGCTGGA CGCCCGCGAT CCGGATCTCG CCGACAGCCT GGTGCAACTT GCCGCTGCAC GAAATATCAC GCTTCCCGTC GAACTCACCA CGCGCGTCGC CGCGGCGGTG GCCGCTGAGC AATCGGCCGC CGGCATCGCC ACCCGCTTCG CGACCGGCCT CGTCACCGGC GAGGCGAAGG ACGGCGCCAG CCTGTCCGGC ACGGTGGCGG GCGATCTGTT CGTGTTCGGC GACATCCGCG ACGTGGTCCG CGAGGGCACC AATCTGGCGA CGGGCGCCGA CGCCGACCGG GTCGTGCTCG GGCTCGCGGC CGCCGGGATC GCCATCACCG CGGCGACCTA TGTCACGCTG GGCGGCGCGG CGCCGGTGCG CGCCGGGCTG ACGCTGGTCA AGGATGCGCG CAAGGTCGGC CGGCTCGGCG GGGGGCTGGC GACATGGACC AGCCGCTCGG CGCGCGAGGT GGTCGATGCG CCGGCGCTGC AGCGCGCGGT TGCGGGCTCC TCGTTCAGCC GCCCGGCAGA GACGCTGACT GCGGTCAAGG CGGCATTCCG CGCCGAGAAG GCCGGCGGGC TGATGCGGCT CGCCAAGAAT GTCGGCCGCA TCGGCGACAA GGCCGGCACC CGCGGCGCGC TCGATACGTT GAAGATCGCC GAAGGTCCGA AGGATGTCGC CCGCGCGGCG CGACTCGCCG AGGCCAAAGG CGGCCAGACC CGCGCCTTCC TCAAGGTCCT CGGCCGCGGC GCGCTGCTGC TCACCACCGG CGCATGGAAT TTCGCCTGGT GGATCTTCGG CGCGCTGATG ACGCTGTTCG GTCTCGTCAC CTCGCTCAAG GCCGGCGTCG AGCGGATGAC GCAAGGCTGG ATCGATCGCG GCAAGGCGCG GCGCGCGAAG CGGCTGCTGG CCGAGGCGAA GCGGGCGCAA CGCGCGCAAG TCAATCCGTC TCCGGTCGCA GCGGCCCTGT CGGTTTCGTA G
|
Protein sequence | MRLKLGMARS YMARTNMAKA GIALCAVALV LMWPHARQSA AFLAAQDDPA RLSDLQIGAA LQRDPDLITR HITDALDARD PDLADSLVQL AAARNITLPV ELTTRVAAAV AAEQSAAGIA TRFATGLVTG EAKDGASLSG TVAGDLFVFG DIRDVVREGT NLATGADADR VVLGLAAAGI AITAATYVTL GGAAPVRAGL TLVKDARKVG RLGGGLATWT SRSAREVVDA PALQRAVAGS SFSRPAETLT AVKAAFRAEK AGGLMRLAKN VGRIGDKAGT RGALDTLKIA EGPKDVARAA RLAEAKGGQT RAFLKVLGRG ALLLTTGAWN FAWWIFGALM TLFGLVTSLK AGVERMTQGW IDRGKARRAK RLLAEAKRAQ RAQVNPSPVA AALSVS
|
| |
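The sequences in this record are displayed in space-separated blocks of ten characters. A minimal sketch of how such a blocked string could be joined into a contiguous sequence and its GC fraction computed is shown here. The helper names are hypothetical, and the sample uses only the first two blocks of the gene sequence, so its GC fraction is not expected to match the 71% reported for the full gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove whitespace between display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence from this record.
    sample = "ATGCGTCTGA AATTGGGCAT"
    contiguous = join_blocks(sample)
    print(contiguous)
    print(f"{gc_fraction(contiguous):.2f}")
```

To check the reported GC content, the full Gene sequence field would be passed to `join_blocks` in place of the two-block sample.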