Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3633 |
Symbol | |
ID | 3911435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4169338 |
End bp | 4170558 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885535 |
Product | OsmC-like protein |
Protein accession | YP_487239 |
Protein GI | 86750743 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [COG1765] Predicted redox protein, regulator of disulfide bond formation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.525721 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATCG AACGCTTTGA ATTCCCCGGC AGCGGCGGAC ATCGACTCGC GGCTGCGCTG GAACTGCCGG GCTCGGCGCC GCTCGCCTTC GCGCTGTTTG CGCATTGTTT CACGTGCGGC AAAGACAATC TGGCCGCGCG GCGGATCGCG GCGGGGCTGG CGGCGCGCGG CATCGCGGTG CTGCGGTTCG ACTTCACCGG GCTCGGCGCC AGCGAGGGCG ACTTCGCCAA TGCGACGTTC TCGTCAAACG TCGCCGATCT GGTTCTCGCC GCCGATCATC TGCGCAAGGT CCATCGGGCG CCGTCGCTGC TGATCGGCCA CAGCCTCGGC GGCGCCGCGG TGCTGGCGGC CGCAGCGCAG ATCCCCGAAG CGAAGGCGAT CGCGACTATC GCCGCGCCGT CGGATCCATC GCATGTCGCC GGCCTGTTCG CCGAGCATGT CGATGCGATC CGCGAACAGG GCAGCGTCGA GGTCTCGCTC GCCGGCCGAC CGTTCACGAT CAAGCGCGAA TTCCTCGACG ACGCCGGCGA ACACAATCTG ATGGCGCAGG TGACCAAGCT GCGCAAGGCG CTGCTGGTGA TGCACGCACC GACCGATGCC ACCGTCAATA TCGACAACGC CACCCGGATC TTTCTGGCTG CGCGGCATCC CAAGAGCTTC GTCTCGCTCG ACCATGCCGA TCATCTCCTG AGCGACCGCC GCGATGCGAA CTACGCGGCC GATGTGATCG CCGCCTGGGC GGAGCGCTAT CTCGACGCCC GGCAACCCGC CGCCGCCGGT GCGCCGGAGG TGCTGCGCGC CGTCATTGTG CAGGAAACCG GCGAAAGCAA ATTCCAGCAG CGGATCAGCG TCGGACCGCA TCAGTTGCTC GCCGACGAGC CGGTCGCGGT CGGTGGCGCG GATTCCGGGC TCGGCCCGTA CGATCTGTTG CTTTCGGCGC TCGGCGCCTG CACCTCGATG ACGATGCGGC TCTATGCCGA ACGCAAGAAG CTGCCGCTCG ACCGCGTGAC CGTGACGCTG AGCCACGCCA AGATCCACGC CGAGGACTGC GTCGAATGCG AGACCAAGGT CGGCCTGCTC GACAGGATCG ACCGCGTGAT CGCGATCGAC GGCGATCTCG ACACCGATCA GCGCGCCCGA CTGATCGAGA TCGCGGACAA ATGCCCGGTG CATCGCACCC TGACCTCGGA AGTGAAGATC GTCACCCGCG CGGCGGAGTG A
|
Protein sequence | MPIERFEFPG SGGHRLAAAL ELPGSAPLAF ALFAHCFTCG KDNLAARRIA AGLAARGIAV LRFDFTGLGA SEGDFANATF SSNVADLVLA ADHLRKVHRA PSLLIGHSLG GAAVLAAAAQ IPEAKAIATI AAPSDPSHVA GLFAEHVDAI REQGSVEVSL AGRPFTIKRE FLDDAGEHNL MAQVTKLRKA LLVMHAPTDA TVNIDNATRI FLAARHPKSF VSLDHADHLL SDRRDANYAA DVIAAWAERY LDARQPAAAG APEVLRAVIV QETGESKFQQ RISVGPHQLL ADEPVAVGGA DSGLGPYDLL LSALGACTSM TMRLYAERKK LPLDRVTVTL SHAKIHAEDC VECETKVGLL DRIDRVIAID GDLDTDQRAR LIEIADKCPV HRTLTSEVKI VTRAAE
|
| |