Gene Information |
Locus tag | RPB_3922 |
Symbol | |
ID | 3911727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4474906 |
End bp | 4475742 |
Gene Length | 837 bp |
Protein Length | 278 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637885824 |
Product | inositol monophosphatase |
Protein accession | YP_487526 |
Protein GI | 86751030 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1218] 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase |
TIGRFAM ID | [TIGR01331] 3'(2'),5'-bisphosphate nucleotidase, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00186914 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.766393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGG GCCAGGCGTT CGGGACAGGA CCGGTGATTT CGCACAACGA CGCCGTCGCT TTGATGCAGC CATTGACCGA GCTGGTGCTG CGCGCCGGGG CCGCGATCCT CGCCACCGAC CGCTCCGACC CGGTCGAGCA CAAGCCGGAC GGCTCGCCGG TGACCTCCGC CGACCTCGCC GCCGACCGCA TCATCGCCGA GGGGCTGAAG CGGATCGCGC CCGACGTGCC GGCGCTGTCG GAAGAGCGCT GCGACCTCGG CCGGCCGAAT ACCGGGAGCT TCTTCCTGGT CGATCCGCTC GACGGCACCA AGGAATACGT CGCCGGGCGT GACGAATTCA CCGTCAATCT GGCGCTGGTG ACCGACGGCA AGCCGCTGCT CGGCATCGTC GGGGCGCCGG CGCTGGGCCT GGTGTGGCGC GGCCTGGTCG GCCACGGCGC CGAGCGGCTG GCGGTCGACG CCGACGGCAC CGGCTACGAC ACGACGCCGA TCCACACCCG GCCGATGCCG GCCGACGGAG CGCCGTGGGT CGTGGCGATC AGCCGGCTGC ATCTCGACGA ACGCACTCTG GCGTTCATCG CCGAGCGGCC AGGCGGCGTC CACGCGCGGA TGGGATCGGC GCTGAAATTC TGCCGGATCG CCGACGGCGC GGCCGACATC TATCCGCGGC TGTCGCCGAC CTGCGAATGG GACATCGCCG CCGGCGCCGC CGTGGTGATC GCCGCCGGCG GCGAACTGAC CGACAGCAGC GGCCGGCCGC TGCGGTTCGA CGAGCCGCGA CCGAACTTCA TCGTGCCGGA ATTCATCGCC TGGGGGGATG CCCGGGCAGC AGCCTGA
|
Protein sequence | MGKGQAFGTG PVISHNDAVA LMQPLTELVL RAGAAILATD RSDPVEHKPD GSPVTSADLA ADRIIAEGLK RIAPDVPALS EERCDLGRPN TGSFFLVDPL DGTKEYVAGR DEFTVNLALV TDGKPLLGIV GAPALGLVWR GLVGHGAERL AVDADGTGYD TTPIHTRPMP ADGAPWVVAI SRLHLDERTL AFIAERPGGV HARMGSALKF CRIADGAADI YPRLSPTCEW DIAAGAAVVI AAGGELTDSS GRPLRFDEPR PNFIVPEFIA WGDARAAA
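The derived fields in this record (Gene Length, Protein Length, GC content) can be cross-checked against the coordinates and the gene sequence. The following is a minimal sketch, not the database's own method: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to be unspliced and to end in a single stop codon, which matches "Is gene spliced | No" and the lengths listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base groups is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# Values taken from the Start bp, End bp and Gene Length rows of this record.
print(gene_length(4474906, 4475742))  # 837
print(protein_length(837))            # 278
```

Passing the full "Gene sequence" value to gc_percent gives the figure to compare with the GC content row; the record reports it as a whole-number percentage.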