Gene Information |
Locus tag | RPD_4358 |
Symbol | |
ID | 4024883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4822372 |
End bp | 4823154 |
Gene Length | 783 bp |
Protein Length | 260 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637964568 |
Product | histidinol-phosphate phosphatase, putative, inositol monophosphatase |
Protein accession | YP_571476 |
Protein GI | 91978817 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
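The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(4822372, 4823154)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length (783 bp) and Protein Length (260 aa).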
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0029628 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGTCA TCGATTTCGC CGCCTTTATC GGACGCCTCG CAACGTCGTC CGGCGACACC ATCCTGCCGT TCTTCCGCAC CTCTCTCACG ATCGACAACA AGAAGGCCGG CCGCGACTTC GATCCGGTGA CCGAGGCCGA TCGCGCCGGC GAGGCGGTGA TGCGCCGGCT GATCAAAGGG AGTTTCCCCC AGCACGGCAT CGTCGGCGAG GAGTTCGGCA ACGAGCGCGA GGACGCCGAC TATGTTTGGG TGCTGGACCC GATCGACGGC ACCAAATCCT TCATCGCCGG GTTTCCGGTC TGGGGCACGC TGATCGCGCT GCTGCACAAG GGTTCGCCGG TCTACGGCAT GATGCATCAG CCCTACATCG GCGAGCGTTT CTCCGGCGAC AATGGCGCCG CGTCCTATAA GGGACCCTCC GGCGAGCGCA AGCTGACGGT GCGGCGCTGC GCCTCGCTGA GGGAGGCGAC GCTGTTCACC ACCTCGCCGC TGCTGATGAA TGACACCGAC CGCGCCACTT TCGAGCGGGT GCAGGCCGAG GCGCGGCTGA CCCGGTTCGG CGGCGACTGC TACGCCTATT GTATGCTGGC GGCGGGCCAG CTCGACCTCG TGATCGAGAC CGAGCTGAAG CCCTACGACG TCGCGGCGCT GATCCCGATC ATCACCGGCG CCGGCGGCAT CATCACCACC TGGGACGGCC AGCCTGCGCA GAACGGCGGC CGCATCGTCG CCGCGGGCGA CAAGCGGGTC CACGAAGCCG CGATGAAGAT CCTCAACGGC TGA
|
Protein sequence | MTVIDFAAFI GRLATSSGDT ILPFFRTSLT IDNKKAGRDF DPVTEADRAG EAVMRRLIKG SFPQHGIVGE EFGNEREDAD YVWVLDPIDG TKSFIAGFPV WGTLIALLHK GSPVYGMMHQ PYIGERFSGD NGAASYKGPS GERKLTVRRC ASLREATLFT TSPLLMNDTD RATFERVQAE ARLTRFGGDC YAYCMLAAGQ LDLVIETELK PYDVAALIPI ITGAGGIITT WDGQPAQNGG RIVAAGDKRV HEAAMKILNG
|
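The sequences above are shown in space-separated blocks of ten characters. The following sketch shows one way to strip that grouping, compute GC content, and translate a coding sequence with translation table 11. The codon-to-amino-acid string follows the NCBI table layout (bases ordered T, C, A, G), and the start-codon set is the one NCBI lists for table 11; the helper names are hypothetical, and the first codon is translated as M whenever it is in that set, which is a simplifying assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, in NCBI codon order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Alternative initiation codons listed for table 11 (all read as Met at the start).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    gc = s.count("G") + s.count("C")
    return 100 * gc / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "GTGACGGTCA TCGATTTCGC CGCCTTTATC"
print(translate(prefix))
```

Applied to the first 30 bases of the gene sequence, this reproduces the first ten residues of the listed protein sequence, including the methionine read from the GTG start codon.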