Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PC1_2138 |
Symbol | |
ID | 8133082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pectobacterium carotovorum subsp. carotovorum PC1 |
Kingdom | Bacteria |
Replicon accession | NC_012917 |
Strand | - |
Start bp | 2443262 |
End bp | 2444062 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644865426 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_003017713 |
Protein GI | 253688523 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.625466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAGT CGCTTCCCGA TATTACCTTT TTTCATGAAC TCGCCACGCT GGCGAGTCAG GAAACGTTAC CGCGTTTTCG TTCTCTTACC GCCAATCAAA TTGAAACCAA GCCAAAAGAG GGTTTTCGTT TTGATCCGGT GACGGAAGCC GATCGGGAAG CTGAGCGGGT CATCCGTGAG CACATCACGC GCCATTATCC CGAACACGCG ATCATGGGGG AAGAATTTGG CCTGAGTGGG GAAGGCCCAG TGCGTTGGGT TTTAGATCCG GTTGATGGCA CCCGACCTTT CTTATGTGGG CTACCCGTGT GGGGAACCCT TATTGGCCTG CTGCATCATG AACGTGCCGT AATGGGGATG ATGAGCCAGC CGTTTACCGG AGAGTGTTTC TGGGCTGATG GTTCACTGGC CTGGCGTAGC GACCGCAATG GGGAAGTGCG TTTAAGCACG CGTAAAGGTG TGTCGCTCGA ACAGGCGATT TTGCACACAA CTGCGCCAGA AGCGCTGAGC ATGCACCCGA CCGTTCGCTT TACTGCACTC ACCGAATGCA CGTTGATGAC GCGCTATGGC GGAGAATGTT ACGCCATGGC GATGCTGGCG GCAGGCCAGA TTGACATCTG CGTGGAATTT GCATTGCAGC CCTACGACAT TGTCGCGTTG ATCCCGATTA TTGAACAGGC GGGCGGCATC ATCACCGATC TCAACGGGCA GCGAGCGGAA GCGGGTGGCA CGGTAGTTGC GACTGGTAAC CCAGCGCTGC ATCAGCAGGT TTTAGCCATT CTGAACGGAA CGCGGTCATA G
|
Protein sequence | MSQSLPDITF FHELATLASQ ETLPRFRSLT ANQIETKPKE GFRFDPVTEA DREAERVIRE HITRHYPEHA IMGEEFGLSG EGPVRWVLDP VDGTRPFLCG LPVWGTLIGL LHHERAVMGM MSQPFTGECF WADGSLAWRS DRNGEVRLST RKGVSLEQAI LHTTAPEALS MHPTVRFTAL TECTLMTRYG GECYAMAMLA AGQIDICVEF ALQPYDIVAL IPIIEQAGGI ITDLNGQRAE AGGTVVATGN PALHQQVLAI LNGTRS
|
| |