Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4933 |
Symbol | |
ID | 5902395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5330323 |
End bp | 5331528 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641565453 |
Product | histidinol-phosphate phosphatase family protein |
Protein accession | YP_001686551 |
Protein GI | 167648888 |
COG category | [E] Amino acid transport and metabolism [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0241] Histidinol phosphatase and related phosphatases [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | [TIGR00213] D,D-heptose 1,7-bisphosphate phosphatase [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E [TIGR01656] histidinol-phosphate phosphatase family domain [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.295691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGGC AAGCCGTGAT TTTGGTGGGC GGCAGAGGCA CGCGTCTGGG CGTCGTGGCC AAGGACACCC CCAAGCCCTT GCTGCCGATC GACGGCGATC GCCGCTTCCT CGACTATCTC ATCGAGAACA TGGCGCGGCA TGGCGTCCGC GAGATTCTCC TGGTGGCGGG TCACCTGGGC GATCAAGTGG CGGCGCGCTA TGACGGCGCC GAGTTAGGCG GGTGCAGGAT CGCCGTCGTT ATCGAGCCTG AGCCGGCGGG CACGGGTGGC GCTCTCGTTC ACGTACGGGA CCGACTTGAC CCAGTGTTTC TGATGTCGAA CGGCGACTCC TATTTCGACT TCAACTACCT GGCTCTAACG ACGGTCTTGC GGCCCGACGA CATGGGCGCC CTGTCCTTGC GCTGGGTTCC CGACGCGCGG CGCTACGGCG CGGTCGAGCA GCGGGACGGA CGCATCGTCA ATTTCCGCGA GAAGGACGAG GGCCTGTCGA GCGGCGCCTG GATCTCCGGC GGCGTCTACC TGCTTCGCCG GGAGGTGCTA GGCCTGATCG ACCATCTGCC ATGCTCCATC GAGGCCGAGG TCTTCCCCAT TCTCGCCAAG ACCGGACGCC TGGGCTGCGC GACGTTCGAG GGCTATTTCC TCGACATCGG CCTGCCCGAA ACCCTTGAAC AGGGACGCGC CGAGCTGCCC CAGGTGCGTC GCCGACCCGC CGTGCTGCTG GATCGCGACA ACACCTTGAA CGTCGACACC GGTTACACGC ATCGGCCTCA AGACCTTCGC TGGATGCCCG GCGCGGTCGA AGCCGTTCGA GCGATCAATG ACGCGGGATG GCTGGCCCTG GTGGTTACGA ATCAATCGGG CGTGGGGCGA GGCTTCTACA CCGAGGACCA GATGAGGGCC TTCCACGAGC ATATGCGGAA CGAGTTGGCG GCGGCCGGCG CCCATATCGA CGCCTTCTAT CACTGTCCGT ACCACGCCGA CGCCAGCGTC GAGGCCTATC GGCACGACGA CCATCCCTCA CGCAAGCCCA ATCCTGGCAT GCTGGTCGCG GCCTTGAAGG ACTGGTCGGT CGACGCCGAA CGTTCCGTGA TGATCGGCGA CCAAAGCAGC GATGTCGCCG CCGCGGCAGC CGCCGGCGTG CGAGGCGTCA GGTATGAAGG CGGGGACTTG GCGGCCGTGG TGGTCAAGGC CATGGCGGGC GGCTAG
|
Protein sequence | MIRQAVILVG GRGTRLGVVA KDTPKPLLPI DGDRRFLDYL IENMARHGVR EILLVAGHLG DQVAARYDGA ELGGCRIAVV IEPEPAGTGG ALVHVRDRLD PVFLMSNGDS YFDFNYLALT TVLRPDDMGA LSLRWVPDAR RYGAVEQRDG RIVNFREKDE GLSSGAWISG GVYLLRREVL GLIDHLPCSI EAEVFPILAK TGRLGCATFE GYFLDIGLPE TLEQGRAELP QVRRRPAVLL DRDNTLNVDT GYTHRPQDLR WMPGAVEAVR AINDAGWLAL VVTNQSGVGR GFYTEDQMRA FHEHMRNELA AAGAHIDAFY HCPYHADASV EAYRHDDHPS RKPNPGMLVA ALKDWSVDAE RSVMIGDQSS DVAAAAAAGV RGVRYEGGDL AAVVVKAMAG G
|
| |