Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0929 |
Symbol | |
ID | 6374596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1005266 |
End bp | 1006024 |
Gene Length | 759 bp |
Protein Length | 252 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642683431 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_001959355 |
Protein GI | 189499885 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.419149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATG ATATGCAGCT TGCTGTCAAA CTTGCCCGGG ATGCGGGAAC GCTGATTCTC TCGTATTATC GAAGCAAATC ATTGAAGATT GACTCCAAAA GGGACGATAC TCCTGTTACT GAAGCTGACA GAAAAGCAGA AAGCCTTATA CGTAAAGGCA TCACGGAAGT GTTTCCCGAC GACGGCATAT TCGGTGAAGA GTTTGATGAG AAATTATCGG TAAACGGCCG TCGCTGGATT CTCGATCCGA TTGACGGTAC GAGGTCCTTT ATACATGGTG TGCCTCTTTT CGGTGTTATG ATCGGTCTGG AGGTTGATCG TGAAATGAGA GTTGGAGCAG TCAATTTTCC TGCACTGGGA GAGATCTATT ATGCTGAAAC GGGGTCCGGC GCATTTTTTA ATGGCGAAGC CATCAGCGTT TCTGCCATAT CTGACTATCG TGAAGCGACC GTTGTCTTTA CAGAAAAAGA GTACCTTCTT GACCCTCTTT CAGATCATCC CGTTGATAGT CTTCGTCATG ATGCGGGTCT TGTGCGCGGC TGGGGAGATT GCTACGGCCA TATGCTTGTC GCTTCCGGAA GAGCTGAAGT GGCAGTTGAC AAGATCATGA GCCCATGGGA CTGTGCTGCT CTGATTCCTG TTGTTACCGA GGCGGGAGGT CGCTGTTTTG ACTATACCGG CGAAACCACC ATATACGGAC AGGGCCTGGT AAGCACCAAC AGGTTCATCG GTGATAAGCT CCTTGTCGAT ATGGCCTGA
|
Protein sequence | MTDDMQLAVK LARDAGTLIL SYYRSKSLKI DSKRDDTPVT EADRKAESLI RKGITEVFPD DGIFGEEFDE KLSVNGRRWI LDPIDGTRSF IHGVPLFGVM IGLEVDREMR VGAVNFPALG EIYYAETGSG AFFNGEAISV SAISDYREAT VVFTEKEYLL DPLSDHPVDS LRHDAGLVRG WGDCYGHMLV ASGRAEVAVD KIMSPWDCAA LIPVVTEAGG RCFDYTGETT IYGQGLVSTN RFIGDKLLVD MA
|
| |