Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1522 |
Symbol | |
ID | 3747156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1998757 |
End bp | 1999542 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637774062 |
Product | histidinol-phosphate phosphatase, putative, inositol monophosphatase |
Protein accession | YP_379820 |
Protein GI | 78189482 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACC CATTCGATAA CCCCCACCGT TTATTTGCTT TTACACTCAT TAATCAAGCC AGCCAGATAG CACTCACTTA CTATGGCAAC CAATCGTTAA AAGTTGACAC CAAACGCGAT GCCAGCCCTG TTACAATTGC TGACCGTAAA GCTGAAGCCT TTATTCGCAA AGAGTTAGAG CTGCACTACC CCGATGATGG TATTTTGGGA GAAGAGTTTG GTGAAAAGCT CTCGCAAAAT GGACGACGCT GGGTTATTGA CCCCATTGAC GGCACCAAAG CCTTTATCCA CCATGTGCCG CTTTGGGGTA TGATGTTAGC ACTTGAAGTT AATGGCGAAC CACATTTGGG CATTATTGCC TTTCCCGCAT TAGGAACCAT CTACCACGCC GTGCAAGGCG AAGGTGCCTA TGAGAAAGAG ACACCCATCA GCGTCTCATC CGTTACCTCC GTTGCCGATG CCACCATTGT GTTTACCGAA AAAGAGTACT TGCTTGATTC ACCGTCAAAT CATCCCGTTG ATATGCTGCG CAATAGCGGA GGATTAGTAC GCGGTTGGGG CGACTGCTAC GGGCACATGC TTGTGGCATC GGGCAACGCC GAAGTGGCGG TGGATAAAAT TGTTAGCCCG TGGGATTGCG CTGCTGTTAT CCCCATTGTT ACAGAAGCGG GCGGCTGTTG CTTTGATTAT AAAGGTAATA AATCAAGCAG TGGTGAATAT GGTTTGGTGA GCACCAACCG TCAGCTTGGC GAGCAGTTGT TGCAGGAGAT TGCGGGAAAG GGGTAG
|
Protein sequence | MKNPFDNPHR LFAFTLINQA SQIALTYYGN QSLKVDTKRD ASPVTIADRK AEAFIRKELE LHYPDDGILG EEFGEKLSQN GRRWVIDPID GTKAFIHHVP LWGMMLALEV NGEPHLGIIA FPALGTIYHA VQGEGAYEKE TPISVSSVTS VADATIVFTE KEYLLDSPSN HPVDMLRNSG GLVRGWGDCY GHMLVASGNA EVAVDKIVSP WDCAAVIPIV TEAGGCCFDY KGNKSSSGEY GLVSTNRQLG EQLLQEIAGK G
|
| |