Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1220 |
Symbol | |
ID | 5898675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1282121 |
End bp | 1282957 |
Gene Length | 837 bp |
Protein Length | 278 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561705 |
Product | histidinol-phosphate phosphatase |
Protein accession | YP_001682848 |
Protein GI | 167645185 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0483] Archaeal fructose-1,6-bisphosphatase and related enzymes of inositol monophosphatase family |
TIGRFAM ID | [TIGR02067] histidinol-phosphate phosphatase HisN, inositol monophosphatase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.427245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.220917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTCG CCGCCGACGA CCTCGCACAG CTCGACGCCT TCATCATCGA CCTCAACCGC GCTTCGGCGG CGGCGATCCT GCCGCTGTTC CGGGCCGACC ATGGGTTGGA GGACAAGGGC GCGGGCAAGA ACCTGCCGCG CGGCAGCCAC GCCGCCTTCG ATCCGGTGAC GGAAGCCGAT CGCGGCGCCG AGGCGGCGAT CCGCAGGCTG ATCGGCGAGC GCTATCCCGA CCACGGGGTG ATCGGCGAGG AATACGGCGA GGATCGGCCC GACGCCGAAT TCGTCTGGGT GCTGGACCCG ATCGACGGGA CCCGCGCCTT CATCGCCGGC CTGCCGCTGT GGACCACCCT GATCGGTCTG CGCCATCAGG GCCGCCCGGT CCTGGGCTCG ATTGGCCAGC CCTATACGGG CGAGATCTTC ATCGGCTCTT CGGCCGGCTC GCGCCTGATG TCGCGCGGCC AGAGCCGGCC AATCCAGGTG CGGCCCTGCG CCGACCTGAC CGACGCCGTC ATCGCCACCA CCGATCCCGA GGCCTGTTTC GACGGCGCCG AGCTGGGGGC CTGGCGCCAG GTGCGGGCCG CCGCCAAGCT GGCCCGCCTG GGCTGCGACG CCTACGCCTA CGCCATGGTC GCCATGGGCA AGATGGACAT GGTGATCGAG GCCGGTCTGC AGTCCTGGGA CATCGAGGCC GCCATCCCCG TGGTGGAAGG GGCCGGCGGC GTGGTCACCG ACTGGCGCGG CGACACGATC GGCCCGAACG GCGGCCAGAT GGTGATCGCC GGCGACCGAC GCTGCCTGGA CGAGGCGCTA GTGGCGCTGC GGCGGTCGGC GAAGTAA
|
Protein sequence | MTLAADDLAQ LDAFIIDLNR ASAAAILPLF RADHGLEDKG AGKNLPRGSH AAFDPVTEAD RGAEAAIRRL IGERYPDHGV IGEEYGEDRP DAEFVWVLDP IDGTRAFIAG LPLWTTLIGL RHQGRPVLGS IGQPYTGEIF IGSSAGSRLM SRGQSRPIQV RPCADLTDAV IATTDPEACF DGAELGAWRQ VRAAAKLARL GCDAYAYAMV AMGKMDMVIE AGLQSWDIEA AIPVVEGAGG VVTDWRGDTI GPNGGQMVIA GDRRCLDEAL VALRRSAK
|
| |