Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4679 |
Symbol | hisG |
ID | 5902141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5059149 |
End bp | 5060144 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641565198 |
Product | ATP phosphoribosyltransferase |
Protein accession | YP_001686297 |
Protein GI | 167648634 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0040] ATP phosphoribosyltransferase |
TIGRFAM ID | [TIGR00070] ATP phosphoribosyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.899389 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCTC CAGTGAACGG GCAGATGATC TTCGCAATCC CGTCCAAGGG CCGCTTGAAG GAGCAGGTCG AGGCCTGGCT GGCCGATTGC GGCTTCCGGC TGGAGATGTC GGGCGGGTCG CGCGGCTACA GCGCCGAGCT GTCGGGCCTG CCGGGCGTGT CGGTGCGGCT GCTGTCGGCC GGCGACATCG CGGCGGGCCT GGACAGCGGC GAGCTGCACC TGGGCGTCAC CGGCGAGGAC CTGCTGCGCG AGCGCGGCGA GGACATGGAC GGCCGGGTGA TGCTGCTGCG CGCCCTGGGC TTTGGCCGCG CCGACCTGGT GGTGACCGCG CCCAAGAGCT GGCTGGACGT CGACACCATG GCCGACATCG ACGAGGTCGG TCACGCCCAT CTCGCCAAGA CCGGCCGCCG TCTGCGCGTG GCCACCAAGT ACGTCACCCA GACCCGGGCC TTCTTCGCCC GCCACGGCGT GGCCGACTAC CGGATCGTCG AGAGCAGCGG GGCCACCGAG GGGGCGCCGG CGGCTGGCGC GGCCGAGCTG GTCGTGGACA TCACCACCAC CGGGGCGACC CTGGCGGCCA ATGGCCTGAA GATCCTGTCC GACGGGGTGA TCCTCAAGAG CCAGGCCCAG CTGACCGCCT CGCTGATCAC GCCGTGGACG GAAGGTCAGG TCGAGTCGCT GGCCCGCCTG TTGTCGGTGG TCGAGGCCAA GGGCCGGGCT CGCACTCTTG CCACCCTGGT CTGGCCCGCC GAGCAGGATG CGGCCGGCCA GGCGGCCGTG GCCCCGTTCG TCGGCCAGGG CGCACGCCGG GCCAACGGCG CGCTGCTGGC CGCCGCCGAC CTGTTCGCCG CCGCCGCCGC CCTGGGGGCG GCTGGCGTCG AGCCGGTCAC CGTCTCGCGG CCGGACTATG TGTTCGAGTC GCGTTCGGCG GTGCTGGACC AGCTTTGGGA ACGACTCTCC CGGCAGGATT TGACCGAACC GTCAAAAATC GTCTGA
|
Protein sequence | MTSPVNGQMI FAIPSKGRLK EQVEAWLADC GFRLEMSGGS RGYSAELSGL PGVSVRLLSA GDIAAGLDSG ELHLGVTGED LLRERGEDMD GRVMLLRALG FGRADLVVTA PKSWLDVDTM ADIDEVGHAH LAKTGRRLRV ATKYVTQTRA FFARHGVADY RIVESSGATE GAPAAGAAEL VVDITTTGAT LAANGLKILS DGVILKSQAQ LTASLITPWT EGQVESLARL LSVVEAKGRA RTLATLVWPA EQDAAGQAAV APFVGQGARR ANGALLAAAD LFAAAAALGA AGVEPVTVSR PDYVFESRSA VLDQLWERLS RQDLTEPSKI V
|
| |