Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4922 |
Symbol | |
ID | 5902384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5318217 |
End bp | 5319296 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565442 |
Product | GHMP kinase |
Protein accession | YP_001686540 |
Protein GI | 167648877 |
COG category | [R] General function prediction only |
COG ID | [COG2605] Predicted kinase related to galactokinase and mevalonate kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.272679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATTG TCCGGACACC TCTGCGCGTT TCGTTCTTTG GCGGCGGCAC CGATCACCCC GGCTGGTTCC GAACCTTGGG CCCGGGCGCG GTGCTCTCCA CCACGATCGA CAAGTACGTC TACATCACCC TGCGCCACCT GCCCCCGGTG TTCGACTTCA ACTATCGGGT GTCCTGGCGG ATCATGGAGC AGGCCCAGAC CGTCGACGAG ATCCAGCACC CCGTGGTGCG CGCGGTTCTC AAGCACTACA CCAACCCGGG CGATTGCGGC TACGAGATCG CCTACAACGC CGACCTGCCG GCCCGCTCGG GCCTGGGTTC GTCCTCGGCC TTCACCGTCG CGGCCCTCCA CGCCCTGATG CGCCACCAAG GCAAGGAGGT CTCGAAGATG AGCCTCGCCA AGGAGGCCAT TCGCGTCGAG CAGGAGCTGC TGCAGGAGCC CGTCGGGTCG CAGGACCAGA CGGCCGTGGC CTTCGGCGGC TTCAATCGCA TCGACTTCCA CGCCGACGGC GGCCTGGGCG TGCGTCCGGT CGAAATCTCG CTCAACCGGC AGTTCGAGCT CGAGAACCGG CTGATGATGT TCTTCACGGG CTTCACTCGC GACGCCGGGG CGGTGGAGAA GGCCAAGGTC CAGAACTTCG TGGATCGCCG CGAGCAGATG AACCGGCTCT ACGACATGGT CGCCGAGGGC GAAGGCATCT TGCTCGACGA AACGACGCCG ATCGACGACT TCGGCCGCCT GCTGCATCGC GCTTGGCAGG ACAAGCGCAG CCTGTCGTCC GGCGTTTCCA GCGGCCCGAT CGACCGCATG TACGAGACGG CTCTGGGCGC GGGCGCCCTC GGCGGCAAGA TTCTCGGCGC GGGCGGCGGC GGTTTCATGC TGCTGTTCGC GGCCGCCGGA CGCCAGGAAG CCATTCGCTC GGCCCTGGCC AATCTGGTCT TCGAAGACGG CCGCTCGCCG CTGCACGTTC CCTTCCGGCT CGAACGCGAG GGCAGCACGG TGGTGCTCAA CCAGCCGCAG CTGACGGCCA ACTATGAGCG GCGCCCGATC CGCGCCCTGG CGCCCGAGAC CGCGACTTGA
|
Protein sequence | MIIVRTPLRV SFFGGGTDHP GWFRTLGPGA VLSTTIDKYV YITLRHLPPV FDFNYRVSWR IMEQAQTVDE IQHPVVRAVL KHYTNPGDCG YEIAYNADLP ARSGLGSSSA FTVAALHALM RHQGKEVSKM SLAKEAIRVE QELLQEPVGS QDQTAVAFGG FNRIDFHADG GLGVRPVEIS LNRQFELENR LMMFFTGFTR DAGAVEKAKV QNFVDRREQM NRLYDMVAEG EGILLDETTP IDDFGRLLHR AWQDKRSLSS GVSSGPIDRM YETALGAGAL GGKILGAGGG GFMLLFAAAG RQEAIRSALA NLVFEDGRSP LHVPFRLERE GSTVVLNQPQ LTANYERRPI RALAPETAT
|
| |