Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2917 |
Symbol | |
ID | 5900372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3162211 |
End bp | 3163698 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563414 |
Product | hypothetical protein |
Protein accession | YP_001684542 |
Protein GI | 167646879 |
COG category | [I] Lipid transport and metabolism [R] General function prediction only |
COG ID | [COG1545] Predicted nucleic-acid-binding protein containing a Zn-ribbon [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.381583 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0360554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGGCA TCGCCGCCTG GGGCGCCTAT GCGCCGCGCC TTCGCCTCAG CCGCAAGGCG GTCACCCAGG CCAATGCCTG GGTCGCGCCG AACCTGCGGG CCAAGGCCAA GGGCGAGCGC TCCATGGCCA ACTGGGACGA GGACGCTCTG ACCATGGCGG TCGAGGCGGC CCGCGACGCC TTGGGGCCAG GCGACGACCG CTCCGCCATC GACGCCCTCT ATTTCGCCTC CACCACGGCG CCGTTCACCG ATCGCCAGAA CGCGGGAATC GTCGCTGGCG CCCTGACGCT GGAGAAGGCC ATCGCCTCGG CCGACATCAC CGGTTCGCAG CGTTGCGGCC TGGCCGCCCT GGGCCAGGCG CTAGTGGCGG TGCAAGGCGG AGCGGCCAAG CGCGCCCTGG TGGCGACCGG CGAGCACCGC CGGGCGCGAG CGGCGTCCGC TCAGGAGCTC GACAATGGCG ACGGCGCGGC GGCCTTCGTG GTCACGGCCG AGGCCGGGGC GGCCGAATTC CTCGGCCGGG GCAGCGTCAC CGACGACTTC GTCGATCACT TCCGTGGGGC CGACAGCGAC TTCGACTATC AATGGGAGGA GCGCTGGATC CGCGACGAGG GCATCGTCAA GCTGGTCCCT CCGGCGATCC GGCAAGCGCT GGAAGTTTCT GGCCTAAAGG CTTCGGACGT CACGCATTTC TGTTTCCCCT CCACCTTCTC CGGGATGGCC GCCAGCCTGG CCAAGACGAT CGGGATCGCC CCCGAGGCGG TGCGCGACAA CCTGGCCTTG ACGCTGGGCG AAGCCGGCTG CGCGCACGGC CCGCTGATGC TGGCCCACGC CCTGGAGCAA GCCAGCCCCG GCGACGTCAT CCTGGTCGCC CAGTTCGGCC AGGGCGCCGA GGCGCTGGTG TTCCGGGTGA CCGAAGCGGC GGCCAGCACC CGCCCGGCGC GCGGCGTCAC CGGCGCTCTG GCTGATCGAA AAGACGAAGA CAACTATCTG AAGTTTCTCA CTTTCAATGG CCTGGTCGAA TGGGACAAGG GCATGCGGGC CGAGAAGGAC AACAAGACGG CCCTGACCAC CCTCTATCGC AACCAGGACA TGATCCTGGG CCTGGTCGGC GGTCGCTGCC GCGAGACCGG CGTCGTGCAG TTCCCCCGCA CGCGCATCTC GGTCGCACCC AACAATCCGG GCGTCGACAC CCAGGAACCA TACCGCTTCG CCGAGCGAAA GGCCTCGGTG CTGAGCTATT CGGCCGACTT CCTGACCTTC TCGATGAGCC CGCCCAACCA CTACGGCATG ATCGTCTTCG AGGGCGGCGG GCGGATCATG ATGGACATCA CCGACGTCGA GCAAGGCGAG GTCGACAGCG GCCTGCCGGT CAAGATGGTG TTCCGCATCA AGGACGTCGA TGAGAAGCGC GGCTTCGTCC GCTACTTCTG GAAGGCCGCG CCGGACCGGG TGGCGATGGC CGCCAAGACC GCCCTGGCGG CGGAATAG
|
Protein sequence | MVGIAAWGAY APRLRLSRKA VTQANAWVAP NLRAKAKGER SMANWDEDAL TMAVEAARDA LGPGDDRSAI DALYFASTTA PFTDRQNAGI VAGALTLEKA IASADITGSQ RCGLAALGQA LVAVQGGAAK RALVATGEHR RARAASAQEL DNGDGAAAFV VTAEAGAAEF LGRGSVTDDF VDHFRGADSD FDYQWEERWI RDEGIVKLVP PAIRQALEVS GLKASDVTHF CFPSTFSGMA ASLAKTIGIA PEAVRDNLAL TLGEAGCAHG PLMLAHALEQ ASPGDVILVA QFGQGAEALV FRVTEAAAST RPARGVTGAL ADRKDEDNYL KFLTFNGLVE WDKGMRAEKD NKTALTTLYR NQDMILGLVG GRCRETGVVQ FPRTRISVAP NNPGVDTQEP YRFAERKASV LSYSADFLTF SMSPPNHYGM IVFEGGGRIM MDITDVEQGE VDSGLPVKMV FRIKDVDEKR GFVRYFWKAA PDRVAMAAKT ALAAE
|
| |