Gene Information |
Locus tag | Caul_3972 |
Symbol | |
ID | 5901434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4301584 |
End bp | 4303875 |
Gene Length | 2292 bp |
Protein Length | 763 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564493 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001685595 |
Protein GI | 167647932 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
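The coordinate and length fields in the table above are mutually consistent, and a minimal sketch can check this. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used here only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 4301584
END_BP = 4303875


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2292 763
```

Under these assumptions the result matches the listed Gene Length (2292 bp) and Protein Length (763 aa).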
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.471941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.534128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGACG GGCGACACGG TCCGGAGCGA CGGGGCGTGG ACCGGCGGGC GGTCCTGGCC GGCGCGACGG GCCTGGCGGG CTTCGCCCTG GCCGGAGCGG GCGCGGCCCG CGCGGCCTCG CCCCGGGTGG AAGCCCTGAT CGCCCAGATG ACCCTGGAGG AGAAGGCCGG CCAGCTGTCG TGCTATTCCG ACATGATCCG GCCGCCGGTC GGCGACATCA ATCCGCTGGT CAACCAGCGC AACACCCAGC AGATCCTGGC CGACACCCGG GCGGGCCGCA TCGGGGTGCT GATGAACGGG ATCGGGGTCG AGGGCGCCCT GCTGGCCCAG ACCGCCGCCG TCGAGCATTC GCGACTGCGC ATCCCGCTGC TGTTCGCGGC CGACGTGATC CACGGCTTCA AGACCGTGTA CCCGATCCCG CTGGGCGAAT CGGCCAGCTT CGACCCGACC CTCGCCGAGC GCACGGCGCG GGCCGCGGCG ATCGAGGCCT CGTCGCACGG CCTGCACTGG ACCTTCGCCC CGATGGTCGA CGTGGCCCGC GACCAGCGCT GGGGGCGGGG CGCCGAGGGC TCTGGCGAGG ACGTGTTCCT GGGCGAGGTG ATGGCCCAGG CCCGGGTGCG CGGCTTCCAG GGCGGCGACC TGACCGCCGC CGACAGCCTG CTGTCGACCG CCAAGCACTT CGCCGCCTAC GGGGCGGTGA CGGCGGGCCT GGACTATAAC ACCGTCGACA TTTCCGAGGA GACCCTGCGC GAGATCCACC TGCCGCCGTT CAAGGCCGCC TTCGACGCCG GCTGCCTGGC GGTGATGTCG GCCTTCAATG ACATCAATGG CGTGCCCGCC ACGGCCAACA AGCACCTGCT GACCGACATC CTGCGCGGCG AGTGGAATTT CCGGGGCGTG GTGATCTCGG ACTACACCGC CGACCAGGAA CTGGTGGCCC ACGGCTTCGC CGCCGACGAC AAGGACGCCG CCCGCCTGGC GATCCTGGCC GGGGTCGATA TCAGCATGCA GAGTGGGCTC TACAGCCGCT ACCTGCCCGA ACTGGTCGCC GAGGGGCTGG TCCCGATGGC CACGGTCGAC ACCGCCGTGC GCCGGGTGCT GGGCTTGAAG GAAGCGCTGG GTCTGTTCGA CCGGCCGTTC CGCTCGATCG ACCCCAAGGC CCAGGCCGCC AACACCGCCA CCCCGGCCAT GCGCGCCCTC TCGCGCGAGG CGGGCGGCAA GTCGATCGTG CTGCTGCGCA ATGACGGCGG CCTGCTGCCC CTGCCCAACG CGGGCAAGAC CATCGCCCTG ATCGGTCCGT TCGCCGAGGA CCGCGACAAC ATCCTGGGAC CCTGGGCCTT CTTCGGCGAC AAGGCCCTGG GGGTCGACCT GGCGACCGGG ATCCGCGAGG CGATGGCCGA CCCGTCGCGG CTGATCGTGG CCCGCGGTTG CGACGTCGAG ACCGTCATCC CCGGCGGCTA TGACCAGGCC ATCGCCGCCG CCCGGGCCGC CGACGTGGTG CTGCTGGCGG TCGGCGAGAG CCAGAACATG TCCGGCGAAG CCCAGTCGCG CACCGAGATC AGCCTGCCGC GCGTCCAGCA GCAGTTGGCA GAGTGGGTGG CGTCGGTGGG CAAGCCGACG GTGGTGCTGC TGCGCCATGG CCGCGCCCTG GTGCTGGAAG GGGCGGTCAA GGCCGCGCCG GCCATCCTGG CCACCTGGTT CCTTGGCAGC GAGACCGGCC ACGCCGTGGC CGACGTGCTG TTCGGCGAGG TCAATCCCTC GGCCCGCCTG CCGGTCAGCT TCCCGCACGA GAGCGGCCAG 
GAACCGTTCG CCTACAACCA TCGCACCACC GGCCGTCCCG CGCCCCAGGC TGACGACAGC CAGGAGTACA AGGCCCGCTG GCGCACCACC CGCAACGAGG CGCTCTATCC GTTCGGCCAC GGGCTGTCGT ACACCAGCTT CGCGCTCAGC GACGTCAGGC TGTCGACCAC CCGCCTGGGC TGGAACGAGA AGCTCCATGT CACGGTCAAT GTCGCCAACA CCGGCAAGGT CGCCGGCGAG CACGTCTTGC AGCTCTATGT CCGCGACCGG GTGGCCAGCC GCACCCGGCC GGTGCGCGAG CTCAAGCGCT TCCTGCGCGT GGCCCTGAAG CCCGGCGAGC GGCGCGACGT GCGCTTCAGC CTGGAGCGCG ACTCGCTGAT GTTCGTCGGC GACGACGACC GCTGGCTCGC CGAGCCGGGC ATGTTCGACC TGTGGGTGGC CAACAGCGCC GCCGACGGAC TGGCGGCGAG CTTTGAGCTG TTGGGGGCTT AA
|
Protein sequence | MGDGRHGPER RGVDRRAVLA GATGLAGFAL AGAGAARAAS PRVEALIAQM TLEEKAGQLS CYSDMIRPPV GDINPLVNQR NTQQILADTR AGRIGVLMNG IGVEGALLAQ TAAVEHSRLR IPLLFAADVI HGFKTVYPIP LGESASFDPT LAERTARAAA IEASSHGLHW TFAPMVDVAR DQRWGRGAEG SGEDVFLGEV MAQARVRGFQ GGDLTAADSL LSTAKHFAAY GAVTAGLDYN TVDISEETLR EIHLPPFKAA FDAGCLAVMS AFNDINGVPA TANKHLLTDI LRGEWNFRGV VISDYTADQE LVAHGFAADD KDAARLAILA GVDISMQSGL YSRYLPELVA EGLVPMATVD TAVRRVLGLK EALGLFDRPF RSIDPKAQAA NTATPAMRAL SREAGGKSIV LLRNDGGLLP LPNAGKTIAL IGPFAEDRDN ILGPWAFFGD KALGVDLATG IREAMADPSR LIVARGCDVE TVIPGGYDQA IAAARAADVV LLAVGESQNM SGEAQSRTEI SLPRVQQQLA EWVASVGKPT VVLLRHGRAL VLEGAVKAAP AILATWFLGS ETGHAVADVL FGEVNPSARL PVSFPHESGQ EPFAYNHRTT GRPAPQADDS QEYKARWRTT RNEALYPFGH GLSYTSFALS DVRLSTTRLG WNEKLHVTVN VANTGKVAGE HVLQLYVRDR VASRTRPVRE LKRFLRVALK PGERRDVRFS LERDSLMFVG DDDRWLAEPG MFDLWVANSA ADGLAASFEL LGA
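The GC content field can be recomputed from the gene sequence as the fraction of G and C bases. The sketch below assumes the sequence is pasted as it appears in this record, in space-separated blocks of ten bases, and it strips that whitespace before counting. The helper name gc_fraction is hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence listed in this record.
example = "ATGGGTGACG GGCGACACGG"
print(f"{gc_fraction(example):.2f}")  # 0.70
```

Running the same function on the complete gene sequence should give a value comparable to the reported GC content, subject to how the source database rounds.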