Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4021 |
Symbol | |
ID | 5901483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4356358 |
End bp | 4357971 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564542 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001685644 |
Protein GI | 167647981 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.701147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCAGA TCAGCCGCCG CGGAGCCCTG GGCTCTCTGC TGACCGGCGC GGCTGTCGCC GCTGTTCCCG CGGCCGCCGA AGCTCAGCCC GGCGCCCTGC CCGTCCGCGC CGCGGCCTCG CCCTGGGCCA AGGGCGTCGA AGGTCAGCGC AAGGCCGACC TGGGGAACGG GAGGTTCCTC AATCCCATCC TGGCCGGCGA CCATCCCGAC CCGTCGATCC TCAAGGACGG CGAGGTCTAT TACATGACCC ACTCGTCGTT CGACGCCTAT CCGGGCCTGC TGATCTGGCG CTCGACCGAC CTGGTCAACT GGACCCCGGT CGTCGCCGCC CTGAAGACCA ATGTCGGCTC GATCTGGGCG CCCGAGCTCT GCAAGCACCA GGGCCGCTAC TACATCTACC TGCCGGCCAA ATATCCCGAC CACAACACCA GCTACGTGAT CTGGGCCGAC AGGATCGAGG GGCCGTGGAG CGAGCCGGTC GACCTGAAGC TGCCGCGCTA TATCGACCCC GGCCACGTGG TCGACGAGCA TGGCGTGCGC TGGCTGTTCC TGTCGGGCGG CGACCGCATC CAGCTGGCGC CCGACGGCCT GTCGACGGTC GGCAAGCCCG AGCACGTCTA TGATCCTTGG CGCTATCCGG ACGACTGGGA CGTAGAGGGC TTCTCGCCCG AGGGTCCCAA GGTGATGAAG CGCGGCGACT ACTATTACCT GGTCACCGCC GTCGGCGGCA CGGCCGGCCC GCCGACCGGC CACATGGTCA TCGTCGCCCG CGCCAAGTCG CTGGCCGGCC CGTGGGAGGA TGACCCGAAG AACCCCGTCG TCCGCACGAC CAACAACAGC GAGACGTGGT GGTCGCGCGG CCACGCCACC CTGGTCGAGG GTCCGGCCGG CGACTGGTGG ACCGTCTATC ACGGCTACGA AAACGGCTTC TACACCCTGG GCCGCCAGAC CCTGCTGGCC CCGGTGACCT GGACCAAGGA CGGCTGGTTC GAGGTCGGCG GCGGCGACTT GTCTCGCCCC CTCGCCAAAC CCAAGGGGGG CAAGGCCGGA CCGCATGGCC TGGCCCTCTC CGACGACTTC ACGACCGACA AGGTCGGCGT CCAGTGGAAC TTCTTCGACC CCAAGCCGGG CGAGCACGAA CGGCTGACCC GGGCTGGCGG GGTGATGACC TTGAAAGGCG CGGGCGAGGC CCCCTCGACC GGCGCGCCGC TGATCTTCGT CAATGGCGAC CAGACCTATG AGATCGAGTG CGAGATCGAG GTGGATCCCG ACACCCGCGC CGGCCTGATC CTGTTCTACG ACCGCCAGCT CTATTGCGGG TTGGGGTTCG ACGCGAAGGC CTTTGTCACC CACCAGTATG GCATCGAGCG CGGCCGGCCG GCCAATCCGC ATGGCGCGAA GATGCTGATG CGGCTGAGGA ACGACCGCCA CATCGTCAGC TTCCACACCA GCGGGGACGG CGGGGTCACC TGGAAGCGCT TCGACCGCGG CATGGAGGTC TCGGGCTACC ACCACAATGT GCGCGGCGGC TTCCTGATGC TGAAGCCGGG CCTCTACGCC GCCGGCAAGG GCTCGGCGCG GTTCAAGGGG TTCAAGTACC GGGCTCTGGC ATAG
|
Protein sequence | MVQISRRGAL GSLLTGAAVA AVPAAAEAQP GALPVRAAAS PWAKGVEGQR KADLGNGRFL NPILAGDHPD PSILKDGEVY YMTHSSFDAY PGLLIWRSTD LVNWTPVVAA LKTNVGSIWA PELCKHQGRY YIYLPAKYPD HNTSYVIWAD RIEGPWSEPV DLKLPRYIDP GHVVDEHGVR WLFLSGGDRI QLAPDGLSTV GKPEHVYDPW RYPDDWDVEG FSPEGPKVMK RGDYYYLVTA VGGTAGPPTG HMVIVARAKS LAGPWEDDPK NPVVRTTNNS ETWWSRGHAT LVEGPAGDWW TVYHGYENGF YTLGRQTLLA PVTWTKDGWF EVGGGDLSRP LAKPKGGKAG PHGLALSDDF TTDKVGVQWN FFDPKPGEHE RLTRAGGVMT LKGAGEAPST GAPLIFVNGD QTYEIECEIE VDPDTRAGLI LFYDRQLYCG LGFDAKAFVT HQYGIERGRP ANPHGAKMLM RLRNDRHIVS FHTSGDGGVT WKRFDRGMEV SGYHHNVRGG FLMLKPGLYA AGKGSARFKG FKYRALA
|
| |