Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1303 |
Symbol | |
ID | 5898758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1376672 |
End bp | 1379152 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641561788 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001682931 |
Protein GI | 167645268 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.75334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.19336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGGGAG TTATGAACAC CAAGGCCTTC TGCGCCGCGT TGCTGGCGAC CACCCTCCTG TCGACCCCTT TCTTGAATGG CGCGGCGCTG GCCGCCGACA CCAAGAGCAC GGCCCATCCC GCCCTGTGGC CCGCGGCCAA GAGCCAGGGC GTGGTCGATT CCCAGACCGA AGCCTTCGTC GATTCCCTGC TGGCCAAGCT GACCCTCGAG GAAAAGGTCG GCCAGATGAT CCAGGGCGAC ATCGGCTCGG TGAAGCCCGA AGACCTGAAG ACCTACCCCT TAGGCTCGAT CCTGGCCGGC GGCAGCTCGC CGCCGCTGGG CGCGCCCGAC CGCTCGCCGA TCGGCCCGTG GGTCAAGTCG GTCGAGGCGT TCCGCGCCGC GGCCGCGCAA CGCCAGGGCG GCACGCGGAT TCCGCTGATG TTCGGCATCG ACTCCGTGCA CGGCCACGGC AACGCCGTGG GCGCGACGCT CTTCCCGCAC AACATCGGGC TGGGCGCGGC GCGCGACCCC GAACTGATCC GCAAGATCGG CGCGGCCACC GCCCAGGAAA CCGCCGCCAG CGGCTTCGAC TGGGCGTTCG GTCCCACCCT GACCGTGCCG CGCGACGACC GCTGGGGTCG GACCTACGAG GGCTATTCGG AAGACCCCGA GATCGTCCGG TCCTACGCCG GCCAGATGAT CCTGGGGCTG CAGGGCGCCG TCAGTCAGGG CGGCGTCATC CAGCAGGGCC ACGTGGCGGC CAGCGCCAAG CATTTCCTGG GCGACGGCGG CACCCATGAC GGCAAGGACC AGGGTGACAC CCAGGTCTCG GAAGCCGACC TGATCCGCCT GCACGCCCAG GGCTATGTTC CGGCCGTCAA CGCCGGGACC CTGACCATCA TGGCGTCGTT CAACAGCTGG AACGGCGAGA AGATGCACGG CAACAAGAGC CTGCTGACCG ACGTGCTGAA GGGCAAGATG GGCTTCGACG GCTTCATCGT CGGCGACTGG AACGGCCACG GCCAGGTGGC CGGCTGTACG CCCACCAACT GCGCCCAGGC CGCCAATGCG GGCCTGGACA TGTACATGGC CCCCGACAGC TGGAAAGAGC TCTACGCCAA CACCCTGGCC CAGGCCAAGT CGGGCGAGAT CCCGATGGCC CGCATCGACG ACGCCGTGCG CCGCATCCTG CGCGTGAAGG CCAAACTCGG CCTGTTCCAG CAGGCGCGGC CGCTTGAGGG CAAGGAAGCG GTCATGGCCT CGGCCGACCA CCGCGCCATC GCCCGCCAGG CGGTGCGCGA GTCGCTGGTG CTGCTGAAGA ACAACGGCGT GCTGCCGGTC AAGGCCTCGG CCAACATCCT GGTCGCCGGC TCGGGCGCCG ATGACATCGG CCAGCAGGCC GGCGGCTGGA CCCTGTCGTG GCAGGGCACC GGCAACACCA AGGCCGACTT CCCCAACGCC CAGTCGATCT ATTCGGGCCT GAAGGAGACG GTCGAGGCTT CCGGCGGGAC GGCGACGCTC AGCGTTGACG GAGCGTTCGA CAAGAAGCCC GACGTCGCCA TAGTGGTGTT CGGCGAGACG CCCTACGCCG AGGGCGTGGG CGACATCAGG ACGCTGGAAT TCCAGCCGGG GACCAAGACC GACCTCGCCC TGCTCAAGAC ACTGAAGGCG GCTGGCGTGC CCGTGGTGTC GGTGTTCCTC AGCGGCCGGC CGCTGTGGGT CAATCCGGAG ATCAACGCCT CGGACGCCTT CGTCGCGGCC TGGCTGCCGG GTTCGGAAGG CGGCGGGATC GCCGACGTGC TGATCGGCGA CAAGGCGGGA AAGCCGCGCC ACGACTTCCG AGGCAAGCTG TCGTTCAGCT GGCCCAAGAC CGCCGGCCAG TTCACGCTGA ACCGCGGCGA CAAGCGCTAC GACCCGCAGT TCGCCTATGG CCACGGCCTG ACCTACGCCT CCAAGGTCCG TGTGGGGACG CTGTCGGAGA AGCCTGGCCT CACCGTGGCG GCCGAGAACG TCAGCAACTA CTTCGTGGCC GGCAAGACGC CCGCGCCCTA TGAGTTCAGG CTGACCCCGA CCACGGCCGT GCAGGTCCGT CCGGTGGACG CCGGCAACGT GCAGGAGGCC GGTCGCCAGA TCACCTTCTC GGGCGACGTC CCGGCGACGG CCGCGATCTC GGGCGACCAG GCCGACCTGA CGTTCCAGAC CAATGCCGAG ATGAGCCTGC TGATCGACTA TCGCCTCGAC GCCAAGCCGA CCGGTCCGGT GACCCTGGCG ATCGGCAGGG GCAAGGTGGA CGTGACGCCG GTGCTCAACG CTTCGCCGGT CGGCGAGTGG AAGAGCTTGA AGGTCCCGCT CAAGTGCTTC CAGGCGGCGG GAACCGACGT GACCAAGGTC ACCGCGCCGT TCGAACTGTC GACCGCCGGC AAGCTGACCG TGTCGCTGCA AGGGGCGAAG CTGAGCACCG ACCCGGCCGG GGCGACGTGC CCAAGCAAGG CGGCGAACTA A
|
Protein sequence | MWGVMNTKAF CAALLATTLL STPFLNGAAL AADTKSTAHP ALWPAAKSQG VVDSQTEAFV DSLLAKLTLE EKVGQMIQGD IGSVKPEDLK TYPLGSILAG GSSPPLGAPD RSPIGPWVKS VEAFRAAAAQ RQGGTRIPLM FGIDSVHGHG NAVGATLFPH NIGLGAARDP ELIRKIGAAT AQETAASGFD WAFGPTLTVP RDDRWGRTYE GYSEDPEIVR SYAGQMILGL QGAVSQGGVI QQGHVAASAK HFLGDGGTHD GKDQGDTQVS EADLIRLHAQ GYVPAVNAGT LTIMASFNSW NGEKMHGNKS LLTDVLKGKM GFDGFIVGDW NGHGQVAGCT PTNCAQAANA GLDMYMAPDS WKELYANTLA QAKSGEIPMA RIDDAVRRIL RVKAKLGLFQ QARPLEGKEA VMASADHRAI ARQAVRESLV LLKNNGVLPV KASANILVAG SGADDIGQQA GGWTLSWQGT GNTKADFPNA QSIYSGLKET VEASGGTATL SVDGAFDKKP DVAIVVFGET PYAEGVGDIR TLEFQPGTKT DLALLKTLKA AGVPVVSVFL SGRPLWVNPE INASDAFVAA WLPGSEGGGI ADVLIGDKAG KPRHDFRGKL SFSWPKTAGQ FTLNRGDKRY DPQFAYGHGL TYASKVRVGT LSEKPGLTVA AENVSNYFVA GKTPAPYEFR LTPTTAVQVR PVDAGNVQEA GRQITFSGDV PATAAISGDQ ADLTFQTNAE MSLLIDYRLD AKPTGPVTLA IGRGKVDVTP VLNASPVGEW KSLKVPLKCF QAAGTDVTKV TAPFELSTAG KLTVSLQGAK LSTDPAGATC PSKAAN
|
| |