Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1832 |
Symbol | |
ID | 5899287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1939481 |
End bp | 1941907 |
Gene Length | 2427 bp |
Protein Length | 808 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562322 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001683459 |
Protein GI | 167645796 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0261035 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCC AGACCGCCGT CACCCGTCGT ACCCTGGGCG TTAGCCTGGC CGCCCTGGCC GCCGCGTCCT CGACCCTGGC CCGCGCCGCC GCGCCGCCCA AGGCTGGAAA GGTCGAAAGG GCGCTCTACA AGGATCCGAC CCAGTCGATC GAACTGCGGG TGCGCGACCT GCTCTCGCGC ATGACGCTGG AAGAGAAGGC CGCCCAGCTG GTCGGCATCT GGCTCACCAA GGCCAAGATC CAGACCCCGA ACGGCGACTT TTCGCCCGAA GAGGCCAGCA AGAACTTCCC CGACGGCCTG GGCCAGATCT CTCGCCCAAC CGACCGCCGC GGCCTGAAGC CCGCCACGGT CGTGGGCGCC GCCGCCGGCG CCGAGGACGG CTCGATCGGC CGCAACGCCA AGGAGACCGC CCGCTACATC AACGCCGCCC AGAAGTGGGC GATGGAGAAG ACGCGCCTGG GCGTTCCGAT GCTGATGCAC GACGAGGCCC TGCACGGCTA TGTGGCCCGC GACGCCACCA GCTTCCCTCA GGCCATCGCC CTGGCCTCGA CCTTCGATAC CGAGATGACC GAGAAGGTCT TCGCGGTCGC CGCCCGCGAG ATGCGCGCCC GGGGCTCGAA CATCGCCCTG GCCCCGGTGG TCGACGTGGC CCGCGACCCG CGCTGGGGCC GCATCGAGGA GACCTACGGC GAGGACCCGC ACCTGTGCGC CGAGATCGGC CTGGCGGCGA TTCGCGGCTT CCAGGGCAAG ACTCTGCCGC TGGCGCCCGA CAAGGTGTTC GTCACCCTCA AGCACATGAC CGGCCACGGC CAGCCCGAGA ACGGCACCAA TGTCGGCCCG GCCCAGATCG CCGAGCGCAC CCTGCGCGAA AACTTCTTCC CGCCGTTCGA ACGCGCGGTG AAGGAGCTGC CCGTTCGTTC CGTCATGCCC TCGTACAACG AGATCGACGG CGTCCCGTCG CACGCCAACC GCTGGCTGCT GACCGACATC CTGCGCAAGG AGTGGGGCTA CAAGGGTTCG GTGCAGAGCG ACTATTTCGC GATCAAGGAA TTGATGGGCC GTCACAAGCT GACCGACGAC CTGGGCGAGA CGGCCGTCAT GGCCATGAAC GCCGGCGTCG ATGTCGAGCT GCCGGACGGT GAGGCCTACG CCCTGCTGCC CCAACTGGTG AAGGTCGGAC GCATCCCCCA GGCCGCCGTT GACCAGGCCG TCGAGCGCGT CCTGACGATG AAGTTCGAGG GCGGCCTGTT CGAAAACCCC TATGCCGACG AGAAGACGGC CGACGCCAAG ACCGCGACGC CGGACGCCAT CGCCCTGGCC CGCGAGGCGG CCCGCAAGGC CGTGGTGCTG CTGAAGAACG ACAAGGGCGT GCTGCCGCTC AATCCCTCGA AGTTCAAGCG CCTGGCCCTC TTGGGAACTC ACGCAAAGGA CACCCCGATC GGCGGCTACA GCGACACGCC GCGCCATGTG GTGTCGATCT ACGAGGGCCT GCAGGCCGAG GCCAAGAAGA GCGGCTTCAC GCTGGACTAC GCCGAGGCCG TGCGGATCAC GGAGGCCCGG ATCTGGGCCC AGGACGAGGT CAAGCTGGTC GATCCGGCCG TCAACGCCAA GCTGATCGCC GAGGCGGTGG AGGTGGCCAA GCAAGCCGAC GTCATCGTCA TGGTGCTGGG CGACAACGAG CAGACCAGCC GCGAGGCCTG GGCCGACAAC CACCTGGGCG ACCGCGACAG CCTGGACCTG ATCGGTCAGC AGAACGACCT GGCCAGGGCG ATCTTCGACC TGGGCAAGCC CACGGTGGTG TTTCTGCTCA ACGGCCGCCC GCTGTCGATC AACCTGCTGG CGCAGCGCGC GGACGCCGTC ATCGAGGGTT GGTACCTGGG GCAGGAAACC GGCAACGCCG CCGCCGACAT CCTGTTCGGC CGCGCCAATC CGGGCGGCAA GCTGCCGGTC AGCATCGCCC GCGATGTGGG CCAGCTGCCG ATCTACTACA ACCGCAAGCC CACGGCTCGC CGGGGTTACC TGCTGGGCGA CACCTCGCCG CTCTATCCGT TCGGTTTCGG CCTGTCGTAC ACCACGTTCG ACATCTCGGC CCCGCGTCCG GCCAAGGCCG AGATCGGCGC CAACGAGAGC GTCAAGGTCG AGATCGACGT GATCAACACC GGCAAGGTCG CCGGCGACGA GGTGGTGCAG CTCTATATCC ACGACGAGGC CGCCTCGGTG ACCCGTCCGG TGCTGGAGCT CAAGCACTTC AAGCGCGTGA CCCTGGCCCC CGGCGCCAAG CAGACCGTGA CCTTCGAGGT CTCGCCGCTG GACCTGTCGC TGTGGAACCT GGAGATGAAG CGCGTGGTCG AGCCGGGCAA GTTCACCCTG CTGTCGGGGC CCAATTCCGC GCAGTTGAAG CCGGCGACGC TGACGGTCAT GGCTTAA
|
Protein sequence | MSSQTAVTRR TLGVSLAALA AASSTLARAA APPKAGKVER ALYKDPTQSI ELRVRDLLSR MTLEEKAAQL VGIWLTKAKI QTPNGDFSPE EASKNFPDGL GQISRPTDRR GLKPATVVGA AAGAEDGSIG RNAKETARYI NAAQKWAMEK TRLGVPMLMH DEALHGYVAR DATSFPQAIA LASTFDTEMT EKVFAVAARE MRARGSNIAL APVVDVARDP RWGRIEETYG EDPHLCAEIG LAAIRGFQGK TLPLAPDKVF VTLKHMTGHG QPENGTNVGP AQIAERTLRE NFFPPFERAV KELPVRSVMP SYNEIDGVPS HANRWLLTDI LRKEWGYKGS VQSDYFAIKE LMGRHKLTDD LGETAVMAMN AGVDVELPDG EAYALLPQLV KVGRIPQAAV DQAVERVLTM KFEGGLFENP YADEKTADAK TATPDAIALA REAARKAVVL LKNDKGVLPL NPSKFKRLAL LGTHAKDTPI GGYSDTPRHV VSIYEGLQAE AKKSGFTLDY AEAVRITEAR IWAQDEVKLV DPAVNAKLIA EAVEVAKQAD VIVMVLGDNE QTSREAWADN HLGDRDSLDL IGQQNDLARA IFDLGKPTVV FLLNGRPLSI NLLAQRADAV IEGWYLGQET GNAAADILFG RANPGGKLPV SIARDVGQLP IYYNRKPTAR RGYLLGDTSP LYPFGFGLSY TTFDISAPRP AKAEIGANES VKVEIDVINT GKVAGDEVVQ LYIHDEAASV TRPVLELKHF KRVTLAPGAK QTVTFEVSPL DLSLWNLEMK RVVEPGKFTL LSGPNSAQLK PATLTVMA
|
| |