Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0152 |
Symbol | |
ID | 5897864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 169318 |
End bp | 171798 |
Gene Length | 2481 bp |
Protein Length | 826 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560636 |
Product | glycoside hydrolase family 3 protein |
Protein accession | YP_001681787 |
Protein GI | 167644124 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0748056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTT CCGCCCGCCG CCTTCTGATT TCGGCCCTTG TCGCCTCGAC CAGTCTGGCG GGCGCGACCG CGCTGGCTCA GCCGAGCCCT GCCCAGTCCG GCCAGGGGGC CGTCGCCCAT CCGGCCCTAT GGCCTAAGGC GGCCAGCCCG GCGGCGATCA CCGACGCCAA GACCGAGGCC TTCATCAGTG GCCTGATGGC CAAGATGAGC CTTGAGGAAA AGGTCGGCCA GACCATCCAG GGCGACATCG CCTCGATAAC GCCGGCCGAC CTCGAAAAGT ACCCGCTGGG CTCGATCCTG GCCGGCGGCA ACTCGGCGCC CGGCGGCGAC GACCGCGCCC CGCCCAAGGC CTGGACCGAC CTGGTCGACG CCTATCGGAA ACAGGCCCTG GCCGCCCGTC CGGGCCATAC GCCGATCCCG ATCCTGTTCG GCATCGACGC CGTGCACGGC CATAACAACA TCGTCGGCGC GACGATCTTC CCGCACAATA TCGGCCTGGG CGCGATGCGC GATCCCGCCC TGATCCGCCG TATCGGCGCG GCCACCGGCG AGGAGGTGGC GGTGGTTGGC GGCGACTGGA CCTTCGGTCC GACCGTGGCC GTGCCGCGCG ACGACCGCTG GGGCCGCAGC TACGAGGGCT ATGCCGAGGA CCCGGAGGTG GTGAAGTCCT ATTCCGGACC CATGACCCTG GGCCTGCAAG GCGAGCTGAA GCCAGGCCAG ACCCTGGCCG CCGGCCACAT CGCCGGCTCG GCCAAGCACT TCCTGGCCGA CGGCGGCGCC GACGGCGGCA AGGACCAGGG CGACGCCAGT ATCCCGGAGG CCGAGCTGGT CGCCCTCCAC GCCCAGGGCT ATCCGCCCAG CATCGACGCC GGCATCCTGA CGGTGATGGC CTCGTTCTCC AGCTGGAACG GCGAGAAGAT CACCGGCAAC AAGACCCTGC TGACCGACGT GCTCAAGGGT CGGATGGGCT TCCAGGGCTT CGTGGTCAGC GATTGGAACG CCCACGGACA GCTGGCCGGC TGCACCAATC TCAGCTGCCC GCAGGCGATG AACGCCGGGC TCGACATGTA CATGGCGCCC GACAGCTGGA AGGGCCTGTT CGACAACACC CTGGCCCAGG TGAAGTCGGG CGAGATCCCG ATGGCGCGGC TGGACGACGC CGTGCGGCGC ATCCTGCGGG TCAAGGTCAA GGCCGGGCTG TTCGAGCGCG TCGCGCCGTC GGTGCAAGGC CGGTTCGATC GGCTGGGCGC GGCCGATCAC CGGGCGATCG CCCGCGAGGC TGTGGCCAAG TCCCTGGTGC TGCTGAAGAA CGACGGCGTG TTGCCGATCA AGCCGGGCGC GCGGGTGCTG GTGGCGGGGT CGGCCGACGA TATCGGCAAG GCGGCCGGCG GCTGGACCCT GACCTGGCAG GGCACGGGCA ACAAGAACAG CGACTTCCCC AACGGTCAGT CGATCTGGGG CGGCATCGAC GAGGCGGTGA AGGCGGCTGG CGGCCAGGCC GAGCTGACTC CGGACGGCAA GTTCACCACC AAGCCCGACG TGGCGATCGT GGTGTTCGGA GAAGATCCGT ATGCGGAGTT CCAGGGCGAC GTCGCCAATC TGGGCTACCA GCTGGCCGAC AAGACCGACC TGGCCCTGCT CAAACGACTG AAGGCCCAGG GCGTCCCCGT GGTCTCGGTG TTCCTGTCCG GCCGGCCGCT GTGGACCAAT CCCGAGATCA ACGCCTCGAA CGCCTTCGTC GCCGCCTGGC TGCCGGGCAG CGAGGGGGGC GGGGTCGCGG ATGTTCTGGT GGCGGGCAAG GACGGCAAGC CGAAACGCAA CTTCCAGGGC AAGCTGGGCT TCTCCTGGCC CAAGCGCGCC GACCAAGGCC CCCTGAACCG CGGCCAGCCG GGCTACGACC CGCAGTTCGC CTACGGCTAC GGCCTGTCCT ATGCGAAGGC CGGCGCCGTC GGCGTCCTGC CCGAGGATCC GGGCCATGTG GCCGCCGCCG GCAGCGTTGA CCGCTATTTC GTGGCCGGCC GGGTTCCGGC CCCCTGGGCG ATGGACTTCG TGGGGGCGGG CGCGCTGAAG GCGGTCGACG CCGGGGCCCA GGAGAACGCC CGCCAGGCGG CCTGGACCGG CCAAGGCAGG TTGGCGATCC ACGGCCCGCC GGTCGACCTG TCGCGCCAGA CCACCGGCGA CATGGCGGTG ATGCTCCGCT ATCGGATCGA CGCCGCCCCG ACCCAGCCCG TGACCATGAG CATCGGCTGC GGCGACGACG CCGCCTGCGG CGGGACGGTC GATGTCACAC CGCTGATGGT CGCAACGGCG GGAAGCCAAT GGCGCAGCGT CAAGATCAAG CTGTCCTGCT TCCAGGCGGC GGGCGCGAAG ATGGACCGCG TCACCGCGCC CTTCGTGGTC AGCACCGCCG GACCCTTCGT CCTGTCGGTC ACTGAAGTGC GCCTGGCTTC CAATGAAGGC GACGCGATCT GCCCCAAGTA G
|
Protein sequence | MTVSARRLLI SALVASTSLA GATALAQPSP AQSGQGAVAH PALWPKAASP AAITDAKTEA FISGLMAKMS LEEKVGQTIQ GDIASITPAD LEKYPLGSIL AGGNSAPGGD DRAPPKAWTD LVDAYRKQAL AARPGHTPIP ILFGIDAVHG HNNIVGATIF PHNIGLGAMR DPALIRRIGA ATGEEVAVVG GDWTFGPTVA VPRDDRWGRS YEGYAEDPEV VKSYSGPMTL GLQGELKPGQ TLAAGHIAGS AKHFLADGGA DGGKDQGDAS IPEAELVALH AQGYPPSIDA GILTVMASFS SWNGEKITGN KTLLTDVLKG RMGFQGFVVS DWNAHGQLAG CTNLSCPQAM NAGLDMYMAP DSWKGLFDNT LAQVKSGEIP MARLDDAVRR ILRVKVKAGL FERVAPSVQG RFDRLGAADH RAIAREAVAK SLVLLKNDGV LPIKPGARVL VAGSADDIGK AAGGWTLTWQ GTGNKNSDFP NGQSIWGGID EAVKAAGGQA ELTPDGKFTT KPDVAIVVFG EDPYAEFQGD VANLGYQLAD KTDLALLKRL KAQGVPVVSV FLSGRPLWTN PEINASNAFV AAWLPGSEGG GVADVLVAGK DGKPKRNFQG KLGFSWPKRA DQGPLNRGQP GYDPQFAYGY GLSYAKAGAV GVLPEDPGHV AAAGSVDRYF VAGRVPAPWA MDFVGAGALK AVDAGAQENA RQAAWTGQGR LAIHGPPVDL SRQTTGDMAV MLRYRIDAAP TQPVTMSIGC GDDAACGGTV DVTPLMVATA GSQWRSVKIK LSCFQAAGAK MDRVTAPFVV STAGPFVLSV TEVRLASNEG DAICPK
|
| |