Gene Caul_0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0152 
Symbol 
ID5897864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp169318 
End bp171798 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content70% 
IMG OID641560636 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001681787 
Protein GI167644124 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0748056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTT CCGCCCGCCG CCTTCTGATT TCGGCCCTTG TCGCCTCGAC CAGTCTGGCG 
GGCGCGACCG CGCTGGCTCA GCCGAGCCCT GCCCAGTCCG GCCAGGGGGC CGTCGCCCAT
CCGGCCCTAT GGCCTAAGGC GGCCAGCCCG GCGGCGATCA CCGACGCCAA GACCGAGGCC
TTCATCAGTG GCCTGATGGC CAAGATGAGC CTTGAGGAAA AGGTCGGCCA GACCATCCAG
GGCGACATCG CCTCGATAAC GCCGGCCGAC CTCGAAAAGT ACCCGCTGGG CTCGATCCTG
GCCGGCGGCA ACTCGGCGCC CGGCGGCGAC GACCGCGCCC CGCCCAAGGC CTGGACCGAC
CTGGTCGACG CCTATCGGAA ACAGGCCCTG GCCGCCCGTC CGGGCCATAC GCCGATCCCG
ATCCTGTTCG GCATCGACGC CGTGCACGGC CATAACAACA TCGTCGGCGC GACGATCTTC
CCGCACAATA TCGGCCTGGG CGCGATGCGC GATCCCGCCC TGATCCGCCG TATCGGCGCG
GCCACCGGCG AGGAGGTGGC GGTGGTTGGC GGCGACTGGA CCTTCGGTCC GACCGTGGCC
GTGCCGCGCG ACGACCGCTG GGGCCGCAGC TACGAGGGCT ATGCCGAGGA CCCGGAGGTG
GTGAAGTCCT ATTCCGGACC CATGACCCTG GGCCTGCAAG GCGAGCTGAA GCCAGGCCAG
ACCCTGGCCG CCGGCCACAT CGCCGGCTCG GCCAAGCACT TCCTGGCCGA CGGCGGCGCC
GACGGCGGCA AGGACCAGGG CGACGCCAGT ATCCCGGAGG CCGAGCTGGT CGCCCTCCAC
GCCCAGGGCT ATCCGCCCAG CATCGACGCC GGCATCCTGA CGGTGATGGC CTCGTTCTCC
AGCTGGAACG GCGAGAAGAT CACCGGCAAC AAGACCCTGC TGACCGACGT GCTCAAGGGT
CGGATGGGCT TCCAGGGCTT CGTGGTCAGC GATTGGAACG CCCACGGACA GCTGGCCGGC
TGCACCAATC TCAGCTGCCC GCAGGCGATG AACGCCGGGC TCGACATGTA CATGGCGCCC
GACAGCTGGA AGGGCCTGTT CGACAACACC CTGGCCCAGG TGAAGTCGGG CGAGATCCCG
ATGGCGCGGC TGGACGACGC CGTGCGGCGC ATCCTGCGGG TCAAGGTCAA GGCCGGGCTG
TTCGAGCGCG TCGCGCCGTC GGTGCAAGGC CGGTTCGATC GGCTGGGCGC GGCCGATCAC
CGGGCGATCG CCCGCGAGGC TGTGGCCAAG TCCCTGGTGC TGCTGAAGAA CGACGGCGTG
TTGCCGATCA AGCCGGGCGC GCGGGTGCTG GTGGCGGGGT CGGCCGACGA TATCGGCAAG
GCGGCCGGCG GCTGGACCCT GACCTGGCAG GGCACGGGCA ACAAGAACAG CGACTTCCCC
AACGGTCAGT CGATCTGGGG CGGCATCGAC GAGGCGGTGA AGGCGGCTGG CGGCCAGGCC
GAGCTGACTC CGGACGGCAA GTTCACCACC AAGCCCGACG TGGCGATCGT GGTGTTCGGA
GAAGATCCGT ATGCGGAGTT CCAGGGCGAC GTCGCCAATC TGGGCTACCA GCTGGCCGAC
AAGACCGACC TGGCCCTGCT CAAACGACTG AAGGCCCAGG GCGTCCCCGT GGTCTCGGTG
TTCCTGTCCG GCCGGCCGCT GTGGACCAAT CCCGAGATCA ACGCCTCGAA CGCCTTCGTC
GCCGCCTGGC TGCCGGGCAG CGAGGGGGGC GGGGTCGCGG ATGTTCTGGT GGCGGGCAAG
GACGGCAAGC CGAAACGCAA CTTCCAGGGC AAGCTGGGCT TCTCCTGGCC CAAGCGCGCC
GACCAAGGCC CCCTGAACCG CGGCCAGCCG GGCTACGACC CGCAGTTCGC CTACGGCTAC
GGCCTGTCCT ATGCGAAGGC CGGCGCCGTC GGCGTCCTGC CCGAGGATCC GGGCCATGTG
GCCGCCGCCG GCAGCGTTGA CCGCTATTTC GTGGCCGGCC GGGTTCCGGC CCCCTGGGCG
ATGGACTTCG TGGGGGCGGG CGCGCTGAAG GCGGTCGACG CCGGGGCCCA GGAGAACGCC
CGCCAGGCGG CCTGGACCGG CCAAGGCAGG TTGGCGATCC ACGGCCCGCC GGTCGACCTG
TCGCGCCAGA CCACCGGCGA CATGGCGGTG ATGCTCCGCT ATCGGATCGA CGCCGCCCCG
ACCCAGCCCG TGACCATGAG CATCGGCTGC GGCGACGACG CCGCCTGCGG CGGGACGGTC
GATGTCACAC CGCTGATGGT CGCAACGGCG GGAAGCCAAT GGCGCAGCGT CAAGATCAAG
CTGTCCTGCT TCCAGGCGGC GGGCGCGAAG ATGGACCGCG TCACCGCGCC CTTCGTGGTC
AGCACCGCCG GACCCTTCGT CCTGTCGGTC ACTGAAGTGC GCCTGGCTTC CAATGAAGGC
GACGCGATCT GCCCCAAGTA G
 
Protein sequence
MTVSARRLLI SALVASTSLA GATALAQPSP AQSGQGAVAH PALWPKAASP AAITDAKTEA 
FISGLMAKMS LEEKVGQTIQ GDIASITPAD LEKYPLGSIL AGGNSAPGGD DRAPPKAWTD
LVDAYRKQAL AARPGHTPIP ILFGIDAVHG HNNIVGATIF PHNIGLGAMR DPALIRRIGA
ATGEEVAVVG GDWTFGPTVA VPRDDRWGRS YEGYAEDPEV VKSYSGPMTL GLQGELKPGQ
TLAAGHIAGS AKHFLADGGA DGGKDQGDAS IPEAELVALH AQGYPPSIDA GILTVMASFS
SWNGEKITGN KTLLTDVLKG RMGFQGFVVS DWNAHGQLAG CTNLSCPQAM NAGLDMYMAP
DSWKGLFDNT LAQVKSGEIP MARLDDAVRR ILRVKVKAGL FERVAPSVQG RFDRLGAADH
RAIAREAVAK SLVLLKNDGV LPIKPGARVL VAGSADDIGK AAGGWTLTWQ GTGNKNSDFP
NGQSIWGGID EAVKAAGGQA ELTPDGKFTT KPDVAIVVFG EDPYAEFQGD VANLGYQLAD
KTDLALLKRL KAQGVPVVSV FLSGRPLWTN PEINASNAFV AAWLPGSEGG GVADVLVAGK
DGKPKRNFQG KLGFSWPKRA DQGPLNRGQP GYDPQFAYGY GLSYAKAGAV GVLPEDPGHV
AAAGSVDRYF VAGRVPAPWA MDFVGAGALK AVDAGAQENA RQAAWTGQGR LAIHGPPVDL
SRQTTGDMAV MLRYRIDAAP TQPVTMSIGC GDDAACGGTV DVTPLMVATA GSQWRSVKIK
LSCFQAAGAK MDRVTAPFVV STAGPFVLSV TEVRLASNEG DAICPK