Gene Caul_3622 details

Gene Information

Locus tag: Caul_3622
Symbol:
ID: 5901077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3910163
End bp: 3912409
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 68%
IMG OID: 641564133
Product: Beta-glucosidase
Protein accession: YP_001685247
Protein GI: 167647584
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
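
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon, which is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3910163
end_bp = 3912409

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (2247 bp) and Protein Length (748 aa).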


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.642494
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGAC AGCAACTTCG GGCCCTCGCC CTGGCCCTGA TGCTGACCAC GGCGCTGCCG 
ACAGCCGCCG CTCACGCCCA GGCCGCGTCT GCGAGCACGG CGAAACCCTG GATGAACACC
AAGCTGAGCG CCGATCAGCG CGCCGAGCTG GTCGTCGCGC AAATGACCCA GGACGAGAAA
TTGGCGCTGG TGTTCGGCTT CTTCGGTTCG AACCAGAAGA CGCCGCAGTT CACCCCCTCG
CCGGAGGCCC GCATGGGCTC GGCCGGCTAC ATCCCCGGCA TTCCGCGCCT CGGCGTGCCG
CCGCTGTGGG AGACCGACGC CGGCGTCGGC GTCGCCACCC AGCGCGAAAC CAGCGACCCG
TACCGCGAGC GCACCTCCCT GCCCTCGGGC CTGGCCACCG CCGCGACCTG GAATCCCGAG
CTGGCCTACA AGGGCGGCGC GATGATCGGC TCGGAGGCTC GCGATTCGGG CTTCAACGTG
CAGCTGGCCG GCGGCGTGAA CCTGGCTCGC GAGCCGCGCA ACGGCCGCAA CTTCGAATAT
GGCGGCGAGG ATCCGCTGCT GGCCGGCACG ATCGTCGGCG CGCAGATCCG CGGCATCCAG
TCCAACAAGA TCATCTCGAC CATCAAGCAC TGGGCGCTGA ACGGCCAGGA GACCGGCCGC
ATGACCGTCA GCGCCAATAT TGCCGACGAC GCGGCCCGCG CGTCGGACTT CCTGGCCTTC
GAACTGGCCA TCGAGCAGTC GGACCCCGGC GCGGTGATGT GCGCCTATAA CCGCATCAAC
AGCACGTACG CCTGCGAGAG CAACTACCTG CTCAACGAGG TCCTGAAGAC CGACTGGGGC
TACAAGGGGT TCGTGATGTC CGACTGGGGC GGGGTGCACT CCACCCCCAA GGCCGCCAAG
GCGGGCCTGG ACCAGGAGTC GGCCTACACC TTCGACAAGC AGCCCTTCTT CGGCGCGCCT
CTGAAGGCCG CCGTGGCCGA TGGCTCGGCG CCGCAGGCTC GCCTGGACGA CATGGCCAAG
CGCATCACCC GCTCGATGTT CGCCCACGGC CTGTTCGACC ACCCTGTCGC CATCAAGCCG
ATCGACTTCG CCGCCCACGC CAAGATCACC CAGGCCGACG CCGAAGAGGC CATCGTCCTG
CTGAAGAACG ACAAGGGCCT GCTTCCGCTG GCCAGGACCG CCAGGAAGAT CGTCGTCATC
GGCTCGCACG CCGATGTCGG CGTGCTGTCG GGCGGCGGCT CGTCGCAGGT GTTCCCGATC
GGCGGCATGG CGGTGAAGGG TCTGGGTCCC AAGGGCTTCC CCGGCCCGAT CGTCTACCAC
CCCTCCTCGC CGCTGAAGGC GCTGCAGGTC CGCAATCCCG GCGCGACCTT CGCCTATGAC
GACGGGACCG ATGCGGCCGC CGCCGCCAAG CTGGCCGCCG GCGCCGACCT CGTGATCGTC
TTCGCCCACC AGTGGGCCGC CGAGTCGCAG GACTATTCCC TGACCCTGGC CGACAATCAG
GACGCCCTGA TCGACGCCGT CGCCTCGGCC AATCCCAAGA CCGCCGTGGT TCTGGAAACG
GGCGGTCCGG TGCTGATGCC CTGGCTCGAC AAGGTCGGCG CCGTGGTCGA GGCCTGGTAT
CCAGGCACCC ATGGCGGCGA GGCCATCGCC CGGGTGCTGA CCGGCGAGGT CAACCCGTCC
GGGCGCTTGC CGATCACCTT CCCGAAAAGC GTCGACCAGT TGCCGCGTCA GACGATCGAC
TGCGACCCGG CCAAGCCCGA GGACTTCTGC GACGTCAACT ACGACATCGA GGGCGCGGCG
GTCGGCTACA AGTGGTTCGA CCAGAAGGGC CACGCCCCGC TGTTCGCCTT CGGCCACGGC
CTGTCCTACT CGACCTTCGC CTACAGCGGC CTGAAGACCG AGGTGGTCGG CGACACGCTG
AGGGTCAGCT TCACCGTCAA GAACGCCGGA AAGGCTGCTG GCAAGGACGT GCCGCAGGTC
TATGTCGGCC CGAAGGCTGG GGGTTGGGAA GCCCCGCGCC GCTTGGCCGG CTTCAAGAAG
GTCGATCTGG TTCCGGGCGC GACCACCAAG GTCAGCGTCA CCGTCGATCC GCGCCTGCTG
GCCACCTTCG ACTCCAAGGC CAAGACCTGG AACATCGCCG CCGGCGCCTA CGAGGTGTCG
CTGGGCGCCT CGTCGCGCGA CCTGACGGCG AAGTCCGATG TGGCGATGGC GGCCAAGACA
CTCCCGGTGT CGTACGACGG GAAGTAG
 
Protein sequence
MKRQQLRALA LALMLTTALP TAAAHAQAAS ASTAKPWMNT KLSADQRAEL VVAQMTQDEK 
LALVFGFFGS NQKTPQFTPS PEARMGSAGY IPGIPRLGVP PLWETDAGVG VATQRETSDP
YRERTSLPSG LATAATWNPE LAYKGGAMIG SEARDSGFNV QLAGGVNLAR EPRNGRNFEY
GGEDPLLAGT IVGAQIRGIQ SNKIISTIKH WALNGQETGR MTVSANIADD AARASDFLAF
ELAIEQSDPG AVMCAYNRIN STYACESNYL LNEVLKTDWG YKGFVMSDWG GVHSTPKAAK
AGLDQESAYT FDKQPFFGAP LKAAVADGSA PQARLDDMAK RITRSMFAHG LFDHPVAIKP
IDFAAHAKIT QADAEEAIVL LKNDKGLLPL ARTARKIVVI GSHADVGVLS GGGSSQVFPI
GGMAVKGLGP KGFPGPIVYH PSSPLKALQV RNPGATFAYD DGTDAAAAAK LAAGADLVIV
FAHQWAAESQ DYSLTLADNQ DALIDAVASA NPKTAVVLET GGPVLMPWLD KVGAVVEAWY
PGTHGGEAIA RVLTGEVNPS GRLPITFPKS VDQLPRQTID CDPAKPEDFC DVNYDIEGAA
VGYKWFDQKG HAPLFAFGHG LSYSTFAYSG LKTEVVGDTL RVSFTVKNAG KAAGKDVPQV
YVGPKAGGWE APRRLAGFKK VDLVPGATTK VSVTVDPRLL ATFDSKAKTW NIAAGAYEVS
LGASSRDLTA KSDVAMAAKT LPVSYDGK
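
The gene and protein sequences above are related by translation table 11. As a rough illustration, the sketch below translates the first 30 bases of the gene sequence and computes a GC fraction for them. It assumes the standard codon assignments (table 11 uses the same codon-to-amino-acid mapping as the standard code and differs only in the permitted start codons). The helper names `translate` and `gc_fraction` are hypothetical, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First 30 bases of the gene sequence, as displayed above.
first_30 = "ATGAAGCGAC AGCAACTTCG GGCCCTCGCC"
print(translate(first_30))
print(gc_fraction(first_30))
```

The translated prefix can be compared with the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence gives a value to compare with the GC content field.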