Gene Caul_2404 details


Gene Information

Locus tag: Caul_2404
Symbol:
ID: 5899859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2618000
End bp: 2620696
Gene Length: 2697 bp
Protein Length: 898 aa
Translation table: 11
GC content: 68%
IMG OID: 641562895
Product: Beta-glucosidase
Protein accession: YP_001684029
Protein GI: 167646366
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
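
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2618000
end_bp = 2620696
gene_length_bp = 2697
protein_length_aa = 898


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons are True when the record is internally consistent.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, the listed gene length and protein length both agree with the start and end coordinates.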


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.27509
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAGGT CCAATTGGCG ACGATGCGGC CTGGCCACGG CGCTGTGCGC GGCGTTGACG 
ATGGGCGCGC CGCCCGTACT CGCCGCTCCC GCTTCGCCGC CGCCTTCGGC CGACCGGGCC
AAGACCGATC GCCTGGCCGC GGAACTGGTG GGCAAGATGA CGCTCGACGA GAAGCTCGAG
CAGTTGCTCA ACACCGCGCC GGCGATCCCA CGGCTGGGCG TCCCGGCCTA TAATTGGTGG
ACGGAATCGC TGCACGGCGC CCTGGGCGCC CTGCCCACCA CCAATTTTCC CGAGCCCATT
GGCCTGGCCG CCACCTTCGA TGCCCCGCTC GTTCACGACG TCGCCGGCGC GATCGGCGCC
GAGATGCAGG ACCTGCACGC CCTGGCGCGC GCGACCGGTC GCATGGGCCG CATCGGGACA
GGCCTGAACA CCTGGTCGCC CAACATCAAC ATCTTCCGCG ATCCCCGGTG GGGCCGCGGC
CAGGAGACCT ATGGCGAAGA CCCGTTCCTG ACGGCGCGCA TGGGCGTGGC CTTCGTGGAG
GGGATCCAAG GCGATGACCC CGATCATCCG CGCATCATCG CCACCCCCAA GCATTTCGCG
GTTCACAGCG GGCCCGAGTC CACGCGCCAC GGCGCCAATG TCTTCGTCTC GCGCCGCGAT
CTGGAGGACA CCTACCTTCC TGCGTTCCGC GCCGCCGTCG TCGAAGGCCG AGCTGGTTCG
ATCATGTGCG CCTATAATCG CATTGATGGT CAGCCGGCCT GCGCGAGCGA TCTCCTGCTC
AAGGAGCACC TGCGCGGCGC GTGGAAGTTC GACGGTTACG TCGTGTCCGA CTGCGACGCG
GTCAAGGATA TCAGCGACCA TCACAAGTAT GCGCCCGAAG CGGCCAGCGC CGTGGCCGCG
GCGCTGCGAG CCGGCGTCGA TAACGAGTGC AACGGCGCGA CCCTGACCGA TACCGACGGC
TTGGCCGGCC GTTACCGCGA GGCGCTCGAT CGCGGCTTGA TCTCGACCGC GCAGATCGAC
ACGGCCTTGG TGCGCCTGTT CTCGGCCCGG TTCCGCAATG GCGACCTGCC GGCCAAGGGC
GGGTCCGACG GGCGCCTGGC GGGTCCGAGC GTGGTGACCA CGCACGAGCA CGAGGCGCTA
GCCCTGGCCG CCAGCGAGAA AAGCCTTGTC CTGCTGAAGA ACGATGGCGT GCTGCCGCTC
AAGCCTGGTC TGCGCATCGC GGTGATCGGC CCTCTGGGCG ACGCCACCCG CGTGCTGCGC
GGCAACTACT CCTCGGCCCT TTCGGCGCCG CCGATCTCGG TGGTCGATGG CCTGAGACGC
GCCCTGCCCG CGGCTCAGGT CACCTACGCG CCGTTCGGCG CCTCGTTCAC CGATGGTGAT
CGCGTGCCGA CCGCCGCCTT GCGCACCCCC GATGGAAAGG CGGGTCTCTT GGCGCGGTAT
TTCAACACCG TCGAGCCCCC GCCCGCCCGG TTCGCCCCAG GCGCGTTCGC CGAGGCCGTC
GCCAAGATGA CCTATGCCGA CAAGCCGGTG GTCACGCGCA TCGAGGCGGA CGTCGCGGCC
CGAAGCCTGG ACCTGGCGAG CGTCTCCGAC CACCACCGGG TGGTTTGGAC GGGCTTTCTG
GTCCCGCCGG AGACCGGGAC CTATCGCCTC GGCCTGGCGG GCTTCAACGG CGAGCTGAAG
TTCGACGGCA AGCCGTTCGC CGATCTTCGC AAGGCCGGCT GGGGCAGCCT GCCGACCCTG
AAGACCCTGC GCCTGGAAAA GGGCCGCCGT TATCCGATCG AGATCGTTTC GGAGTCGCAC
GTCCTGTCCG GCGTGAGCCT GGTCTGGAAG CGCATCGCCG CCGACCCCAC GGCCGAGCTC
AAGGCCGCCG CGGCCAGGGC CGACGTGTTG GTGGCGGTGG TGGGCCTAAC CTCCGATATG
GAGGCCGAAG AAGCGCCCAT CGAAATCCCA GGCTTCAAGG GCGGCGACAA GACCACGCTC
GACCTGCCCG CCGACCAGCG GGCCATGCTC GAACAGGCCA AGGCCTTGGG CAAGCCGCTG
ATCGTCGTGG CGATGAACGG CAGCCCCCTG AATTTCGCCT GGGCCAAGGA CAACGCTTCG
GCCCTCCTGG AGGCCTGGTA CCCTGGCCAG TCGGGCGGCT TGGCGATCGC CAACGTCCTG
ACCGGCAAGA CCAATCCCGC CGGTCGCTTG CCGTTGACCT TCTATCGTTC AGTCGACGAT
TTGCCGCCGT TCGACGACTA CGCCATGGCG GGCCGGACCT ATCGTTATTT CGAGGGAACG
CCGGTCTATC CGTTCGGCTA CGGCCTGAGC TACACCCGCT TCGACTACGG CCCGCTCAAG
ATCGAACCGG CGACCAAGGG AGCCGGCCAA GGCCTGCGCG TGACCACGAC GATCAAGAAC
GTCGGAACCC GTCCCGGCGA AGAGGTCGCC CAGCTCTATC TGGACTTTCC CAAGACGCCC
GGCGCGCCCC GTCTGGCCCT GCGCGGCTTC CAGCGGATCG CGCTGAAGCC GGGCGAGACC
CGCGACATCA CCTTCGCGCT CTCGCCTCGC GACCTCAGTT CCGTCGATCT GGATGGCGAG
CACCGGGTGA GCGCCGGGCT TTATCGCGTC AGCGTCGGTT CGGGCCAACC CGACACGGGC
GTCGCGGGCC GCTCGGCGGA CTTCGCCGTC GATGCGGCCG TGTCGGTGGC CAAGTAA
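
The gene sequence above is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation. As a minimal sketch, the GC content field could be recomputed as follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used only to show the call.
example = "ATGTTTAGGT CCAATTGGCG"
print(len(clean_sequence(example)))   # 20 bases after stripping the space
print(gc_fraction(example))           # 9 of 20 bases are G or C
```

Passing the full gene sequence to `gc_fraction` in the same way gives a value that can be compared with the GC content field of the record.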
 
Protein sequence
MFRSNWRRCG LATALCAALT MGAPPVLAAP ASPPPSADRA KTDRLAAELV GKMTLDEKLE 
QLLNTAPAIP RLGVPAYNWW TESLHGALGA LPTTNFPEPI GLAATFDAPL VHDVAGAIGA
EMQDLHALAR ATGRMGRIGT GLNTWSPNIN IFRDPRWGRG QETYGEDPFL TARMGVAFVE
GIQGDDPDHP RIIATPKHFA VHSGPESTRH GANVFVSRRD LEDTYLPAFR AAVVEGRAGS
IMCAYNRIDG QPACASDLLL KEHLRGAWKF DGYVVSDCDA VKDISDHHKY APEAASAVAA
ALRAGVDNEC NGATLTDTDG LAGRYREALD RGLISTAQID TALVRLFSAR FRNGDLPAKG
GSDGRLAGPS VVTTHEHEAL ALAASEKSLV LLKNDGVLPL KPGLRIAVIG PLGDATRVLR
GNYSSALSAP PISVVDGLRR ALPAAQVTYA PFGASFTDGD RVPTAALRTP DGKAGLLARY
FNTVEPPPAR FAPGAFAEAV AKMTYADKPV VTRIEADVAA RSLDLASVSD HHRVVWTGFL
VPPETGTYRL GLAGFNGELK FDGKPFADLR KAGWGSLPTL KTLRLEKGRR YPIEIVSESH
VLSGVSLVWK RIAADPTAEL KAAAARADVL VAVVGLTSDM EAEEAPIEIP GFKGGDKTTL
DLPADQRAML EQAKALGKPL IVVAMNGSPL NFAWAKDNAS ALLEAWYPGQ SGGLAIANVL
TGKTNPAGRL PLTFYRSVDD LPPFDDYAMA GRTYRYFEGT PVYPFGYGLS YTRFDYGPLK
IEPATKGAGQ GLRVTTTIKN VGTRPGEEVA QLYLDFPKTP GAPRLALRGF QRIALKPGET
RDITFALSPR DLSSVDLDGE HRVSAGLYRV SVGSGQPDTG VAGRSADFAV DAAVSVAK