Gene Caul_3288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3288 
Symbol 
ID5900743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3556015 
End bp3558207 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content68% 
IMG OID641563794 
ProductBeta-glucosidase 
Protein accessionYP_001684913 
Protein GI167647250 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.362005 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.984755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCGAT CCCTGACGAC CGCCGCCAGC GTCGCCGTCC TCGTGTTCGC CACGGCTGCT 
TCGGCACAGG CCGTCAAGCC TTGGATGAAC ACCAAGCTGT CGCCCGATGA ACGCGCCCGC
CTGCTCGACG CCGAGTTGAC CCTCGACGAG CGCATAGGCC TGGTGCATGG CCCGATGGCC
ATGCCCTACA AGATCAAGAC GATCCCCAAG GGGGCGATCG GCTCCGCGGG CTATATCGCC
GGGGTGGAGC GCCTGGGCAT TCCCGCCCTT CAGGAGAGCG ACGCCAGCCT GGGTGTCACC
AACCCTATGG GCATGCGGCT CGGCGATGGC GCCACCGCCC TGCCGTCGGG CTTGGCCCTG
GCCGCGACTT TCAACGCCAA GCTGGCCTAT GACGGCGGAG CGCTGGTCGG GCGCGAGGCG
CGGGCCAAGG GCTTCAATGT CCAACTGGCC GGCGGTGTTA ACCTGGCCCG CGACCCCCGC
AATGGCCGCA ATTTCGAATA TCTGGGCGAG GATCCGCTGC TGGCCGGAAC CCTCGCTGGG
GAGTCGATCC GGGGCATCGA GGCCCAGAAC GTGGTGTCGA CGGTCAAACA CTTCGCGTTG
AACGGCCAGG AGACCAACCG CCACTGGGCC AATTCGGTGA TCGAGGAGGG CGCGCACCGA
GAGAGCGACC TGCTGGCCTT CCAGATCGCC ATCGAGAAGG GCCAGCCCGG CTCGGTGATG
TGCGCCTACA ACCTCGTCAA CGGCGACTAT AGCTGCGGCA ACGACCATCT GCTGAACGGC
GTCCTGAAGG GCGACTGGGC CTACAAGGGT TGGGTGATGT CGGACTGGGG CGCGGTCCAC
GCCATGGACT ACGCCGTCAA GGGGCTGGAT CAGCAGTCGG GCGAACAACT GGACGACCAA
GTCTGGTTCG GCGCGCCGCT TAAGGCCGCC GTGGAGAAAG GCGAGGTCCC GGCCGCGCGC
CTGTCGGGCA TGACTCGCCG CATCCTGCGC TCGATGTTCG CCGCCGGCCT GTTCGACGTT
CCGGCCGCCA ACAGCGAGAT CGACTATGCC GCCAACGCGG AGGTCGCGCG CAAGGTCGCG
CAGGAAGGCA TCGTCCTGCT GAAGAACCGC AACGGTCTGC TGCCCCTGGC CAAGAGCGCC
AGGACCATCG CCGTGATCGG CGGCCATGCC GACGCGGGCG TGCTGTCGGG CGGCGGGTCG
TCCCAGGTGA TCGCGCCCGG CGGCGCCAAG GTGTCCATAC CGCTGGGCGG GGAGGGGCAG
ATGGCCGCCT TCCGCAACCA GATCTTCCAC CCGTCCGCGC CCCTGGCCGC TATCCGCGCC
GAAGCGCCGG GGGCCAAGGT CACGTTCGAT GACGGCCGCT ACATCGCCTC CGCCGTGGCC
GCCGCTAGGG CTGCCGACGT GGTCATCGTA TTCGGCAACC AGTGGATGGG CGAGGGCGAG
GACGCGCCGG ATCTGTCGCT CCCCCAAGGC CAGGACGCCG TGATCGCGGC GGTGATCGCC
GCCAATCCCA ACAGCATCGT GGTGCTGCAG ACCGGCGGTC CGGTGACGAT GCCCTGGCTG
GATCAGGCCG GCGCCGTGGT CGAAGCCTGG TACTCGGGCG CCAAGGGCGG CGAGGCCATC
GCCGACGTGC TGTTCGGCGA GGTCGACGCC TCGGGCCGCC TGCCAGTGAC CTTCCCGGTG
TCGATCGAGC AGTACCCGCG CGTCGCGATG CCGGGGCTTG GCCTGCCCGA GAAGACTCAG
TTCGACGTGG TCTATGACGA GGGGGCGGAC GTCGGCTATC GCCGCTTCGC GGCCACCGGC
CAGAAGCCGC TGTTCCCGTT CGGCCATGGC CTGTCCTATA CGACGTTCAG CTACGCCAAC
CTGAAGGTGA CGGGCGGCGA TGCGCTGACG GTCGGCTTCG ATGTGACCAA TATCGGTCGC
CGACCCGGCA AGGACGCGCC GCAGGTCTAT CTGACCGGCG CAGCCGGTAA GCCGCTACAG
CGGCTGATCG GCTTCGAGAA GCTCGAGCTG AAGCCGGGCG AGACCCGCCG CGTCAGCATC
ACCGCCGATC CGCGCCTGCT GGGCGGCTTC GACGTCGCTA GCAAGCGATG GTCGCTGAAG
GCTGGAGACT ATCAGGTCGC TGTTGGCGCC TCGTCCGCCA ATCTGATGCT GAACGGCGTC
GCGAAGATTA GGGCCCGGAG CATCAAGCCG TAA
 
Protein sequence
MHRSLTTAAS VAVLVFATAA SAQAVKPWMN TKLSPDERAR LLDAELTLDE RIGLVHGPMA 
MPYKIKTIPK GAIGSAGYIA GVERLGIPAL QESDASLGVT NPMGMRLGDG ATALPSGLAL
AATFNAKLAY DGGALVGREA RAKGFNVQLA GGVNLARDPR NGRNFEYLGE DPLLAGTLAG
ESIRGIEAQN VVSTVKHFAL NGQETNRHWA NSVIEEGAHR ESDLLAFQIA IEKGQPGSVM
CAYNLVNGDY SCGNDHLLNG VLKGDWAYKG WVMSDWGAVH AMDYAVKGLD QQSGEQLDDQ
VWFGAPLKAA VEKGEVPAAR LSGMTRRILR SMFAAGLFDV PAANSEIDYA ANAEVARKVA
QEGIVLLKNR NGLLPLAKSA RTIAVIGGHA DAGVLSGGGS SQVIAPGGAK VSIPLGGEGQ
MAAFRNQIFH PSAPLAAIRA EAPGAKVTFD DGRYIASAVA AARAADVVIV FGNQWMGEGE
DAPDLSLPQG QDAVIAAVIA ANPNSIVVLQ TGGPVTMPWL DQAGAVVEAW YSGAKGGEAI
ADVLFGEVDA SGRLPVTFPV SIEQYPRVAM PGLGLPEKTQ FDVVYDEGAD VGYRRFAATG
QKPLFPFGHG LSYTTFSYAN LKVTGGDALT VGFDVTNIGR RPGKDAPQVY LTGAAGKPLQ
RLIGFEKLEL KPGETRRVSI TADPRLLGGF DVASKRWSLK AGDYQVAVGA SSANLMLNGV
AKIRARSIKP