Gene Caul_3972 details


Gene Information

Locus tag: Caul_3972
Symbol:
ID: 5901434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4301584
End bp: 4303875
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 71%
IMG OID: 641564493
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001685595
Protein GI: 167647932
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
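
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information table.
START_BP = 4301584
END_BP = 4303875
GENE_LENGTH_BP = 2292
PROTEIN_LENGTH_AA = 763


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS with no frameshift, as the record states
    ("Is gene spliced: No").
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints: True True
```

Both checks pass for this record: 4303875 - 4301584 + 1 = 2292 bp, and 2292 / 3 - 1 = 763 aa.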


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.471941
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.534128
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGACG GGCGACACGG TCCGGAGCGA CGGGGCGTGG ACCGGCGGGC GGTCCTGGCC 
GGCGCGACGG GCCTGGCGGG CTTCGCCCTG GCCGGAGCGG GCGCGGCCCG CGCGGCCTCG
CCCCGGGTGG AAGCCCTGAT CGCCCAGATG ACCCTGGAGG AGAAGGCCGG CCAGCTGTCG
TGCTATTCCG ACATGATCCG GCCGCCGGTC GGCGACATCA ATCCGCTGGT CAACCAGCGC
AACACCCAGC AGATCCTGGC CGACACCCGG GCGGGCCGCA TCGGGGTGCT GATGAACGGG
ATCGGGGTCG AGGGCGCCCT GCTGGCCCAG ACCGCCGCCG TCGAGCATTC GCGACTGCGC
ATCCCGCTGC TGTTCGCGGC CGACGTGATC CACGGCTTCA AGACCGTGTA CCCGATCCCG
CTGGGCGAAT CGGCCAGCTT CGACCCGACC CTCGCCGAGC GCACGGCGCG GGCCGCGGCG
ATCGAGGCCT CGTCGCACGG CCTGCACTGG ACCTTCGCCC CGATGGTCGA CGTGGCCCGC
GACCAGCGCT GGGGGCGGGG CGCCGAGGGC TCTGGCGAGG ACGTGTTCCT GGGCGAGGTG
ATGGCCCAGG CCCGGGTGCG CGGCTTCCAG GGCGGCGACC TGACCGCCGC CGACAGCCTG
CTGTCGACCG CCAAGCACTT CGCCGCCTAC GGGGCGGTGA CGGCGGGCCT GGACTATAAC
ACCGTCGACA TTTCCGAGGA GACCCTGCGC GAGATCCACC TGCCGCCGTT CAAGGCCGCC
TTCGACGCCG GCTGCCTGGC GGTGATGTCG GCCTTCAATG ACATCAATGG CGTGCCCGCC
ACGGCCAACA AGCACCTGCT GACCGACATC CTGCGCGGCG AGTGGAATTT CCGGGGCGTG
GTGATCTCGG ACTACACCGC CGACCAGGAA CTGGTGGCCC ACGGCTTCGC CGCCGACGAC
AAGGACGCCG CCCGCCTGGC GATCCTGGCC GGGGTCGATA TCAGCATGCA GAGTGGGCTC
TACAGCCGCT ACCTGCCCGA ACTGGTCGCC GAGGGGCTGG TCCCGATGGC CACGGTCGAC
ACCGCCGTGC GCCGGGTGCT GGGCTTGAAG GAAGCGCTGG GTCTGTTCGA CCGGCCGTTC
CGCTCGATCG ACCCCAAGGC CCAGGCCGCC AACACCGCCA CCCCGGCCAT GCGCGCCCTC
TCGCGCGAGG CGGGCGGCAA GTCGATCGTG CTGCTGCGCA ATGACGGCGG CCTGCTGCCC
CTGCCCAACG CGGGCAAGAC CATCGCCCTG ATCGGTCCGT TCGCCGAGGA CCGCGACAAC
ATCCTGGGAC CCTGGGCCTT CTTCGGCGAC AAGGCCCTGG GGGTCGACCT GGCGACCGGG
ATCCGCGAGG CGATGGCCGA CCCGTCGCGG CTGATCGTGG CCCGCGGTTG CGACGTCGAG
ACCGTCATCC CCGGCGGCTA TGACCAGGCC ATCGCCGCCG CCCGGGCCGC CGACGTGGTG
CTGCTGGCGG TCGGCGAGAG CCAGAACATG TCCGGCGAAG CCCAGTCGCG CACCGAGATC
AGCCTGCCGC GCGTCCAGCA GCAGTTGGCA GAGTGGGTGG CGTCGGTGGG CAAGCCGACG
GTGGTGCTGC TGCGCCATGG CCGCGCCCTG GTGCTGGAAG GGGCGGTCAA GGCCGCGCCG
GCCATCCTGG CCACCTGGTT CCTTGGCAGC GAGACCGGCC ACGCCGTGGC CGACGTGCTG
TTCGGCGAGG TCAATCCCTC GGCCCGCCTG CCGGTCAGCT TCCCGCACGA GAGCGGCCAG
GAACCGTTCG CCTACAACCA TCGCACCACC GGCCGTCCCG CGCCCCAGGC TGACGACAGC
CAGGAGTACA AGGCCCGCTG GCGCACCACC CGCAACGAGG CGCTCTATCC GTTCGGCCAC
GGGCTGTCGT ACACCAGCTT CGCGCTCAGC GACGTCAGGC TGTCGACCAC CCGCCTGGGC
TGGAACGAGA AGCTCCATGT CACGGTCAAT GTCGCCAACA CCGGCAAGGT CGCCGGCGAG
CACGTCTTGC AGCTCTATGT CCGCGACCGG GTGGCCAGCC GCACCCGGCC GGTGCGCGAG
CTCAAGCGCT TCCTGCGCGT GGCCCTGAAG CCCGGCGAGC GGCGCGACGT GCGCTTCAGC
CTGGAGCGCG ACTCGCTGAT GTTCGTCGGC GACGACGACC GCTGGCTCGC CGAGCCGGGC
ATGTTCGACC TGTGGGTGGC CAACAGCGCC GCCGACGGAC TGGCGGCGAG CTTTGAGCTG
TTGGGGGCTT AA
 
Protein sequence
MGDGRHGPER RGVDRRAVLA GATGLAGFAL AGAGAARAAS PRVEALIAQM TLEEKAGQLS 
CYSDMIRPPV GDINPLVNQR NTQQILADTR AGRIGVLMNG IGVEGALLAQ TAAVEHSRLR
IPLLFAADVI HGFKTVYPIP LGESASFDPT LAERTARAAA IEASSHGLHW TFAPMVDVAR
DQRWGRGAEG SGEDVFLGEV MAQARVRGFQ GGDLTAADSL LSTAKHFAAY GAVTAGLDYN
TVDISEETLR EIHLPPFKAA FDAGCLAVMS AFNDINGVPA TANKHLLTDI LRGEWNFRGV
VISDYTADQE LVAHGFAADD KDAARLAILA GVDISMQSGL YSRYLPELVA EGLVPMATVD
TAVRRVLGLK EALGLFDRPF RSIDPKAQAA NTATPAMRAL SREAGGKSIV LLRNDGGLLP
LPNAGKTIAL IGPFAEDRDN ILGPWAFFGD KALGVDLATG IREAMADPSR LIVARGCDVE
TVIPGGYDQA IAAARAADVV LLAVGESQNM SGEAQSRTEI SLPRVQQQLA EWVASVGKPT
VVLLRHGRAL VLEGAVKAAP AILATWFLGS ETGHAVADVL FGEVNPSARL PVSFPHESGQ
EPFAYNHRTT GRPAPQADDS QEYKARWRTT RNEALYPFGH GLSYTSFALS DVRLSTTRLG
WNEKLHVTVN VANTGKVAGE HVLQLYVRDR VASRTRPVRE LKRFLRVALK PGERRDVRFS
LERDSLMFVG DDDRWLAEPG MFDLWVANSA ADGLAASFEL LGA
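
The gene and protein sequences above can be checked against each other by translating the nucleotide blocks. The following is a rough sketch, not the database's own method: it builds the standard codon assignments (which translation table 11 shares for internal codons) and strips the spaces used to group bases in tens. The helper names `clean`, `translate`, and `gc_content` are hypothetical. The sketch does not handle the alternative start codons that table 11 allows, which is not an issue here because this gene begins with ATG.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGGTGACG GGCGACACGG TCCGGAGCGA CGGGGCGTGG ACCGGCGGGC GGTCCTGGCC"
last_line = "TTGGGGGCTT AA"

print(translate(first_line))  # prints: MGDGRHGPERRGVDRRAVLA
print(translate(last_line))   # prints: LGA
```

The two outputs match the first 20 and last 3 residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the 71% GC content listed in the record.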