Gene Caul_4021 details


Gene Information

Locus tag: Caul_4021
Symbol:
ID: 5901483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4356358
End bp: 4357971
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 68%
IMG OID: 641564542
Product: glycoside hydrolase family protein
Protein accession: YP_001685644
Protein GI: 167647981
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
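
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4356358
END_BP = 4357971


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Amino-acid count of a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1614
print(protein_length(length_bp))  # 537
```

Both results agree with the Gene Length (1614 bp) and Protein Length (537 aa) fields listed above.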


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.701147
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCAGA TCAGCCGCCG CGGAGCCCTG GGCTCTCTGC TGACCGGCGC GGCTGTCGCC 
GCTGTTCCCG CGGCCGCCGA AGCTCAGCCC GGCGCCCTGC CCGTCCGCGC CGCGGCCTCG
CCCTGGGCCA AGGGCGTCGA AGGTCAGCGC AAGGCCGACC TGGGGAACGG GAGGTTCCTC
AATCCCATCC TGGCCGGCGA CCATCCCGAC CCGTCGATCC TCAAGGACGG CGAGGTCTAT
TACATGACCC ACTCGTCGTT CGACGCCTAT CCGGGCCTGC TGATCTGGCG CTCGACCGAC
CTGGTCAACT GGACCCCGGT CGTCGCCGCC CTGAAGACCA ATGTCGGCTC GATCTGGGCG
CCCGAGCTCT GCAAGCACCA GGGCCGCTAC TACATCTACC TGCCGGCCAA ATATCCCGAC
CACAACACCA GCTACGTGAT CTGGGCCGAC AGGATCGAGG GGCCGTGGAG CGAGCCGGTC
GACCTGAAGC TGCCGCGCTA TATCGACCCC GGCCACGTGG TCGACGAGCA TGGCGTGCGC
TGGCTGTTCC TGTCGGGCGG CGACCGCATC CAGCTGGCGC CCGACGGCCT GTCGACGGTC
GGCAAGCCCG AGCACGTCTA TGATCCTTGG CGCTATCCGG ACGACTGGGA CGTAGAGGGC
TTCTCGCCCG AGGGTCCCAA GGTGATGAAG CGCGGCGACT ACTATTACCT GGTCACCGCC
GTCGGCGGCA CGGCCGGCCC GCCGACCGGC CACATGGTCA TCGTCGCCCG CGCCAAGTCG
CTGGCCGGCC CGTGGGAGGA TGACCCGAAG AACCCCGTCG TCCGCACGAC CAACAACAGC
GAGACGTGGT GGTCGCGCGG CCACGCCACC CTGGTCGAGG GTCCGGCCGG CGACTGGTGG
ACCGTCTATC ACGGCTACGA AAACGGCTTC TACACCCTGG GCCGCCAGAC CCTGCTGGCC
CCGGTGACCT GGACCAAGGA CGGCTGGTTC GAGGTCGGCG GCGGCGACTT GTCTCGCCCC
CTCGCCAAAC CCAAGGGGGG CAAGGCCGGA CCGCATGGCC TGGCCCTCTC CGACGACTTC
ACGACCGACA AGGTCGGCGT CCAGTGGAAC TTCTTCGACC CCAAGCCGGG CGAGCACGAA
CGGCTGACCC GGGCTGGCGG GGTGATGACC TTGAAAGGCG CGGGCGAGGC CCCCTCGACC
GGCGCGCCGC TGATCTTCGT CAATGGCGAC CAGACCTATG AGATCGAGTG CGAGATCGAG
GTGGATCCCG ACACCCGCGC CGGCCTGATC CTGTTCTACG ACCGCCAGCT CTATTGCGGG
TTGGGGTTCG ACGCGAAGGC CTTTGTCACC CACCAGTATG GCATCGAGCG CGGCCGGCCG
GCCAATCCGC ATGGCGCGAA GATGCTGATG CGGCTGAGGA ACGACCGCCA CATCGTCAGC
TTCCACACCA GCGGGGACGG CGGGGTCACC TGGAAGCGCT TCGACCGCGG CATGGAGGTC
TCGGGCTACC ACCACAATGT GCGCGGCGGC TTCCTGATGC TGAAGCCGGG CCTCTACGCC
GCCGGCAAGG GCTCGGCGCG GTTCAAGGGG TTCAAGTACC GGGCTCTGGC ATAG
 
Protein sequence
MVQISRRGAL GSLLTGAAVA AVPAAAEAQP GALPVRAAAS PWAKGVEGQR KADLGNGRFL 
NPILAGDHPD PSILKDGEVY YMTHSSFDAY PGLLIWRSTD LVNWTPVVAA LKTNVGSIWA
PELCKHQGRY YIYLPAKYPD HNTSYVIWAD RIEGPWSEPV DLKLPRYIDP GHVVDEHGVR
WLFLSGGDRI QLAPDGLSTV GKPEHVYDPW RYPDDWDVEG FSPEGPKVMK RGDYYYLVTA
VGGTAGPPTG HMVIVARAKS LAGPWEDDPK NPVVRTTNNS ETWWSRGHAT LVEGPAGDWW
TVYHGYENGF YTLGRQTLLA PVTWTKDGWF EVGGGDLSRP LAKPKGGKAG PHGLALSDDF
TTDKVGVQWN FFDPKPGEHE RLTRAGGVMT LKGAGEAPST GAPLIFVNGD QTYEIECEIE
VDPDTRAGLI LFYDRQLYCG LGFDAKAFVT HQYGIERGRP ANPHGAKMLM RLRNDRHIVS
FHTSGDGGVT WKRFDRGMEV SGYHHNVRGG FLMLKPGLYA AGKGSARFKG FKYRALA
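
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch, assuming the text has been copied as-is from this record; `clean_sequence` and `gc_percent` are hypothetical helper names. It shows how the blocks could be merged and how a GC percentage like the one in the Gene Information section could be recomputed.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used here as a short example.
first_line = "ATGGTTCAGA TCAGCCGCCG CGGAGCCCTG GGCTCTCTGC TGACCGGCGC GGCTGTCGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_percent` gives the value for the whole gene rather than for this one line.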