Gene Caul_5280 details

Gene Information

Locus tag: Caul_5280
Symbol:
ID: 5897438
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 221322
End bp: 222557
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 66%
IMG OID: 641555383
Product: glycoside hydrolase family protein
Protein accession: YP_001676714
Protein GI: 167621929
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID:
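
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the record; everything else is an illustrative assumption.
START_BP = 221322
END_BP = 222557


def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(START_BP, END_BP)
    print(length, protein_length_aa(length))  # 1236 411
```

Under these assumptions the computed values agree with the listed gene length (1236 bp) and protein length (411 aa).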


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0160415
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCGACCG ACGCCCGCCG CCCCGTTCCC GAAGGCTTTC TTTGGGGCAC CGCGATTTCA 
GCGCACCAGA GCGAAGGTCA AAACATCAAT TCCGACGCCT GGCTGTGCGA GACGGTCAAA
CCCAGCGTCT ACGCCCAGCC CTCGCTGGAC GCTTGCGACA GCTATCATCG CTACGCCGAG
GACATCGCCA TCGCCGCCGG GCTCGGCTTC AACTGCCACC GGATCGGCAT CGAGTGGGCC
AGGATCGAGC CGGAGTGCGG GGTCTTCTCG CTGGCGGCCC TCGATCACTA CCGTCGTGTT
CTGGAAGCCT GCCACGCGCG CGGGCTCAAG CCGATGGTCA CCTTCAACCA CTTCACCGTG
CCGCGCTGGT TCGCCGCCCG GGGCGGCTTT GAGGTCGCCG ACGGGGCCGA CCTCTTCGCC
CGGTTCGCCG CCAAGGCCAC CGAGCATCTG GGTGATCTGA TCAGCTACGC CACCACCTTC
AACGAAGCCA ATATCCAGCG TTTGGTGGCG CTGCTGCGCC GCGGCGCCGA CGCTCAAGGT
CCGATCGACG CGATGATCGC CGCCTGCGCC AAGGCCAGCG GCTCCGAGCG CTTCTCCTCG
GTCCTGTTCG CGCCCCTGGA GGCTTGCGAA CCTGTGATGC TGGACGCCCA TTTTAAGGCC
ACGGCGGCCA TGAAGGCTGG CCCGGGCGAC TTTCCTGTCG GCCTGACCCT GTCGATGCAA
GACGTCCAAG GGCAAGGCGA GGGCCATCTG GCCGAAGCGC TGATCCAGAT GCTCTATGGC
CCTTGGCTGG ACGCGGCGCG CCAAGCCGAC TTCATCGGCG TGCAAACCTA CACTCGGGTG
ATCGTCGGCC CACAGGGACG CGTGGCCCCG GCCAAAGACG CCGAAATGAC GGGGGCGGGG
TATGAATTCT ATCCGCAGGC CCTGGGCGGC ACTATCCGCC TGGCCCATGC GCGGATCGGC
AAGCCGATCT ACGTCACCGA GAGCGGCATC GCCACCCACG ACGACACCCG TCGCATCGCC
TATCTGGACC AGGCCCTGGC CGAGATCCGC CAGTGTCTGG ACGACGGCAT CGAGGTCAAA
AGCTTCATCT GTTGGTCGTT GCTGGACAAC TTCGAATGGA CCCGCGGCTA TGGCGAGCGC
TTTGGCCTGG TTCACGTCGA CTACGACACC TTCGAGCGCA CCCCCAAGCC CAGCGCCCAT
CACCTGGGCG CCATCGCTCG CGCGGGCGTG ATCTGA
 
Protein sequence
MPTDARRPVP EGFLWGTAIS AHQSEGQNIN SDAWLCETVK PSVYAQPSLD ACDSYHRYAE 
DIAIAAGLGF NCHRIGIEWA RIEPECGVFS LAALDHYRRV LEACHARGLK PMVTFNHFTV
PRWFAARGGF EVADGADLFA RFAAKATEHL GDLISYATTF NEANIQRLVA LLRRGADAQG
PIDAMIAACA KASGSERFSS VLFAPLEACE PVMLDAHFKA TAAMKAGPGD FPVGLTLSMQ
DVQGQGEGHL AEALIQMLYG PWLDAARQAD FIGVQTYTRV IVGPQGRVAP AKDAEMTGAG
YEFYPQALGG TIRLAHARIG KPIYVTESGI ATHDDTRRIA YLDQALAEIR QCLDDGIEVK
SFICWSLLDN FEWTRGYGER FGLVHVDYDT FERTPKPSAH HLGAIARAGV I
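
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted initiation codons and is read as methionine at the start position. The following is a minimal sketch of that translation step, not the database's own pipeline: the function name `translate_cds` is hypothetical, and the codon table is written out by hand in the NCBI TCAG ordering rather than taken from a library.

```python
# Amino acids for all 64 codons in NCBI ordering (bases T, C, A, G).
# Table 11 shares these sense-codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; an initiation codon in the first position becomes M."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate_cds("TTGCCGACCG ACGCCCGCCG CCCCGTTCCC"))  # MPTDARRPVP
```

The output for the first ten codons matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should, under the same assumptions, reproduce the full protein sequence.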