Gene Caul_0315 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0315 
Symbol 
ID5897589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp355491 
End bp357575 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content66% 
IMG OID641560799 
Productglycoside hydrolase clan GH-D 
Protein accessionYP_001681950 
Protein GI167644287 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.648797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCACCA AACCGCTAGT AAACCGGCGT GGCATGCTGG CCCTTTTCGG CGGGACCGTC 
CTGGCCACCG GAGCGCTTGG CGCGAGCCGG GCGTGGGCCG CGGTGTCCGT GGCGCGCGTG
GGCGACGCGG CCCTGACCCT TGAGTTTGAC GCCGCCCTGA ACAGCCGGCT GATCTCCAAC
CTGGCCGGAC AAGCCGTCGC CCTGACCGAC TTCGCGCCGT CCGAAACGGT GACGCTCAAG
GGCGGCCAGG TGGTCGATCG CTTCACCTTC GCCGAACACC GCCAGGCCCC CGTCAGCGAC
GCGCATGGCA AGGGAATTCT CCACACCGTG CGCGGCGTCT CGGTCGAGGG TCTGGAAAAG
ACCGCCAACC TGACCTTCTA CGACCGTTAT CCGGGCTTTG CCTTGATGAA GGTGAGTTGG
CGGCACACGG GCGGCGAGCC GCTCAGCGTC GAGCGCTACC GGATCGGCGG GCACGTGCTG
AAGTCGTCGG GCGCGGGCTT CTGGTCGTGG TCTGGCTCGA CCCATTCCGA CCGCCGCGAC
TGGGTGCAGC CGGTGAAGGA AGGCTTTGAG CAAGCCAACT TCATGGGCAT GAACGCCTCC
GACTACGGAT CCGGCACGCC GGTCGTCGAC GTCTGGCGGC CGGACGCCGG CATCGCGGTG
GGCCATGTCG AACTGACGCC GAAGCTGGTC TCGCTGCCGA TCATCGCGTC ATCGACCGGC
GCGTCGGTGG CGCTGGAGGA GGCGAGGGCG GTGACGCTGA AGCCCGGCGA CGCCTTGGTC
TCGCTCGAAG CCTTTGTCGC TGTCCACCAG CGCGACTACT ACGCCACGCT CGACACCTAT
CGCCGCGTGA TGGCCGACCG CGGCCTGGCG GCCCCCAAGG CGCCAGAAGG CTCCTACGAG
GCCATCTGGT GCGCCTGGGG CTATGAGCGC GACTTCACTG TCGCCGAAAT CGAAGGCACG
TACAGCAAGG TCAAGGACCT GGGCTTCAAG TGGGCCGTCC TTGACGACGG CTGGCAGACC
AACGAGGGCG ACTGGGCGCT GGACCCGCGC AAGTTCCGCA CCGAGGCGGA CATGGTCGCC
TTCGTCAAGG CGATCCGGGC GGCGGGCCTC AAGCCCAAGC TGTGGCTGGC GCCCCTGGCC
GTCGATCCGG GCTCGGACCT GCTGCACGAT CACACCGACA TGCTGTTGCT GGACGCCGAC
GGCGCGGTGC AGAACGTGAC CTGGTGGAAC GCCTTCTATC TCTGCCCCGC CTACCAGCCG
ACGCGGGACT ATACCCAGAA ACTGGTGCGG AAGATCATCG GCGAGTGGGG CTATGAGGGC
CTCAAGCTCG ACGGCCAGCA CATGAATGGC GTCGCCCCCT GCTACAATCC GGCTCACCAT
CATGCGCGCC CGGAGGAGTC CGTCGAAAAG CTCCAGGACT TCTGGAAGCT GGTCTATGAC
ACCGCGCGAG AGGTCAATCC CAACGCCGTG GTCGAGTTCT GCCCCTGCGG CACCTCCTAC
GCGTTCCACA ATCTGCCCTA CACCAACCAG GTCCCGGCCT CGGATCCGCT GTCGTCGTGG
CAGGTGCGCC TGAAGGGCAA GTCGCTGAAG GCCCTGATGG GACGTGACGC GCCCTATGCC
GGCGACCATG TCGAACTGAG CGATGGCGGC AACGACTTCG CCTCGAGTGT CGGCATCGGC
GCGGTCATCT CGACCAAGTT CACCTGGCCC AACGAAGGCC GGGGCAAGGA CGCCAATTTC
CGCCTGACGC CCGAGCGCGA GGCGATGTGG CGCAAGTGGG CCGACCTCTA TAACGCCAAG
ATGCTGTCCA AGGGCCAGTA CCGCGGCGAG CTCTACGACA TCGGCTTCGA CAAGCCCGAG
GCCCACGCGA TCGAGAAGAG CGGCCGGCTC TACTACGCCT TCTACGCCCC GCAATGGGAT
GGACAGGTCG AACTGCGTGG CCTGGGCGAG GGCGCCTACC GCCTGACCGA CTATTACAAT
AGCCGGGATC TGGGCGTCGT GACCGCCAAG ACCGCCCGCA TCCCCGCCAA GTTCGAGAAG
TTCCTGCTGA TCGAAGCCGT CCCCACGAAA GGCGGCCGCG CATGA
 
Protein sequence
MITKPLVNRR GMLALFGGTV LATGALGASR AWAAVSVARV GDAALTLEFD AALNSRLISN 
LAGQAVALTD FAPSETVTLK GGQVVDRFTF AEHRQAPVSD AHGKGILHTV RGVSVEGLEK
TANLTFYDRY PGFALMKVSW RHTGGEPLSV ERYRIGGHVL KSSGAGFWSW SGSTHSDRRD
WVQPVKEGFE QANFMGMNAS DYGSGTPVVD VWRPDAGIAV GHVELTPKLV SLPIIASSTG
ASVALEEARA VTLKPGDALV SLEAFVAVHQ RDYYATLDTY RRVMADRGLA APKAPEGSYE
AIWCAWGYER DFTVAEIEGT YSKVKDLGFK WAVLDDGWQT NEGDWALDPR KFRTEADMVA
FVKAIRAAGL KPKLWLAPLA VDPGSDLLHD HTDMLLLDAD GAVQNVTWWN AFYLCPAYQP
TRDYTQKLVR KIIGEWGYEG LKLDGQHMNG VAPCYNPAHH HARPEESVEK LQDFWKLVYD
TAREVNPNAV VEFCPCGTSY AFHNLPYTNQ VPASDPLSSW QVRLKGKSLK ALMGRDAPYA
GDHVELSDGG NDFASSVGIG AVISTKFTWP NEGRGKDANF RLTPEREAMW RKWADLYNAK
MLSKGQYRGE LYDIGFDKPE AHAIEKSGRL YYAFYAPQWD GQVELRGLGE GAYRLTDYYN
SRDLGVVTAK TARIPAKFEK FLLIEAVPTK GGRA