Gene Caul_4086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4086 
Symbol 
ID5901548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4431428 
End bp4433761 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content70% 
IMG OID641564606 
Productglucan 1,4-alpha-glucosidase 
Protein accessionYP_001685708 
Protein GI167648045 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID[TIGR01535] glucan 1,4-alpha-glucosidase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGCC TGAAGATCCT GGCCCTGTCC TCGACCGCCG CCGTCGCCCT GGCGGGCGCG 
GCCCGCGCCG AAGCTCCGAC CTCCTGGGCC TATGCCGCCA AGACCGGGGT CGGCGCGTCC
TACGAGGCCT ATGTCGACGG CGCCTACAAG GACGGCGGGC CGACCGGTCC GGTGTCGAAG
GTCTGGTTCT CGATCGCCGA CGGAACCCTG ACCGAGACCA TGTACGGCCT GATCCACGAG
GCCCAGATCA AGCAGATGCG CGTGGCGGTG AAGACCGCGA CCGGCCTGGC CGTCGAGGGC
GCTGACACTA CATCGAAGAC CGAGTACCTG CACGTCGACG CCGCCGGCCG CCCGCTGTCG
CCGGCCTACA AGGTGACCAC CACCGACAAG CAGGGCCGCT TCCAGATCGA GAAGCGGATC
TTCACCGACC CCGACCACAA CAGTCTGTTC GTGCGGGTCA CCGTCCGCGC CCTGAAGGGG
CCGATCACGC CGTTCCTCGT GCTGGAGCCC CACATGGCCA ACACCGGCGG CGGCGATGTC
GGCTCGGCCG GCGGCGGGGC GCTGACGGCC CATGAGGGCA AGTTCTTCCT CAGCCTCAAG
GGCCAGCGAT CGTTCGTCAA GGCGGCCGCC GCGCCGCTGA AGGACGGCGA CGCCCTGGCG
ATCTTCAAGG ACGGCGCCTT GGTCGGCGCG GCCGAGGCCA AGGGCGCCAT CGTCCTGGCC
GGCCAGCTTC CGACCCAGGC CTCGGGCGAG GCGACCTACG ACTTCGTCAT CGGCTTTGGC
GACAGCATCG GCGCCGCCGA CAGGGCCGCG TCGGCCACGC TGAGCACGGG CTACGCCGAA
GTGCTGGCCC GCTACAACGG CGAGGGCGAC CGCGTGGGCT GGGAGGACTA TCTGGCCTCG
CTGACCGAGC TGCCGCGCCT GCGCAAGGCC TCGGAGGATG GCGGCAAGCT GGTCCAGGCC
AGCGCCCTGA TGCTGAAGGT GCAGGAAGAT CGCACCTATG CCGGGGCCCT GATCGCCTCG
CTGTCCAATC CCTGGGGCGA CACGGTGGAC GCCTCCAAGC CATCGACCGG CTACAAGGCC
GTCTGGCCGC GCGACTTCTA CCAGTGCGCC ATGGCCCTGG CGGCCCTGGG CGACAAGCAG
ACGCCGCTGG CCGCCTTCCA CTATCTGCCG CGGGTCCAGG TCAAGGCGAC CACGCCGGGC
AATACCGGGG TCGGCGGCTG GTTCCTGCAG AAGACCTGGG TGGACGGCAC CCCCGAATGG
GTCGGCGTCC AGCTGGACCA GACCGCCATG CCGATCATGC TGGGTTGGAA GCTGTGGAAG
CTGGGCTGGC TGCCCGAGGC CGACCTGAAG ACCTACTATG GCAAGATGAT CAAGCCGGCC
GCCGACTTCC TGGTCGATGG GGGCAAGGTC GGGGTTGGTT GGAACCACGA GACGATCAAG
CCGCCCTTTA CCCAGCAGGA GCGCTGGGAA GAGCAGGGCG GCTATTCGCC CTCGACCACG
GCGGCGACCA TCGCCGGCCT GGTGGTGGCG GGCGACATCG CCGAGCTGGC GGGCGACACG
GACGGCGCGG CCCGCTACCA CGCCACGGCC GACGCCTATT CGGCCAAGGT CGAGGCCCGG
ATGGTCACCA CCAAGGGACC GTTCGGCGAC GGGACCTACT ATGTGCGCCT CAACCAGAAC
GAGGATCCCA ACGACCACGC CCCGATCGGC GCCGCCAACG GCCAGATCGC CCCGCCCAAG
GACCAGGTGG TCGATGGCGG CTTCCTGGAG CTGGTCCGCT ACGGCGTGCG CCGGGCCGAC
GATCCGGCCA TCGTCGGCAG CCTCCCGGAG CTGGACGACA CCACGCGGGC CGACCTCTAT
CGCGTCCGTT ACGACTTCAC CTTCCCGAGC GTGAAGGGCG ACTATCCGGG CTGGCGGCGC
TACGACGTCG ACGGCTATGG CGAGGACGCC AAGACCGGGG CCAACTACGG CGTGGGCGGC
CAGATGAGCC CGGGCCAGCG CGGCCGGGTC TGGCCGATCT TCACCGGCGA ACGCGGCCAC
TACGAGCTGG CGCTGGCCAG CTTGCACGGC AAGCCGAGCG CGGCGGCCGT GCGGCGGATC
CGCGACCGCT ACGTCAAGGC CATGGAGCTG TTCGCCAATG ACGGCCTGCT GATTTCCGAA
CAGGTCTGGG ACGGCGTCGG ACAAAACCCG CGCGGCTATG AACGCGGCGA GGGCACGGAC
TCGGCCACCC CCCTGGCCTG GTCGCACGCC GAATACGTCA AGCTGCTGCG CTCGGTCAGC
GACGGCGAGG TGTGGGACCG CTATGCGCCG GTGGCGGCGC GCTACGCGAA GTAG
 
Protein sequence
MRCLKILALS STAAVALAGA ARAEAPTSWA YAAKTGVGAS YEAYVDGAYK DGGPTGPVSK 
VWFSIADGTL TETMYGLIHE AQIKQMRVAV KTATGLAVEG ADTTSKTEYL HVDAAGRPLS
PAYKVTTTDK QGRFQIEKRI FTDPDHNSLF VRVTVRALKG PITPFLVLEP HMANTGGGDV
GSAGGGALTA HEGKFFLSLK GQRSFVKAAA APLKDGDALA IFKDGALVGA AEAKGAIVLA
GQLPTQASGE ATYDFVIGFG DSIGAADRAA SATLSTGYAE VLARYNGEGD RVGWEDYLAS
LTELPRLRKA SEDGGKLVQA SALMLKVQED RTYAGALIAS LSNPWGDTVD ASKPSTGYKA
VWPRDFYQCA MALAALGDKQ TPLAAFHYLP RVQVKATTPG NTGVGGWFLQ KTWVDGTPEW
VGVQLDQTAM PIMLGWKLWK LGWLPEADLK TYYGKMIKPA ADFLVDGGKV GVGWNHETIK
PPFTQQERWE EQGGYSPSTT AATIAGLVVA GDIAELAGDT DGAARYHATA DAYSAKVEAR
MVTTKGPFGD GTYYVRLNQN EDPNDHAPIG AANGQIAPPK DQVVDGGFLE LVRYGVRRAD
DPAIVGSLPE LDDTTRADLY RVRYDFTFPS VKGDYPGWRR YDVDGYGEDA KTGANYGVGG
QMSPGQRGRV WPIFTGERGH YELALASLHG KPSAAAVRRI RDRYVKAMEL FANDGLLISE
QVWDGVGQNP RGYERGEGTD SATPLAWSHA EYVKLLRSVS DGEVWDRYAP VAARYAK