Gene Caul_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1302 
Symbol 
ID5898757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1374661 
End bp1376283 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content67% 
IMG OID641561787 
Productglycoside hydrolase family alpha-L-fucosidase 
Protein accessionYP_001682930 
Protein GI167645267 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.186001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCA ATCGCCGCGA CCTTCTGAAC CTCGGGGCCG GCCTGGCCGC CGCCTCGGCC 
ACGCCCGCCT TCGCCGACGC CAGACAAGCC TTCCAGCCGA GTTGGGAGTC GCTGGCCGAG
GGCTACAAGA CCCCTGATTG GTTCCGCGAC GCCAAGCTCG GCGTCTGGTC GCACTGGGGT
CCGCAATGCG TGCCCGAATA CGGCGACTGG TACGGCCGCC AGATGTATAT CCAGGGCAAC
GGCGTCTACG AGCACCACGT CAAGACCTAC GGCCACCCGA CGACGTTCGG CTTCATGGAG
CTGATCAACC GCTGGAAGGC CGAGCGCTGG GATCCCGAAG GCCTGATGAA GACGTACCAG
GCCGCCGGGG CCAAGTACTT CATGTCGATG GCCAACCACC ACGACAACCT CGACATGTTC
GCCAGCCGCC ACCACGCGTG GAATACGCTG CGCGTCGGTC CAGGGCGGGA TATTGTCGGG
ACCTGGGAGA AGGTCGCCCG CGCCCACGGC ATGCGGTTCG GCGTCTCCAA CCATTCGGCC
CATGCCTGGC ACTGGTGGCA GACGGCCTAT GGCTACGACG CCGAGGGGCC GCTGAAGGGC
CAGCGCTACG ACGCCTATCG GCTGACCAAG GCCGACGGCC AGGGCAAGTG GTGGCAGGGC
CTGGATCCTC AGGAGCTCTA TACCGGCCGC AACATGGTCA TCCCCGATGG GATGGGCGGC
ATCAAGGACG CCAACGCCTG GCACGACGCC CATGACGGCG AGTGGATCGA GACCGCGCCG
CCGAACAATC CAGGCTTCAC GGCCAGCTGG CTGGCTCGCC AAAACGACCT GGTGGAGCGC
TACAGGCCCG ACCTGGTCTA TTTCGACAAC TACACCCTGC CGCTGGGCCA GGCGGGCCTT
GCCGCCACGG CCCACTATTA CAACCAGGCG CGCGCCTGGC GCGGCGCCAG CGACGTGGTG
GTGACCGGCA AGAAGCTCAA CGCTCTCCAA CGCCGGGGCA TTGTCGAGGA CGTCGAGCGC
GGCTTCTCCG ACCGTCTACG GCCCGAGCCC TGGCAGACCG ACACCTGCAT CGGCAACTGG
CACTATGACC GCGGCCTCTA CGACCGCGAC GGCTACAAGA GCGCCAAGGA CGTGATCCAG
CGGCTGATCG ACGTGGTCAG CAAGAACGGC TGCCTTCTGG TCTCCATCCC CCAGCGCGGC
GACGGAACGA TCGACGACAA GGAAGAAAAG GTGCTGGCGG GCATGGCCGG CTGGATCGCC
GTCAACGGTC CGGCGATCTA CGCCTCGCGG CCGTGGAGGA TCTACGGCGA GGGTCCGACC
CGGCTGGTCG AGGGGATGCA GAACGAAGGC GACGCCAAGC CGTTCGAGGC GGCCGACATC
CGGTTCACGA CCCGGGGCGG TGACCTGTTC GCGCTGCCGA TGGCGTGGCC GGCGGGCGAA
CTGGTGATCG AGAGCCTGGC GACCTCCGGC CCGACAAGCG CTGGCGAGGT GCGGCGGGTG
GAACTTCTGG GCGGCGGGGA ACTGGCTTTC GTCCGCGACG GCAAGGGCTT GCGGGTGTCG
ATGCCGGATC AGCGGCCGAC GTTCACGCCG GTGGTGAGGA TTTCGGGACG GGGTTTGGTG
TAA
 
Protein sequence
MSLNRRDLLN LGAGLAAASA TPAFADARQA FQPSWESLAE GYKTPDWFRD AKLGVWSHWG 
PQCVPEYGDW YGRQMYIQGN GVYEHHVKTY GHPTTFGFME LINRWKAERW DPEGLMKTYQ
AAGAKYFMSM ANHHDNLDMF ASRHHAWNTL RVGPGRDIVG TWEKVARAHG MRFGVSNHSA
HAWHWWQTAY GYDAEGPLKG QRYDAYRLTK ADGQGKWWQG LDPQELYTGR NMVIPDGMGG
IKDANAWHDA HDGEWIETAP PNNPGFTASW LARQNDLVER YRPDLVYFDN YTLPLGQAGL
AATAHYYNQA RAWRGASDVV VTGKKLNALQ RRGIVEDVER GFSDRLRPEP WQTDTCIGNW
HYDRGLYDRD GYKSAKDVIQ RLIDVVSKNG CLLVSIPQRG DGTIDDKEEK VLAGMAGWIA
VNGPAIYASR PWRIYGEGPT RLVEGMQNEG DAKPFEAADI RFTTRGGDLF ALPMAWPAGE
LVIESLATSG PTSAGEVRRV ELLGGGELAF VRDGKGLRVS MPDQRPTFTP VVRISGRGLV