Gene Caul_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1541 
Symbol 
ID5898996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1632214 
End bp1633764 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content63% 
IMG OID641562029 
ProductAlpha-L-fucosidase 
Protein accessionYP_001683169 
Protein GI167645506 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3669] Alpha-L-fucosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.664624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGT TTAGCAAGCG CGAGATTTTG TACGGACCAC TTCTGGCGCT CGGCGCCGGG 
GCGCTGGGCC GGGCCGGCCC CACGGCGGCG GCGGCGGCGC CAACGATCGA ACGGGACATG
GCGCGCGGCC CGTTCAATAC GACACAGGAG TCGCTCGAGA CCTACCGCAC GCCCGATTGG
TTTAGAGACG CGAAGTTCGG CATCTGGGCC CACTGGGGTC CCCAGGCGGT GCCGCGCCAG
GGCGATTGGT ACGCGCGTTG GCTGTATGTG CCAGGCCATC CCCACTATGA CCACCACCTC
AAGACCTACG GCCACCCTTC GGAGAAGGGG TACAAGGATA TCCTCCCGCT CTGGAAGGCC
GAACGCTGGG ATCCCGAGGC GCTGATGGCG CGCTACGCGG GCGCCGGCGC GAAATATTTC
GTGTCCATGG GCGTCCATCA CGACAATTTC GACCTTTGGA ACTCCAAGCA TCACCGATGG
AACGCCGTGG CCATGGGACC CAAGCGCGAT ATCGTCGGCG CCTGGAAGGC GGCGGCCAAG
CGTCAGGGAC TTCGGTTCGG CGTTTCCGAA CATCTCGGCG CCAGCTATTG CTGGTGGTAC
CCCAGCCATC TCTACGACCA GTTCTGGCCA AAGCTTGGCG TACCGTATGA CGGCGCAGAT
CCGGCCTATG CGGACCTCTA TCACGATAAT CGCGACGAAC CGTACCTCAA CACCAAGCCC
AGTTGGTACA CGCGCAATCC GACGTTTCAT CAGCTCTGGC TGAAACGGAT CCGCGACTTG
GTCGACAGCT ATCAGCCCGA TCTTCTGTAC TCGGACGGCG GCCTGCCGTT CGGAGAGGTC
GGCCGCACGC TGGTCGCCCA TCTGTACAAC AGCAGCATCA CCCGGACCGG CCGCCTCGAA
GCGGTCTACA CCTGCAAGGA CGTGGGCACC GGCGAGTTCT TCAAGGAGGG CATGGTGCAG
GATGTGGAGC GCGGCGTGCT CAAGGGCGTC AACCCGCTCC CCTGGCAGAC CGACACCTCC
AATGGCGACT GGTTCGACAG CGACAACGTC AAGTACAAGA CGTCCAGCGA AATCATCACC
ATGCTCGCCG ACATCGTCAG CAAGAATGGC AACATGCTGC TGAACATCGT CCTCCACGCG
GACGGATCGC TGCCGCCCGA ATCCGACGCG CTTCTGACCG ACCTCTCGGC ATGGATGGCG
GTCAACGCCG AGGCCATCCA CGGCACGCGC CCATGGACCC ACTATGGCGA AGGTCCCACC
GAGGTCGCCG AGGGCATGTT CAAGGAGAAG GCCGACTATT CCGCGCGCGA CATCCGCTTC
ACCGTCAAGG ACAAGACGCT TTACGCCATC GCGCTGGGCG AACCGTCGGA CGTAACGGAG
GTGGTTTCGT TGAGGAAAGG CGCGCCCGAA GCTCGCGGTC GCGTTGTCGG CGTGGAATTG
CTCGGGGCCG GTCCCGTTCA TTTCCGCCAA ACGTCGAAAG CTCTCTTGAT CAGCGTCCCT
GCCCGTCTGC CGACCCGCCA CGCCAGTGTT TTCAAGATCC ACCTGGCCTG A
 
Protein sequence
MAKFSKREIL YGPLLALGAG ALGRAGPTAA AAAPTIERDM ARGPFNTTQE SLETYRTPDW 
FRDAKFGIWA HWGPQAVPRQ GDWYARWLYV PGHPHYDHHL KTYGHPSEKG YKDILPLWKA
ERWDPEALMA RYAGAGAKYF VSMGVHHDNF DLWNSKHHRW NAVAMGPKRD IVGAWKAAAK
RQGLRFGVSE HLGASYCWWY PSHLYDQFWP KLGVPYDGAD PAYADLYHDN RDEPYLNTKP
SWYTRNPTFH QLWLKRIRDL VDSYQPDLLY SDGGLPFGEV GRTLVAHLYN SSITRTGRLE
AVYTCKDVGT GEFFKEGMVQ DVERGVLKGV NPLPWQTDTS NGDWFDSDNV KYKTSSEIIT
MLADIVSKNG NMLLNIVLHA DGSLPPESDA LLTDLSAWMA VNAEAIHGTR PWTHYGEGPT
EVAEGMFKEK ADYSARDIRF TVKDKTLYAI ALGEPSDVTE VVSLRKGAPE ARGRVVGVEL
LGAGPVHFRQ TSKALLISVP ARLPTRHASV FKIHLA