Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1541 |
Symbol | |
ID | 5898996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1632214 |
End bp | 1633764 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641562029 |
Product | Alpha-L-fucosidase |
Protein accession | YP_001683169 |
Protein GI | 167645506 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3669] Alpha-L-fucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.664624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGT TTAGCAAGCG CGAGATTTTG TACGGACCAC TTCTGGCGCT CGGCGCCGGG GCGCTGGGCC GGGCCGGCCC CACGGCGGCG GCGGCGGCGC CAACGATCGA ACGGGACATG GCGCGCGGCC CGTTCAATAC GACACAGGAG TCGCTCGAGA CCTACCGCAC GCCCGATTGG TTTAGAGACG CGAAGTTCGG CATCTGGGCC CACTGGGGTC CCCAGGCGGT GCCGCGCCAG GGCGATTGGT ACGCGCGTTG GCTGTATGTG CCAGGCCATC CCCACTATGA CCACCACCTC AAGACCTACG GCCACCCTTC GGAGAAGGGG TACAAGGATA TCCTCCCGCT CTGGAAGGCC GAACGCTGGG ATCCCGAGGC GCTGATGGCG CGCTACGCGG GCGCCGGCGC GAAATATTTC GTGTCCATGG GCGTCCATCA CGACAATTTC GACCTTTGGA ACTCCAAGCA TCACCGATGG AACGCCGTGG CCATGGGACC CAAGCGCGAT ATCGTCGGCG CCTGGAAGGC GGCGGCCAAG CGTCAGGGAC TTCGGTTCGG CGTTTCCGAA CATCTCGGCG CCAGCTATTG CTGGTGGTAC CCCAGCCATC TCTACGACCA GTTCTGGCCA AAGCTTGGCG TACCGTATGA CGGCGCAGAT CCGGCCTATG CGGACCTCTA TCACGATAAT CGCGACGAAC CGTACCTCAA CACCAAGCCC AGTTGGTACA CGCGCAATCC GACGTTTCAT CAGCTCTGGC TGAAACGGAT CCGCGACTTG GTCGACAGCT ATCAGCCCGA TCTTCTGTAC TCGGACGGCG GCCTGCCGTT CGGAGAGGTC GGCCGCACGC TGGTCGCCCA TCTGTACAAC AGCAGCATCA CCCGGACCGG CCGCCTCGAA GCGGTCTACA CCTGCAAGGA CGTGGGCACC GGCGAGTTCT TCAAGGAGGG CATGGTGCAG GATGTGGAGC GCGGCGTGCT CAAGGGCGTC AACCCGCTCC CCTGGCAGAC CGACACCTCC AATGGCGACT GGTTCGACAG CGACAACGTC AAGTACAAGA CGTCCAGCGA AATCATCACC ATGCTCGCCG ACATCGTCAG CAAGAATGGC AACATGCTGC TGAACATCGT CCTCCACGCG GACGGATCGC TGCCGCCCGA ATCCGACGCG CTTCTGACCG ACCTCTCGGC ATGGATGGCG GTCAACGCCG AGGCCATCCA CGGCACGCGC CCATGGACCC ACTATGGCGA AGGTCCCACC GAGGTCGCCG AGGGCATGTT CAAGGAGAAG GCCGACTATT CCGCGCGCGA CATCCGCTTC ACCGTCAAGG ACAAGACGCT TTACGCCATC GCGCTGGGCG AACCGTCGGA CGTAACGGAG GTGGTTTCGT TGAGGAAAGG CGCGCCCGAA GCTCGCGGTC GCGTTGTCGG CGTGGAATTG CTCGGGGCCG GTCCCGTTCA TTTCCGCCAA ACGTCGAAAG CTCTCTTGAT CAGCGTCCCT GCCCGTCTGC CGACCCGCCA CGCCAGTGTT TTCAAGATCC ACCTGGCCTG A
|
Protein sequence | MAKFSKREIL YGPLLALGAG ALGRAGPTAA AAAPTIERDM ARGPFNTTQE SLETYRTPDW FRDAKFGIWA HWGPQAVPRQ GDWYARWLYV PGHPHYDHHL KTYGHPSEKG YKDILPLWKA ERWDPEALMA RYAGAGAKYF VSMGVHHDNF DLWNSKHHRW NAVAMGPKRD IVGAWKAAAK RQGLRFGVSE HLGASYCWWY PSHLYDQFWP KLGVPYDGAD PAYADLYHDN RDEPYLNTKP SWYTRNPTFH QLWLKRIRDL VDSYQPDLLY SDGGLPFGEV GRTLVAHLYN SSITRTGRLE AVYTCKDVGT GEFFKEGMVQ DVERGVLKGV NPLPWQTDTS NGDWFDSDNV KYKTSSEIIT MLADIVSKNG NMLLNIVLHA DGSLPPESDA LLTDLSAWMA VNAEAIHGTR PWTHYGEGPT EVAEGMFKEK ADYSARDIRF TVKDKTLYAI ALGEPSDVTE VVSLRKGAPE ARGRVVGVEL LGAGPVHFRQ TSKALLISVP ARLPTRHASV FKIHLA
|
| |