Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0936 |
Symbol | |
ID | 5898391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 986693 |
End bp | 988333 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561419 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_001682565 |
Protein GI | 167644902 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.789253 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGC AAACGATCCA GAACCCCATC CTGCGGGGCT TCAATCCCGA CCCGTCGATC ATCCGAGTGG ATGAGGACTA CTACGTCGCC ACCTCGACCT TCGAGTGGTT CCCGGGCGTA CAGATCCACC ACTCGCGCGA CCTGGTGAAC TGGCGGCTGC TGACCCGGCC CCTGACCCGG GCGAGCCAGC TGAACATGCT GGGCGCGTCC GACGGCTGCG GGGTCTGGGC CCCGTGCCTG ACCCACGCCG ACGGCAAGTT CTGGCTGATC TATACCGACG TGAAGCGCTA TGGCCGCACC ACGGTGGGCG GGGCCTCGGG CGCATCCCTG CGCGACTTCC ACAACTATCT GGTCACCGCC GACCACATCG AGGGACCGTG GTCGGACCCG GTCTATCTCA ACAGCAGCGG CTTCGACCCG TCGCTGTTCC ACGACGACGA CGGCCGCAAG TGGCTGCTGA ACCAGCTGTG GGACCACCGG CCGGGCCGCA ACCGCTTCGC CGGCATCGTG GCCCAGGAAT ACGACGCCGC CGCCCAGGCG CTGGTGGGGC GGCGCGTGAA CATCTTCCCC GGCACCCCGC TGGGCCTGAC CGAGGCGCCG CATCTCTACA AGCGGGACGG CTGGTACCAC CTGATCACCG CCGAGGGCGG CACCGGCTTT GGCCACGCGG TGACCATGGC CCGGTCCCGA ACCTTGGAGG GTCCCTACGA GGTTCACCCC GACGGACCGG TCCTGACCGC CCGCGACCGC CCGCATGCGC CGCTGCATCG GGCCGGCCAC GCCGATCTGG TCGAGACGGC CGACGGCCAG ACCTGGATGG CCTATCTGTG CGGCCGGCCG CTGCCCAACC GGGGACGCTG CGTGCTGGGC CGCGAGACGG CGATCCAGCC GATGACCTGG GGCGACGACG GCTGGCTGCG CACCCGCGAC GGCTCGGGCG ATCCGGAGCT GACCCCGCCG TCGCCCGGCC TGCCGCCCGC GCCGTTCCCG GCCGCGCCGG CCTGGGAGGA TTTCGACGGC CCCGACCTGC CCCTCGACTT CCAGTGGCTG CGCTCGCCGT TCCCCGAGGA GCTGTTCAGC CTGACGGCCC GGCCGGGCTG GCTGCGGCTG TTTGGCCGCG AGACGATCGG TAGCCAATAC CGCCAGGCCC TGGTCGCCCG CCGCCAGCAG GCGTTCTGCT ACTCCGCCCG CACGGTGTTG GACTTTTCGC CCGAGCACTT CCAGCAGGTG GCCGGGCTGC TCTGCTACTA TGGGGCAAGC AAGTTCCACT ACCTGTTCGT GTCGCGCGAC GACGAGAACG GCCGCCATGT CCAGGTGATG TCGGCCCTGC CCGACAGTCC CCAGGCCGAC GCCTTCACCC CGCCGATCCC GCTGCCCGAC ACGGGCCTGG TCCACCTGCG CGTCGAGGTC GATTTCGAGC GTCTGCGCTT CGCCTTCAGC CTGGACGGCC AGGCCTGGAC TTGGCTGGAG CAGGTGTTCG ACGCCTCGAT CCTGTCCGAC GAGGCCACCA GCCCCGGCGC GCCCAACTTC ACCGGCGCCT TCGTCGGCAT GGCCTGCCAG GACTTGGCCG GCACGGCGCG GGCGGCCGAT TTCGACGGGT TTGGGTATGT CGAGCGCGAG TATCAGGCGA CGGTGGGGTG A
|
Protein sequence | MTPQTIQNPI LRGFNPDPSI IRVDEDYYVA TSTFEWFPGV QIHHSRDLVN WRLLTRPLTR ASQLNMLGAS DGCGVWAPCL THADGKFWLI YTDVKRYGRT TVGGASGASL RDFHNYLVTA DHIEGPWSDP VYLNSSGFDP SLFHDDDGRK WLLNQLWDHR PGRNRFAGIV AQEYDAAAQA LVGRRVNIFP GTPLGLTEAP HLYKRDGWYH LITAEGGTGF GHAVTMARSR TLEGPYEVHP DGPVLTARDR PHAPLHRAGH ADLVETADGQ TWMAYLCGRP LPNRGRCVLG RETAIQPMTW GDDGWLRTRD GSGDPELTPP SPGLPPAPFP AAPAWEDFDG PDLPLDFQWL RSPFPEELFS LTARPGWLRL FGRETIGSQY RQALVARRQQ AFCYSARTVL DFSPEHFQQV AGLLCYYGAS KFHYLFVSRD DENGRHVQVM SALPDSPQAD AFTPPIPLPD TGLVHLRVEV DFERLRFAFS LDGQAWTWLE QVFDASILSD EATSPGAPNF TGAFVGMACQ DLAGTARAAD FDGFGYVERE YQATVG
|
| |