Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2410 |
Symbol | |
ID | 5899865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2629526 |
End bp | 2631601 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562901 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_001684035 |
Protein GI | 167646372 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000000106036 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.27509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGATATG GATTGATGTT GGCCGCCTGC GCCGGCTTGA CGATGGCCGC CTCGACGGCG GCGATGGCCA CAACCCCGGT CAAGGTAGCG ATCGATGGCG CCAGCCGGGC CGCGCCGGTC ACGAAATACG AATACGGCAT GTTCATCGAG CCCATCGGGG GGCTGGTGGC GCGCACGCTG TGGGCGGAAA TGCTCGACGA CCGGAAGTTC TACTATCCCG TCATGGCGCC GGCCTTCGAC AAGCCGCCGC CGCTTAACGC CGAAGGTCGG CCAGGCGTCA GCTATCGCAA GTGGCGGCCG ATCGGCGGCG ACGCCGCGGT GACCATGGAC ACCAAGGCGC CCTATGTCGG GACGCAGAGC CCTCGCGTCG CCGTCGATCC GTCGATGCGC AAGGGTTTCT CACAATCGGG AATCAGCGTG GCCAAGGGAG AGCGCTACGA CGGCTACCTG CTGATGACGG GCGATCCAGG GGCGAGGGTC GAGGCGGCGC TCGTCTGGGG TCCTGGACCC AATGATCGTC AGGCCATAGC CCTGCCCGAA CCCGGTGACG ACTGGCGCCG AGCCGATTTC AGCTTCACGC CGACGGTGGA CGCGGCGGAC GCCCGCCTGG AGATCACGGG CCTGGGTTCG GGATCCTTCC GAATCGGCGC GGTCTCGCTG ATGCCCGCCG ACAACATTCG CGGCTGGCGC GCCGACACCA CCGCGATCGC CCGCTCGCTG AATTCGGGCA TGTGGCGCCT GCCCGGCGGC AATTTCCTGT CGGATTGGGA CTGGCATGGC GCCATTGGAC CGAGGGACAA GCGCGCGCCG ATGTTCGACC ACGCCTGGAG CGCGATGCAA CCCAACGACA TCGGCATGGA CGAGTGGATG GATCTGACCA AGATCATCGG CGTCGAGCCC TACGTCACGG TCAATGCTGG CCTTGGCGAC GCCAACTCGG CCGCCGAGGA GGTGGAATAT CTAAACGGCT CGGCCAGCAC CCCCTGGGGC GCGCGCCGAG CCGCCAATGG CCATCCACAG CCCTATGGCG TGAAGTACTG GAACATCGGC AACGAGCCGT ACGGCTGGTG GCAGATCGGC AAGACCTCGC TCGACTATTT CATGATCAAG CACAATGCGT TCGCCGAGGC GATGCGCGCG GTCGATCCGA CGATCACCCT GATCGGCTCG GGGGCCATGC CTGACCAAGG TCATCCGCGC GGTACAAAGG AAAACGCGTC GATCGAGAGC GTCGCGCCGA AGTTCGGCAC CGAATGGGAT TGGACCGGTG GGCTCTTGGA AAAGGCCTGG GGCAATTTCG ACGGCATTTC CGAGCATTGG TACGATCAAC CCGAGGCTCG CCCCGACGCG CCCGCCGACG CTGAACTGAT CGAGTACGCT CGCTCGCCCT CCAACCAGGT CCGGATGAAA GCCGATCAAT GGAAGATCTA CCAGAAACGC TTCCCGGCCA TGAAGGACAA GGCGATCTTC CTGTCGATCG ACGAGTACGC CTATTTCGGC CAAGTGAACC TGAAGTCAGC GCTGGCCTAC GCGATGGTCC TGCAGGAGAT GCTTCGCCAC ACCGATTTCC TGACCATGGG AGCCTTCACC ACGGGGGTCT CGACCATGGA CATCACGCCC ACCGACGCGG TGCTCAACAC GACGGGCCAG GTCTTCAAGC TCTATGGCGA ACATTTCGGC GCCGGCGTCG TTCCTGTGAC GGTGCAAGGC GACTCGCCTC AGCCCGAACC GCGCTATCCC GTCGGCTACA ATCACCCCAA GGTGCGCGCG GGAAGTCCGA CCTATCCGCT CGATGTAGTC GCGGGCCTGA GCCCCGACGG CAAGACCCTG CGGATCGCCG TGGTGAATCC GACGCTCACG CCCCAAACCC TGAAGCTTGA CCTGAAGGCG CTGGCCGCGC GGGGCGCTGG ACGCAAATGG GCACTGAGCG GCGCGTCGCT GAACGCCCAG AACACGGTTG GCGCCGCAGC CGGAGTCACT ATCACCCAGA GCGCAGCGCC GCGTCCGGGC GGCGAACTCG TCGTCTCGCC GATTTCGGCG ACCGTGTTCG AGTTTCCGAT CGAGGCCAAG CGATAG
|
Protein sequence | MRYGLMLAAC AGLTMAASTA AMATTPVKVA IDGASRAAPV TKYEYGMFIE PIGGLVARTL WAEMLDDRKF YYPVMAPAFD KPPPLNAEGR PGVSYRKWRP IGGDAAVTMD TKAPYVGTQS PRVAVDPSMR KGFSQSGISV AKGERYDGYL LMTGDPGARV EAALVWGPGP NDRQAIALPE PGDDWRRADF SFTPTVDAAD ARLEITGLGS GSFRIGAVSL MPADNIRGWR ADTTAIARSL NSGMWRLPGG NFLSDWDWHG AIGPRDKRAP MFDHAWSAMQ PNDIGMDEWM DLTKIIGVEP YVTVNAGLGD ANSAAEEVEY LNGSASTPWG ARRAANGHPQ PYGVKYWNIG NEPYGWWQIG KTSLDYFMIK HNAFAEAMRA VDPTITLIGS GAMPDQGHPR GTKENASIES VAPKFGTEWD WTGGLLEKAW GNFDGISEHW YDQPEARPDA PADAELIEYA RSPSNQVRMK ADQWKIYQKR FPAMKDKAIF LSIDEYAYFG QVNLKSALAY AMVLQEMLRH TDFLTMGAFT TGVSTMDITP TDAVLNTTGQ VFKLYGEHFG AGVVPVTVQG DSPQPEPRYP VGYNHPKVRA GSPTYPLDVV AGLSPDGKTL RIAVVNPTLT PQTLKLDLKA LAARGAGRKW ALSGASLNAQ NTVGAAAGVT ITQSAAPRPG GELVVSPISA TVFEFPIEAK R
|
| |