Gene Information |
Locus tag | Caul_3612 |
Symbol | |
ID | 5901067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3896698 |
End bp | 3898263 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641564123 |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_001685237 |
Protein GI | 167647574 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3534] Alpha-L-arabinofuranosidase |
TIGRFAM ID | |
|
|
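The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that convention) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3896698
end_bp = 3898263
gene_length_bp = 1566
protein_length_aa = 521

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed
# to be the stop codon and does not contribute a residue.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # span matches Gene Length
print(remainder == 0)                                # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under these assumptions the three checks agree with the listed Gene Length (1566 bp) and Protein Length (521 aa).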
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.102736 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGCTGA CTTCGCTGAA GCACGCGCTC GTCGCCGGCT TGGCTACCGC CGTCCTGGCC AGCGGGAGCG CCGCTTGCGC CCAGACCGCC GTCTCCGCCA CCCTGCGCGC CGACCAGCCG GGCGCGGTTA TCCAGCCCGA GGTCTATGGC CAGTTCGCCG AGCACCTGGG GCGCGGAATC TACGAGGGCG TCTGGGTCGG CGAGGACAGC AAGATCCCCA ACACCAGGGG CTACCGCAAC GACGTGGTCG CGGCCCTGAA GGCCATCAAG GTGCCCGTGG TGCGCTGGCC CGGCGGCTGC TTCGCCGACG ACTATCACTG GCGCGAGGGC GTGGGTCCGC GCGACAAGCG TCCCGTGAAG GTCAATGTGT CCTGGGGCGG CGTCGAGGAG CCCAACAGCT TCGGCACCAA CGAGTTCATG GAGTTCGCCG AACTGCTCGG CGCCAAGACC TATGTCGCCG GCAACGTCGG CACCGGCACG CCGCAGGAAA TGGCCGAGTG GGTCGAGTAC ATGGTCTCGC CGACCAACTC GACCATCGCC AACATGCGCC GCGCCAATGG CCGCGACAAA CCCTGGAAGC TCGACTATTT CGGGATCGGC AACGAGAACT GGGGCTGCGG CGGCCAGATG ACGGCGGCCC ACTATACCGA CCTCTACCGC AACTTCGCCG AGTTCGTGCG CGTGCCGCAG GGGACCAAGA CCGTGAAGGT CGCCGGTGGA CCCAATAGCG ACGACTACAG CTGGACCGAG ACCCTGATGG CCGGCGCGGC CAAGCACACC GACGCCATCA GCCTGCACTA CTACACGATC CCCAGCGGCA AATGGTCGAA GAAGGGCTCG GCCACCCAGT TCGACGAACA GGTCTGGGCC GACACCATGT TCCAGGCCCT GCGCATGGAC GAACTGGTCA CCAAGCACAG CGCGGTCATG GACAAGTACG ACCCCGAAAA GAAGGTCGGA CTGTATGTCG ATGAATGGGG GCTGTGGCAC GACGTGGAGC CCGGCTCCAA TCCGGGCTTC CTGTATCAGC AGAACACCAT GCGCGACGCG GTGGCGGCGG GCCTGACCCT GAACGTCTTC CACAAGCACG CCGACCGGGT GCGGCTGACC GCCATCGCCC AGATGGTCAA TGTGCTGCAG GCCATGATCC TGACCGACGG CGACAAGATG ATCCTGACCC CGACCTACTG GGTCTACGAC CTGTACAAGC CGTTCCAGGG GGCGACCTCG TTGCCGATCG AGGTCAGCAG CCCGGCCTAC GGGCTTGGCA AGTCCAGCGT GCCGGCGGTC AGCGCCTCGG CGGGCAAAGA CACGGCCGGC GTCGTTCACC TGGCCCTGGT CAACCTGGAT CCCAACAGGT CGGCGACCGT GACGATCAAG CTCTCGGGGG TGACCGGCAA GACCGCCAAG GGCCGTGTGC TGACCGGACC GACCATGAGC GCCCACAACA CCTTCGAGGC CCCCGACGCC GTCAAGCCGG CGGCTTTCAC GGCGGCCTCG CTCAAGGGCG ATGTGCTGAC CGCGACCCTG CCGAGCAAGT CAGTGGTGGT GTTGGATCTG AACTAG
|
Protein sequence | MKLTSLKHAL VAGLATAVLA SGSAACAQTA VSATLRADQP GAVIQPEVYG QFAEHLGRGI YEGVWVGEDS KIPNTRGYRN DVVAALKAIK VPVVRWPGGC FADDYHWREG VGPRDKRPVK VNVSWGGVEE PNSFGTNEFM EFAELLGAKT YVAGNVGTGT PQEMAEWVEY MVSPTNSTIA NMRRANGRDK PWKLDYFGIG NENWGCGGQM TAAHYTDLYR NFAEFVRVPQ GTKTVKVAGG PNSDDYSWTE TLMAGAAKHT DAISLHYYTI PSGKWSKKGS ATQFDEQVWA DTMFQALRMD ELVTKHSAVM DKYDPEKKVG LYVDEWGLWH DVEPGSNPGF LYQQNTMRDA VAAGLTLNVF HKHADRVRLT AIAQMVNVLQ AMILTDGDKM ILTPTYWVYD LYKPFQGATS LPIEVSSPAY GLGKSSVPAV SASAGKDTAG VVHLALVNLD PNRSATVTIK LSGVTGKTAK GRVLTGPTMS AHNTFEAPDA VKPAAFTAAS LKGDVLTATL PSKSVVVLDL N
|
| |
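The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with TTG and ends with the stop codon TAG, even though the record is on the minus strand) and uses the codon assignments and alternative start codons of NCBI translation table 11, as named in the Translation table field. The function name `translate_cds` is hypothetical, and only the first 30 bases of the sequence are used here to keep the example short.

```python
# Codon table in the NCBI ordering (bases T, C, A, G).
# Table 11 assigns amino acids the same way as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Start codons permitted by table 11; as the first codon
# each of them is translated as methionine.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the Gene sequence field above.
prefix = "TTGAAGCTGA CTTCGCTGAA GCACGCGCTC"
print(translate_cds(prefix))  # MKLTSLKHAL
```

The output matches the first ten residues of the Protein sequence field. The initial TTG would ordinarily encode leucine, but as a start codon it is read as methionine, which is why the protein begins with M.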