Gene Information |
Locus tag | Caul_5274 |
Symbol | |
ID | 5897458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | - |
Start bp | 213701 |
End bp | 214804 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641555377 |
Product | glycosidase PH1107-related |
Protein accession | YP_001676708 |
Protein GI | 167621923 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
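The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 213701
END_BP = 214804
GENE_LENGTH_BP = 1104
PROTEIN_LENGTH_AA = 367


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the assumed stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 214804 - 213701 + 1 = 1104 bp, matching the Gene Length field.
    print(span_length(START_BP, END_BP))
    # 1104 / 3 = 368 codons; minus the stop codon gives 367 aa.
    print(expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.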
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000268053 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCAAGC GCGCGACCGT GCCAGATTGG GCCATCGGCC CGTTTTCGCC ACCCCAACGG ATCCTGGGTC CGCGACCGGA CCTGCGCTTT GACTGTCCTG TCTCGGGCGA CGCCGTCGCT TGGGCCGCCA AGGACGTGTT CAACGCCGGC GCGGTCGTCC ACGAGGGCCG GGTCTGCCTT CTGGTGCGGG CTGAAGACAC CGTTGGCCGA TACGCTGGCG TCTCGCGCAT CGGGCTGGCC ACTAGCGCCG ACGGCGTGAC GTTCGACCTT GAGTCCAAGC CGGTCCTTTA TCCGGACGAC GACCGCTGGC AGGCCTGGGA GTGGCCGGGG GGCCTGGAGG ATCCGCGTAT CGTGGTCGGG CCCGACGGCA CGTTCGTCTG CGCTTACACC GCGTTCGACG GGAAGGTCGG CTGCTTGTTC ATCGCCACCT CCCGCGATCT TCGCCAGTGG ACCAAGCACG GTCCGGCCTT TGCCGGGTCG CCCTATGCGC GCCTGGCCAC CAAGGCCGGC GCAATCGTCA CCGAGCTGGT CGAGGGGCGA CTGGTGGCCG CGCGCATCGA CGGCCGCTAT TGGATGTACT GGGGCGAGGG CGCGCTTTAT GCGGCGACCT CCGAGGACCT GGTGCGCTGG ACCCCGGTGG AAGCCGACAG CGCGCCGGAC AAGTACCTGA CTTGGGATCC CGAACACCGC GGCCCGATGG GCGCCTGGAC CTTGGAACGT CCACCTGGAC CCAGGGGCGT CCGCCTCCTG GCGGGACCGC GCAGGCATCG CTTCGATTCC CTGTTGGTCG AGCCAGGCCC GCCGGCGATC TTGACGCCTG AGGGCGTGGT TCTGATCTAC AACGGCGGCA ATCACGTGGT GGATGGCGAC CCCGACATCG AGCCCTTCGC CTATCAGCCC AGCCAGATGC TGTTCGACGC CCGCGATCCC ACGGCCCTGA TCGCCCGGGC GCGCGAGCCG TTCCTGGGTA TTCCCAGGCA CGAGGCCGAG GGGCAGGTGG GCAACGTCTG TTTCGCGCAG GGTCTGGTGA CCTTCCAGGG GCAATGGCGA CTTTATCTGG GCCTTGCCGA CTCCAGGCTG GGGGTTTCCA CAGCGCCATT CTGA
|
Protein sequence | MTKRATVPDW AIGPFSPPQR ILGPRPDLRF DCPVSGDAVA WAAKDVFNAG AVVHEGRVCL LVRAEDTVGR YAGVSRIGLA TSADGVTFDL ESKPVLYPDD DRWQAWEWPG GLEDPRIVVG PDGTFVCAYT AFDGKVGCLF IATSRDLRQW TKHGPAFAGS PYARLATKAG AIVTELVEGR LVAARIDGRY WMYWGEGALY AATSEDLVRW TPVEADSAPD KYLTWDPEHR GPMGAWTLER PPGPRGVRLL AGPRRHRFDS LLVEPGPPAI LTPEGVVLIY NGGNHVVDGD PDIEPFAYQP SQMLFDARDP TALIARAREP FLGIPRHEAE GQVGNVCFAQ GLVTFQGQWR LYLGLADSRL GVSTAPF
|
| |
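The Gene sequence, Protein sequence, Translation table, and GC content fields are related, and a short script can check them against each other. The sketch below is a minimal illustration and not the database's own method. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene is on the minus strand), and it uses the codon assignments of NCBI translation table 11, which match the standard code for elongation. Alternative start codons are not handled. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (first, second, third base cycle through TCAG).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the Gene sequence field above.
    head = clean_sequence("ATGACCAAGC GCGCGACCGT GCCAGATTGG")
    print(translate(head))  # MTKRATVPDW, the first ten residues of the protein
    print(round(gc_fraction(head), 2))
```

To check the whole record, pass the full Gene sequence field through `clean_sequence`, then compare `translate(...)` with the cleaned Protein sequence field and `gc_fraction(...)` with the GC content field.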