Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen2424_3290 |
Symbol | |
ID | 4453769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia HI2424 |
Kingdom | Bacteria |
Replicon accession | NC_008543 |
Strand | - |
Start bp | 131877 |
End bp | 133040 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639695354 |
Product | arabinogalactan endo-1,4-beta-galactosidase |
Protein accession | YP_836927 |
Protein GI | 116691394 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3867] Arabinogalactan endo-1,4-beta-galactosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAC GATCGATGCT GCGCTGGAGC ATGTCGAGCG CGGCGCTTGC CTGCCTCGAC CTGGCCGGTC CGCTGGCCGC CTTCGCACGA CCCGCGCCGC ACAGTGCGGC GGCGGCCGAG TTCGCGATGG GCGCCGACAT CTCGACGTTG CCGGAACTCG AAGCGCATGG CGCCGCCTTC TTCGACCGCG GCGGCGCGCC ACGCGACTGC CTGAAGATTC TCCGCGCGCA CGGCGTCGAT TCGATCCGGA TCAAGGTCTG GAACGATCCC GGCAACCCGG ATTTCTTCCC AGCGAACCAG AGCGATGCGG CGGGCTACAA CAATGCCGCG CACGTCGTCG TGCTCGCGCA GCGCGCGGCC GCGCTCGGGA TGCGCATCCT GATCGACTTC CACTACAGCG ACTGGTGGGC CGACCCCGGC AAGCAATATC CTCCGCATGC ATGGGCCGGC AAGAGCCTGG CCGAAACCTG CGCGCTGCTG TCGGCGTACA CGACCGACGT GCTGCGCCGG CTGCAGCGCG CCGGCGTGAG CCCCGAGTGG GTGCAGATCG GCAACGAGAT CACGGGCGGC ATGCTGTGGC CGCTCGGCCG CTACGACCAG TGGGACAATC TCGCGCAGTT GCTGAAAACC GGCCACGACG CGGTGAAGGC CGTCGATCCG CGCATCAAGG TGATGCTGCA CGTCGACAGC GGTGGCGACA ACGGCAAGAG CCGCTGGTGG TTCGACAGCG CGACGCAGCG CGGCGTCGCA TTCGATGTGA TCGGCCTGTC GTATTACCCG CAATGGCAAG GCTCGCTCGA CGATCTGCGC AACAACGCGA ACGACCTAGC GGTGCGCTAC GACAAGGAGC TGATCGTCGT CGAAACCGCG TATCCGTGGA CCACCAGCGA TGGCGATTCC GAGCCGAACG CGATGACCAA CACCGGATCG ACGACCTTTC CGCCGTCGCC GGCCGGCCAG GCCCAATTCC TCGCAGCGGT CGTCGATATC GTGAAGGGCG TGCCGGGCAA TCGCGGCAAG GGCGTGTTCT GGTGGGAACC GGAATGGATC CCGACGCGCG GTGTCGGCTG GAAGCTCGGC GCGGGCGACC AGTGGGACAA CAACACGCTG TTCGATTTCC ACGGTCACGC GTTGCCGTCG CTCGACGCGT TCCGGCAGCG CTGA
|
Protein sequence | MNRRSMLRWS MSSAALACLD LAGPLAAFAR PAPHSAAAAE FAMGADISTL PELEAHGAAF FDRGGAPRDC LKILRAHGVD SIRIKVWNDP GNPDFFPANQ SDAAGYNNAA HVVVLAQRAA ALGMRILIDF HYSDWWADPG KQYPPHAWAG KSLAETCALL SAYTTDVLRR LQRAGVSPEW VQIGNEITGG MLWPLGRYDQ WDNLAQLLKT GHDAVKAVDP RIKVMLHVDS GGDNGKSRWW FDSATQRGVA FDVIGLSYYP QWQGSLDDLR NNANDLAVRY DKELIVVETA YPWTTSDGDS EPNAMTNTGS TTFPPSPAGQ AQFLAAVVDI VKGVPGNRGK GVFWWEPEWI PTRGVGWKLG AGDQWDNNTL FDFHGHALPS LDAFRQR
|
| |