Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0792 |
Symbol | |
ID | 3846454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 923402 |
End bp | 924883 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637838095 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_438989 |
Protein GI | 83717569 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.127237 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGAC GGGCGGGGCG CGACGCGGGT GAGAGGCGCG AGATGCACGA AATGCACGAG ATGCACGAGA TGCACGAGAT GCACGAGATG CGCGGCGCGG AGGATGCGCG GGAGAGGCGG CAGGCGCGGG CATGCGCGCC CGCGCGCCGA TCGTCGTGCG GGCGGCGGAT GCGTTGGCGC GTGATCGCGC GCGCCGTGGC GGCGAGCGTC GCGGTTGCCG CGTTCGCGGC GGCGTCGGCA GGCCTCGCCT GCGCGACGGA GCGCACACGA GGCGACGGCG CGCGGCGCGA CGCCGGATTG CGCGACGCGG CGGCGGGGGT GGGCGAGACA TCGGCGCCGT TGTCGGAATC GATGTCGACA TCGCCGGCGC AGGCGGTCCC CCGCGCATCC GCCGACATCG CCGCCTGCTC GCCATCGTGG CCGCGCTGGG AGCGCTTCAA GCGCGATTTC GTGTCGGCCG ACGGCCGCGT GATCGACGTC GGCTCGCCCG ATGAGCGGAC CGTGTCGGAG GGGCAGGCGT ACGGCCTGTT CTTCGCGCTC GTCGCGAACG ACCGGCCGGC GTTCGACGCG CTGCTGCGCT GGACCGAGGA CAACCTCGCG CAGGGCGACC TGGCCGCGCG TCTGCCGGCG TGGCTGTGGG GGCGCGCGGC CGACGGCGCT TGGCGCGTGC TCGACGCGAA CGCCGCGTCC GACGCGGATC TGTGGCTCGC GTACGCGCTG CTCGAAGCCG GCCGGCTGTG GGGCGAGCGC AGCTACACCG CGCGCGGCGC GCTGCTCGCA AAGCGCGTGC TCGACGACGA AACCGCGACG CTGCCGGGGC TCGGCCTCGT GCTGCTGCCG GGCCCGACGG GCTTCCGGCC GGCGCGCGAC GCGTGGCGGC TGAACCCGAG CTATTCGCCG CCGCAGGCGA TTCGCGGGAT CGGCGCGCAT CTGCCCGACG ACGCGCGCTG GGCGCGGCTC GCGGCGAGCG CCGGCCGCGT GCTGATCGAC AGCGCGCCGC GCGGCTTCGC GCCGGACTGG GTGCTGTACC GCGCGAACGA CGGCTTCCGG CCGGACGCCG ACACGCGCGC GGCGAGCGCG TACAACGCGA TTCGCGTCTA TTTGTGGGCG GGGATGCTCG ATGCGCGCGA TCCGCTCGCG ATACCGTTGA CGGCGCGTTT CGCACCGTTC GCCGACTACG TCGCCGCACA CGGCGCACCG CCGGAAACGG TCGACACGAT GACGGGCGCG GCCGGTTCTC GCGACGGCAA CGCCGGGTTT TCCGCGGCGG CGGTGCCGTT TCTCGAGGCG CGCGGCGAGC GCGCGCTCGC CGACGCGCAG GTCGCGCGCA TCGCGCGGCT CGAACGCGAG ACGCCGAGCG GCTACTACGC GAACGTGCTG ACGCTGTTCG GGCTCGGATG GCGCGACGGG CGTTACCGGT TCGCGGCCGA CGGCACGCTG CGGGTCCGAT GGAGCGAGCC GTGCTCGATG CCCGCGCGTT GA
|
Protein sequence | MARRAGRDAG ERREMHEMHE MHEMHEMHEM RGAEDARERR QARACAPARR SSCGRRMRWR VIARAVAASV AVAAFAAASA GLACATERTR GDGARRDAGL RDAAAGVGET SAPLSESMST SPAQAVPRAS ADIAACSPSW PRWERFKRDF VSADGRVIDV GSPDERTVSE GQAYGLFFAL VANDRPAFDA LLRWTEDNLA QGDLAARLPA WLWGRAADGA WRVLDANAAS DADLWLAYAL LEAGRLWGER SYTARGALLA KRVLDDETAT LPGLGLVLLP GPTGFRPARD AWRLNPSYSP PQAIRGIGAH LPDDARWARL AASAGRVLID SAPRGFAPDW VLYRANDGFR PDADTRAASA YNAIRVYLWA GMLDARDPLA IPLTARFAPF ADYVAAHGAP PETVDTMTGA AGSRDGNAGF SAAAVPFLEA RGERALADAQ VARIARLERE TPSGYYANVL TLFGLGWRDG RYRFAADGTL RVRWSEPCSM PAR
|
| |