Gene Information |
Locus tag | Bphy_6541 |
Symbol | |
ID | 6248078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010625 |
Strand | + |
Start bp | 1123383 |
End bp | 1125344 |
Gene Length | 1962 bp |
Protein Length | 653 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642598213 |
Product | enterotoxin |
Protein accession | YP_001862615 |
Protein GI | 186471297 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
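The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1123383
end_bp = 1125344

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1962 653
```

Under those assumptions the result agrees with the listed gene length of 1962 bp and protein length of 653 aa.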
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.633739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.541047 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCCC ATCACGCCTC TTTGGAACCT TCTGACGCGC CCCGCTTGCG TGTGCGAGGC GAACGCTATG CGTTCGGCAA TGACGCGATC GAACTGGACT GGACCATCGC AGACGATAGT CTGTGCGACG TTTGCCTCTT CGATCGTACG CATGGACGTG CTCTACAGAT CGACGCGCCA TTCGTTCTCA CGTTAGCGGA TGGCCGCACG GTTGGGGCGG CCGGGTTGCG GCTCGTCGCG CCCCTGCGCG AAGAAGCGTT GTGCGCGGAT GCCGATGCGC TGCGACAAGC CGAACGTCAT GCAGGGCGGC GCGTGATCGC GACGTTCGGC GATAGCGAGC AGCGCCTGCG CGTCGAATGG AGTATCGAGC AGCGCGACGG CGCACGTTAT CTTCGCCAGC ATCTGTCCAT CACTGCGCTG TTGCAGGACG AGCATATTGC GTCCGTTGCA TTATGGCGGA CGTACGCGCC GTCTGCCCGA AAAGCCGGCA ACCTCACGGC AGTGCCAGTC GTCGACGGGA ATGTGTATCT CGGCTTCGAG CTTCCGATGG CGGAAAGCGA AATCCGCGAC GGCACCGTCT GCTTCAGCGT CACGCGCACA CTGCCGCTCA AAAAGGGCAA GACGCTTGCT TATTCCGTTG TCGCAGGCGT GTTTCGCGAA GGCCAGCTGC GCCGCGATTT CGCAACCTGC CTGGAGCGCG AACGTGCGCG ACCTTATTGT CCCTTCCTGC ATTACAACTC CTGGTACGAC ATCGGCTTTC TCACGACCTA TACGCAGAAT CAGGCCATCG AACGCATTCA CGCGACGGGC CGCGAACTGC ACGACAAGCG CGGAGTGCAA ATCGATTCAT TCGTGTTCGA CGACGGCTGG GACGATTACA GCGGCACCTG GACGTTCAGC GATGCATTTC CCAACGGCTT CGCGCCGTTG AGAGAGGCCG CGGCGCGCTA TGGCGCAGCA CCCGGTGTAT GGTTGTCCCC TTGGGGCGGC TACGGCCCGC CTAAAACCGA GCGCGTGACG CGCGGCCGCG CGACAGGATA CGAAACGGCC GGCGACGGCT TCGCGCTGTC GGGTCCGGCG TACTATCGCC GCTTCCACGA AGTCGCGATG GATCTTCTGA CGAAACACGG CGTCAACCAT TTCAAGCTCG ATGGCACGGG CAACGCGAAT ACGGTCGTCG CCGGCAGTCG TTTCAATAGC GATTGGGATG CCGCGATAAG ACTCATCGAC GACATGCGCT GCGTGAATCG CGATGTGTTC GTCAACCTGT CGACGGGCAC GCAGGCATCG CCATTCTGGC TGCGCTATGT CGATTCGATC TGGCGGGACG GCGCAGATTG CGGTTTCGCC GGCAAGGGCA GCGAGCGCCA GCGCTGGATC ACGTATCGCG ACGCGCAAAC GTACCGCAAC GTCGTGTGCC GAAGCCCGCT ATTTCCGCTC AATTCGCTGA TGCTTCACGG CATCATCTAT GCGCAGGCCA ACGCACAACT CAACAGCGAT CCCTCGCATG CGTTCGCGCA CGAAGTGCTT TCCTATTTCG GCAGCGGGAC GCAGTTGCAG GAACTGTATG TCACGCCGTC GTTGCTGAGT GATCGCGACT GGGACGTGCT CGCGAAAGCG GCCCGCTGGG CGCGCGCGCA TGCCGACGTA CTACGCGACT GCCACTGGGT CGGCGGCGCG CCGGATGAAC TCGATGTCTA TGGCTGGGCT GCGTGGACGC CGTCGAAAGC GATCGTGACG CTGCGCAATC CGGATGATCG TGCGAAAGAC TTCGCGCTCG ATCTGCAAGC GCATCTCGAA 
CTGCCGGATG GTGTTGGAGG TCGGTTCACC GCACGATTTC CGTTCACAGA GCGCGATGCA AACGTCGCAA CACAATGGGA CGCACGGCGC CCGCAAGCCA TACGCCTGGA CCCGTTTCAG GTGCTGACGC TCGAACTCTC TCCCGTCGAG TGCGACGTTT GA
|
Protein sequence | MQAHHASLEP SDAPRLRVRG ERYAFGNDAI ELDWTIADDS LCDVCLFDRT HGRALQIDAP FVLTLADGRT VGAAGLRLVA PLREEALCAD ADALRQAERH AGRRVIATFG DSEQRLRVEW SIEQRDGARY LRQHLSITAL LQDEHIASVA LWRTYAPSAR KAGNLTAVPV VDGNVYLGFE LPMAESEIRD GTVCFSVTRT LPLKKGKTLA YSVVAGVFRE GQLRRDFATC LERERARPYC PFLHYNSWYD IGFLTTYTQN QAIERIHATG RELHDKRGVQ IDSFVFDDGW DDYSGTWTFS DAFPNGFAPL REAAARYGAA PGVWLSPWGG YGPPKTERVT RGRATGYETA GDGFALSGPA YYRRFHEVAM DLLTKHGVNH FKLDGTGNAN TVVAGSRFNS DWDAAIRLID DMRCVNRDVF VNLSTGTQAS PFWLRYVDSI WRDGADCGFA GKGSERQRWI TYRDAQTYRN VVCRSPLFPL NSLMLHGIIY AQANAQLNSD PSHAFAHEVL SYFGSGTQLQ ELYVTPSLLS DRDWDVLAKA ARWARAHADV LRDCHWVGGA PDELDVYGWA AWTPSKAIVT LRNPDDRAKD FALDLQAHLE LPDGVGGRFT ARFPFTERDA NVATQWDARR PQAIRLDPFQ VLTLELSPVE CDV
|
| |
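The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is illustrative only: it assumes the sense-strand sequence is read in frame from its first base, uses the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code), and ignores the whitespace used to group bases in blocks of ten. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the record.

```python
# Codon table built from the NCBI layout for translation table 11:
# bases in the order T, C, A, G at each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame, stopping at the first stop codon.

    Whitespace between base blocks is removed first. Alternative start
    codons (e.g. GTG, TTG) are not special-cased in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGCAAGCCC ATCACGCCTC"))  # prints: MQAHHA
```

The printed residues match the start of the protein sequence listed above (MQAHHASLEP ...), and translating the final blocks `TGCGACGTTT GA` gives `CDV`, the last three residues before the TGA stop codon.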