Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphy_0558 |
Symbol | |
ID | 6242059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phymatum STM815 |
Kingdom | Bacteria |
Replicon accession | NC_010622 |
Strand | - |
Start bp | 631460 |
End bp | 633145 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642592322 |
Product | extracellular solute-binding protein |
Protein accession | YP_001856796 |
Protein GI | 186475326 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.760139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.00152331 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAATT CCGCGGACCC GCGTTTTGCG CCTCGCCTGG CAGGATCAGT TGACCTGGAG GCGGCGCGCG TCGGCGCCGA CGCTCACGGT AACCACGCCA TCGACGAGTT CCTCGCTGGC CGTCTCACGC GGCGCGAACT GCTGCGCTAC GCCAGCGTGA TCGGGATGTC GCTCGCGGGC GGCAGCCTGC TCGCGCCGCG CAGCGCGCGC GCCCAGGGCG CGGCGGGCGC CAACGCGACG ATCCGCGTCG CGCATCTGAC GCCCGCAGGC GCCGTCGATC CGATGACGGT CACCGATGCC GCCAGTCTCT GCCTGCTCAA TCAGACGGGC GAATTCCTGA TTGACGACGA CGGCGAAAAG CAGACGCTCA AGCCCGCGCT CGCGCTGTCG TGGAAGCCGA ACGACAAGGG CGACGTGTGG ACGTTCAAGC TGCGCGAGAA CGTGAAATTC CACGACGGTC AGACCTTCAC CGCGAAGGAC GTCGCGGCGA CTTTCGACCG GCTCGCCGAT CCCGCCGCCG GCTCCGCTGC CCTGTCGACC TTGAAGGGGG TGCTGTCGAA GGGCAACACG AAAGTCGTCG ACGATCACAC TGTCGCGTTC CATCTCGACG CGCCCAACGG CAACTTCCCG TACTACGTCT CGTCGGACAA CTACAACGCG GTCATCCTGC CCGCGAATTA CGCGGGCAAC TACGAGAAGA CCTTCATCGG CACGGGTCCG TTCAAGCTCG AGAAGTATCA GGCGAAAGTG GGCGCGTCGT TCGTGCGCAA TCCCGACTAC TGGGGCGACA AGGCCTTGCC GCAGCGCGTG CAGTTCACCT TCTATGCGGA CCAGCAGGCG CAGATTCTCG CGCTGCAAGG CCATCAGGCC GACGTGATGG GCACCTTCAC CGTACAGGGC GGTCAAGGTT TGATGAACAA CCCGGAATTC AAGGTGATCG GCGTGAAGTC GAGCGCGCAT CGGCAGATAC ACATGCGCGT CGACAGCCCG CAATTCAAGG ACAAGCGCGT GCGCCAGGCG CTTGCGCTGT CGCTCGATCG CGAGGTCATC GTCAAGGGTC TCTTCAAGGG CCGTGCTCAG GTCGGCAACG ACAGCCCGTT CGCGCCCGCG TTTCCGTCGT CCGATGCAGG CGTGGCGCAG CGCAAGATCG ATGTCGCGAA GGCGAAGCAG CTGCTCGCGC AGGCGGGCGT GCCGAACGGC TTCGACGCGA CGCTCACGAC CGAAAAATAC ATGGAGATTC CCGACCTCGC CGTCGTCGTG CAGAACTATG CGAAGGCGGT TGGCATCCGC ATCAACCTGA AGGTCGAAAG CCAGTCGCAA TACTACGGCT CGGGCACGCC GGGCAAATCG GACTGGCTCG ATTCGCCGCT CGGCATCACC GACTACGGCA GCCGTGGCGT GCCCAATGTG TTCCTGAGCG CGCCGTTGAC GAGCGGCGGC ACGTGGAACG CGGCACACTT CAAGAACCCG CAGTACGACA AGCTGGTTGC AGACTACGTC GCGGCGCTCG ATATCGCCGG GCAGAAGAAA GTCTCCGCGC AGATCCAGAC GCTGCTGCTC GACGAAACGC CCGTGATCTT CCCGTTCTTC TACGATCAGC TGATCGCCGC GCGCAAGCAG TTGAACGGCG TGCGCTTCAC GGCGATCGCG CAGCTGTATT TCGATCGCGC GACGCTCGCG GGCTGA
|
Protein sequence | MKNSADPRFA PRLAGSVDLE AARVGADAHG NHAIDEFLAG RLTRRELLRY ASVIGMSLAG GSLLAPRSAR AQGAAGANAT IRVAHLTPAG AVDPMTVTDA ASLCLLNQTG EFLIDDDGEK QTLKPALALS WKPNDKGDVW TFKLRENVKF HDGQTFTAKD VAATFDRLAD PAAGSAALST LKGVLSKGNT KVVDDHTVAF HLDAPNGNFP YYVSSDNYNA VILPANYAGN YEKTFIGTGP FKLEKYQAKV GASFVRNPDY WGDKALPQRV QFTFYADQQA QILALQGHQA DVMGTFTVQG GQGLMNNPEF KVIGVKSSAH RQIHMRVDSP QFKDKRVRQA LALSLDREVI VKGLFKGRAQ VGNDSPFAPA FPSSDAGVAQ RKIDVAKAKQ LLAQAGVPNG FDATLTTEKY MEIPDLAVVV QNYAKAVGIR INLKVESQSQ YYGSGTPGKS DWLDSPLGIT DYGSRGVPNV FLSAPLTSGG TWNAAHFKNP QYDKLVADYV AALDIAGQKK VSAQIQTLLL DETPVIFPFF YDQLIAARKQ LNGVRFTAIA QLYFDRATLA G
|
| |