Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_1028 |
Symbol | |
ID | 6284130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | + |
Start bp | 1158425 |
End bp | 1159672 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642620589 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_001894671 |
Protein GI | 187923029 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00732462 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0899654 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTC GCGCGATCAT GGGCGCTTTG TGCGCCGCAG GTCTGATGTG TGGCGTCTCG GCCGTGCAAG CCGCCGAGTC GATCGAAGTG TTGCACTGGT GGACCTCGGG CGGCGAATCG AAAGCCGTCG GCGTCCTCAA GGACGACATG ACGAAGCAGG GTTACACGTG GAAGGACTTC GCGGTTGCGG GTGGCGCCGG CGCGGCTGCC ATGACGGCAC TCAAGACGCA AGTGATCTCG GGCAACGCAC CGAGCGCTGC GCAGATCAAA GGTCCGCTGA TCCAGGACTG GGCGTCGCAA GGCGTGCTGG TGCCGATCGA CGCGGCCGCC GGCGACTGGA AGAAGAACCT GCCGCCGGAA ATCGACAAGA TTATGCACGC GGACGGTCAC TACGTCGCAG CGCCGTTCTC GGTGCACCGC GTGAACTGGC TGTACATCAA CAAGGCAGCG TTGGACAAGG CGGGCGGCAA GGCGCCGACC ACGTGGCCTG AGTTCTTCGC GGTGGCCGAC AAGATGAAGG CCGCGGGCAT CCAGCCGATC GCGATGGGCG GCCAGCCGTG GCAAGACCTG ACGCTGTGGG AAGACGTCGT GCTGTCGCAA GGCGCGGACT TCTACAAGAA GGCGCTGGTC GACCTCGACG AGAAGACGCT GACTTCGGAC AAGATGGTCG GCGTGTTCGA CACGGTCCGC AAGATCCAGG GTTACTTCGA CGCGGGCCGC ACGGGTCGTG ACTGGAACCT GGCAACGGCT ATGGTCATCA ACGGCAAGGC CGGCATGCAG TTCATGGGCG ACTGGGCGAA GGGCGAATTC GCCAACGCCG GCAAGAAGTC GGGCTCGGAC TACATCTGCG CCGCTGTCCC GGGCACGGAA AAGTCCTACA CGTTCAACGT CGACTCGTTC GTGTTCTTCC AGCAGAAAGG CCAGAAGGCG GCAACGCCGG GTCAGCTCGC GCTGGCGAAG ACGATCATGT CGCCGGAGTT CCAGGAACAG TTCAGCCTGA ACAAGGGGTC CATCCCGGTT CGTCTGGGCG TGTCGATGGC CAAGTTCGAC GACTGCGCGA AGAAGTCGTA CGCGGATGAA CAAGTGGCGA TCAAGTCGGG CGGCTATGTG CCTTCGCTGG CACACGGCAT GGCGCAACCG GATGCAGCAG CCGGCGCGAT CTCGGACGTG GTCACGAAGT TCATGAACTC GCAGCAGGAT TCGAAGAGCG CAGTTGCCGC GCTCGCGAAG GCAGCGAAGA CCAAGTAA
|
Protein sequence | MKFRAIMGAL CAAGLMCGVS AVQAAESIEV LHWWTSGGES KAVGVLKDDM TKQGYTWKDF AVAGGAGAAA MTALKTQVIS GNAPSAAQIK GPLIQDWASQ GVLVPIDAAA GDWKKNLPPE IDKIMHADGH YVAAPFSVHR VNWLYINKAA LDKAGGKAPT TWPEFFAVAD KMKAAGIQPI AMGGQPWQDL TLWEDVVLSQ GADFYKKALV DLDEKTLTSD KMVGVFDTVR KIQGYFDAGR TGRDWNLATA MVINGKAGMQ FMGDWAKGEF ANAGKKSGSD YICAAVPGTE KSYTFNVDSF VFFQQKGQKA ATPGQLALAK TIMSPEFQEQ FSLNKGSIPV RLGVSMAKFD DCAKKSYADE QVAIKSGGYV PSLAHGMAQP DAAAGAISDV VTKFMNSQQD SKSAVAALAK AAKTK
|
| |