Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bphyt_2501 |
Symbol | |
ID | 6283352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia phytofirmans PsJN |
Kingdom | Bacteria |
Replicon accession | NC_010681 |
Strand | - |
Start bp | 2823341 |
End bp | 2825182 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 642622060 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_001896121 |
Protein GI | 187924479 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0677964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.0181569 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTGCG GCGTGCTGGC CGTGGCTGGT CTTCTCGTGG CGCCGCGGGC GCACGCGGTC TATGCGATCG CGCAATACGG CGAGCCGAAG TATCTGGCGG ATTTCAAGCA CTTCGATTAC GTCAATCCGA ATGCGCCCAA GGGCGGCACG CTGGTGCTAG CCAATCCGAG CCGCCTCACC AGCTTCGACA AATTCAATCC GTTCACGCTG CGCGGCAATA CGGCGCCCGG TGTCGACCTG ATGTTCGAAA GCCTCACCGT CGGCAGCAGC GACGAAGTGG CGTCCGCCTA CGGGCTGCTC GCCGACGACA TCAGCATCGC GCCGGACGGC CTCTCGGTGA CGTTCCACAT CAATCCGCTC GCACGCTTTT CGAACGGCGA TCCGGTCACC GCCGACGACG TCAAGTTCTC GCTCGACACG CTGAAAAGCC CGCAGGCCGC GCCGCAATTC GCATCGATCT TCGGCGAAAT CACGCGCGCG GTGGTGGTCG ATCCGCATAC GATCCGCTTC GAGTTTCACC AGCGCAATCG CGAGTTGCCT CTTCTCGCGG GCGGCATACC GGTGTTCTCG CGCAAGTGGG GGATGAAGCC GGACGGCAGC CGCATTGCAT TCGACCAGCT CGCGTTCGAA AAGCCGATCG GCAGCGGGCC TTATCTGATC GAGCAGTACG ATAACGGCCG CACCATCACC TATCGGCGCG ACCCGAATTA CTGGGGCGCG GCGCTGCCGG TGCGCGTCGG CATGAATAAC TTCGATCGCA TCGTCTACAA GCTGTATTCG GATAACACGG CGCGGCTGGA GGCGTTCAAG GCAGGCGAGT ACGACGCACT GGTCGAGTAC GTCGCGCGCA ATTGGGTGCG GCGTGACGTC GGCAAGAAAT TCGACAGCGG CGAACTGATC AAGCAGGTGT TTCCTCAACA TAACGGCACC GGCATGCAGG GCTTCATGCT GAACACGCGG CGGTCCTTGT TTCAGGATGT GCGCGTGCGC AAGGCACTCG ATCTCGCGCT CGACTTTCAA TGGCTCAATC GCCAGTTGTT CTTCAATCAG TACACGCGTA TCGACAGTTT CTTCGCCAAC ACCGATCTGC AGGCGAAGGG TCTGCCTTCA CCGGGCGAGT TGGCCTTGCT CGAACCGTGG CGCGCGAAGC TTGACCCGGC CGTGTTCGGT CCGCCGCCGA AGCAGCCGGA CACCGACCCG CCCGGCTCGC TGCGCGCCAA TCTGCTGGAG GCGCGCGCGC TGTTGCAGCA GGCCGGCTGG ACCTATCGCG ACGGCGCGTT GCGCAACGCC AAGGGCGAGC CGTTCCGCTT CGAGATTCTC GACGACTCCG GTTCGTCCGC GCAGATGGAG CCGATCGTCG CGACCTTCAT CCGCAATCTG CAGAAGCTGG GCATCCAGGC CACGTTTCGC GTGTCGGATT TCGCGGTCTA TCAGAAGCGT CTGGACGCCT TCGACTTTGA CACCACCACG ATCCGCATGC CGGACGTGCA GGTGCCCGGC TCCGAGCAGA TCGAACGATT CGGCAGCAAG GCGGCCGATA CGCAGGGTTC CGATAACATG ATCGGGCTGA AGTCGCCTGT CGTGGATGCG ATCCTGAATG CGCTTGTGCA TGCGCAAACA CGCGAGCAAC TGGTCGACGC CACGCACGCG CTCGACCGTG TGCTGATGCA TGGCTACTAT GTCGTGCCGC ATTGGTACAG CGCCACGCAT CGGGTGGCGT TCAAGCGCGG TCTTGCGTGG CCGAAAACGC TACCCCTGTA CTATTCAGCG GAAGGCTGGA TTACATCGAT GTGGTGGTTC GCGCAGCCAC CATCGCAATC GCAACCGCCG GCGCAACCTT AA
|
Protein sequence | MMCGVLAVAG LLVAPRAHAV YAIAQYGEPK YLADFKHFDY VNPNAPKGGT LVLANPSRLT SFDKFNPFTL RGNTAPGVDL MFESLTVGSS DEVASAYGLL ADDISIAPDG LSVTFHINPL ARFSNGDPVT ADDVKFSLDT LKSPQAAPQF ASIFGEITRA VVVDPHTIRF EFHQRNRELP LLAGGIPVFS RKWGMKPDGS RIAFDQLAFE KPIGSGPYLI EQYDNGRTIT YRRDPNYWGA ALPVRVGMNN FDRIVYKLYS DNTARLEAFK AGEYDALVEY VARNWVRRDV GKKFDSGELI KQVFPQHNGT GMQGFMLNTR RSLFQDVRVR KALDLALDFQ WLNRQLFFNQ YTRIDSFFAN TDLQAKGLPS PGELALLEPW RAKLDPAVFG PPPKQPDTDP PGSLRANLLE ARALLQQAGW TYRDGALRNA KGEPFRFEIL DDSGSSAQME PIVATFIRNL QKLGIQATFR VSDFAVYQKR LDAFDFDTTT IRMPDVQVPG SEQIERFGSK AADTQGSDNM IGLKSPVVDA ILNALVHAQT REQLVDATHA LDRVLMHGYY VVPHWYSATH RVAFKRGLAW PKTLPLYYSA EGWITSMWWF AQPPSQSQPP AQP
|
| |