Gene Information |
Locus tag | Bmul_4021 |
Symbol | |
ID | 5769850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia multivorans ATCC 17616 |
Kingdom | Bacteria |
Replicon accession | NC_010086 |
Strand | - |
Start bp | 996962 |
End bp | 997942 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641318324 |
Product | aliphatic sulfonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001583996 |
Protein GI | 161520569 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
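The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 996962
END_BP = 997942


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (981 bp) and Protein Length (326 aa) fields.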
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.391639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.714624 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGTT TCCCCCGCTG GATCGCCCGC ACCGTCGCCA CCGCACTCGT CGCCCTGTCG GCCGCGTCCG TCTGCGCGCC GGCGAGCGCG GGCCAGGTCG TACGCATCGG CTATCAGAAG GCCGGGCTGC TCGCGATCAT CCACGCGCAG CATTCGCTCG AAGCGCGGCT GAAGCCGCTC GGCTACGACG TGCAATGGTT CGAGTTTCCG GCCGGCCCGC AGCTGCTCGA AGCGCTGAAC GCGAACGGCA TCGACTTCGG CTATACGGGC GCGCCGCCGC CTGTGTTCGC GCAGGCGGCC GGCGTGCGCT TCGTGTACGT CGGCGCGGAG CCGCCGGCGC CGCACAACGA GGCGGTGTTC GTGAAGGCCG ATTCGCCGAT CCGCTCGGTG GCCGAGCTGC GCGGCAAGCG CGTCGCGCTG CAGAAAGGCT CGAGCGCGAA CTACCTGCTG CTCGAAGCGC TGAACAAGGC CGGCGTGCGC TACGACGAGA TCCGCCCGGT CTACCTGCCG CCCGCCGATG CGCGTGCCGC CTTCGAAAGC GGGCACGTCG ACGCGTGGGC CGTCTGGGAC CCGTATTACG CGGCCGCGCA AAACGCGTTG AAGATCCGCA CGCTGTCCGA TTACACGGGC CTCACGCCGA CCAACAACTT CTACGAGGCG ACGCGCGATT TCGCGCAGCA GCATCCCGAC GTGGTTGCCG CGATTCTCGC GCAGGCGCGC GAGACCGGCG CATGGGTGAA CGGTCATCCG GCCGAGACGG CCGCGCTGAT CGCGCCGACT GTCGGCATGC CGGCGCCGCT CGTCGAAACC TGGATCAAGC GCGTGCCGTT CGGCGCGGTG CCGGTCGACG AGAAGATCGT CGCGGTTCAG CAGCGTGTCG CCGATGCGTT CCTCGCGGCG AAGCTGATTC CGCAGAAGCT GAATGTCGCC GACAACGCGT GGATCGACCG CCGCGTCGCG GCCGCGCTCG CCGCGAAATA G
|
Protein sequence | MIRFPRWIAR TVATALVALS AASVCAPASA GQVVRIGYQK AGLLAIIHAQ HSLEARLKPL GYDVQWFEFP AGPQLLEALN ANGIDFGYTG APPPVFAQAA GVRFVYVGAE PPAPHNEAVF VKADSPIRSV AELRGKRVAL QKGSSANYLL LEALNKAGVR YDEIRPVYLP PADARAAFES GHVDAWAVWD PYYAAAQNAL KIRTLSDYTG LTPTNNFYEA TRDFAQQHPD VVAAILAQAR ETGAWVNGHP AETAALIAPT VGMPAPLVET WIKRVPFGAV PVDEKIVAVQ QRVADAFLAA KLIPQKLNVA DNAWIDRRVA AALAAK
|
| |
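The gene and protein sequences above can be related by translating the gene sequence codon by codon. The following is a minimal sketch, not the pipeline that produced this record. It builds the codon table by hand, using the amino acid assignments that translation table 11 shares with the standard code. It also assumes the gene sequence is already shown in coding orientation (it begins with ATG even though the strand is "-"). Only the first and last blocks of the sequence are used here; the helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in TCAG codon order; table 11 uses the same
# assignments as the standard code and differs only in permitted start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last blocks of the gene sequence, copied from the record.
first_blocks = "ATGATTCGTT TCCCCCGCTG GATCGCCCGC"
last_codons = "GCCGCGCTCG CCGCGAAATA G"

print(translate(first_blocks))  # MIRFPRWIAR
print(translate(last_codons))   # AALAAK
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields.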