Gene Information |
Locus tag | Bcen_4203 |
Symbol | |
ID | 4094523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008061 |
Strand | - |
Start bp | 1401609 |
End bp | 1403171 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638017494 |
Product | extracellular solute-binding protein |
Protein accession | YP_624062 |
Protein GI | 107026551 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
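The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length agrees with that convention) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1401609, 1403171)
print(length_bp)                  # 1563
print(protein_length(length_bp))  # 520
```

Both values match the Gene Length and Protein Length rows of the record.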
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.237738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACAAGT CGTATTCGTT CCCGATGTTC CGTCCGCGCG CGCTGTTCGC CGCGGGGGCC GGCGCGCTCG CGCTGTCGGT CGCGGTGCCC GCGTTCGCGC AGCAGAACGT CGTGGTCGCC GTGTACTCGA CGTTTACGAC GATGGACCCG TACGACGCGA ACGATACGGT GTCGCAGGCT GTCGTCAAGT CGTTCTACGA AGGCCTGTTC GGTTTCGACC GGAACATGAA GCTCGTCAAC GTGCTGGCGA CCAGCTATAC GGCGTCGCCG GACGCGAAGG TGTACACGGT CAAACTGCGC CAGGGCGTGA AGTTCCACGA CGGCACCGAC TTCAATGCCG CCGCGGTGAA GGCGAATTTC GACCGCGTGA CCGATCCGGC GAACAAGCTG AAGCGGTACG GCCTGTTCCG GGTGATCGAG AAGACCGAAG TGGTCGATCC GAACACCGTG CGATTCACGT TGCGCGAGCC GTTCTCGGCG TTCATCAACA CGCTCGCGCA CCCGTCCGCG GTGATGATTT CGCCGGCCGC GCTGAAGAAG TGGGGGCGTG ACGTGTCGCT GCATCCGGTC GGCACCGGCC CGTTCGAGTT CGTCGAATGG AAGCAGACCG ACGACATGAA GGTGAAGAAA TTCGCCGGCT ACTGGAAGAA GGGCTATCCG AAGGTCGATG CGATCGACTG GAAACCGGTG GTCGACAACA ACACGCGCGC CGCGCTGATC AAGACCGGCG AGGCCGATTT CGCGTTCACG ATTCCGTTCG AGCAGGCGAC CGATCTGAAG AGCAATCCGA AGGTGGACTT GATCGAGGCG CCGTCGATCA TCCAGCGCTA CATTTCGCTG AACACGCGGC AAAAGCCGTT CGATAACCCG AAGGTGCGCG AAGCGCTGAA CTACGCGGTC AACAAGGAGG CGCTCGCGAA GGTCGTGTTC GCCGGTTACG CGACGCCGCA GACGGGCGTG GCGCCGACGG GCGTCGAATA CGCAACGAAA CTCGGGCCCT GGCCGTATGA CCCGGCGAAG GCGCGCGCGC TGCTGAAGGA GGCCGGCTAT CCGAACGGCT TCGAATCGAC GCTGTGGTCC GCCTACAATC ACACGACGGC GCAGAAGGTG ATCCAGTTCG TCCAGCAGCA GCTCGCGCAG GTCGGCGTGA AGGTGCAGGT GCAGGCGCTC GAGGCCGGCG AACGGGTTGC CCGGGTGGAG AGCGCCCAGG ATGCGGCGAA GGCGCCGGTG CGGATGTACT ACAGCGGCTG GTCGGCGTCG ACGGGCGAGG CGAACTGGGC CCTGTCGCCG CTGCTTGCGT CGGAGTCGGC GCCGCCGAAG TTGTACAACA CGGCGTACTA CAAGAACGGT CTGGTCGACG ACGATCTCGC GCAGGCACTC TCCACGACCG ATCGCGCGAA GAAGGCCAGC CTCTACGCCG ATGCGCAGAA GCAGATCTGG GCCGACGCGC CGTGGATCTT CCTCGTGCAG GAGAAGATCG TCTACGCACG CAGCAAGCGC CTGCAGGGCA TGTACGTGAT GCCGGACGGC TCGTTCAACT TCGACGAAAT CTCGCTGAAA TGA
Protein sequence | MNKSYSFPMF RPRALFAAGA GALALSVAVP AFAQQNVVVA VYSTFTTMDP YDANDTVSQA VVKSFYEGLF GFDRNMKLVN VLATSYTASP DAKVYTVKLR QGVKFHDGTD FNAAAVKANF DRVTDPANKL KRYGLFRVIE KTEVVDPNTV RFTLREPFSA FINTLAHPSA VMISPAALKK WGRDVSLHPV GTGPFEFVEW KQTDDMKVKK FAGYWKKGYP KVDAIDWKPV VDNNTRAALI KTGEADFAFT IPFEQATDLK SNPKVDLIEA PSIIQRYISL NTRQKPFDNP KVREALNYAV NKEALAKVVF AGYATPQTGV APTGVEYATK LGPWPYDPAK ARALLKEAGY PNGFESTLWS AYNHTTAQKV IQFVQQQLAQ VGVKVQVQAL EAGERVARVE SAQDAAKAPV RMYYSGWSAS TGEANWALSP LLASESAPPK LYNTAYYKNG LVDDDLAQAL STTDRAKKAS LYADAQKQIW ADAPWIFLVQ EKIVYARSKR LQGMYVMPDG SFNFDEISLK
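The gene sequence is listed in coding orientation (it begins with ATG and ends with TGA), so the protein sequence can be reproduced by translating it directly. The following is a minimal sketch under these assumptions: the 10-base spacing is only display formatting, and the codon assignments of translation table 11 are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). `CODON_TABLE`, `clean` and `translate` are illustrative names.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):  # an incomplete trailing codon is ignored
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks of the gene sequence from the record.
print(translate("ATGAACAAGT CGTATTCGTT"))  # MNKSYS
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two fields agree.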