Gene Bcen_4203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen_4203 
Symbol 
ID4094523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia AU 1054 
KingdomBacteria 
Replicon accessionNC_008061 
Strand
Start bp1401609 
End bp1403171 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content64% 
IMG OID638017494 
Productextracellular solute-binding protein 
Protein accessionYP_624062 
Protein GI107026551 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.237738 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGT CGTATTCGTT CCCGATGTTC CGTCCGCGCG CGCTGTTCGC CGCGGGGGCC 
GGCGCGCTCG CGCTGTCGGT CGCGGTGCCC GCGTTCGCGC AGCAGAACGT CGTGGTCGCC
GTGTACTCGA CGTTTACGAC GATGGACCCG TACGACGCGA ACGATACGGT GTCGCAGGCT
GTCGTCAAGT CGTTCTACGA AGGCCTGTTC GGTTTCGACC GGAACATGAA GCTCGTCAAC
GTGCTGGCGA CCAGCTATAC GGCGTCGCCG GACGCGAAGG TGTACACGGT CAAACTGCGC
CAGGGCGTGA AGTTCCACGA CGGCACCGAC TTCAATGCCG CCGCGGTGAA GGCGAATTTC
GACCGCGTGA CCGATCCGGC GAACAAGCTG AAGCGGTACG GCCTGTTCCG GGTGATCGAG
AAGACCGAAG TGGTCGATCC GAACACCGTG CGATTCACGT TGCGCGAGCC GTTCTCGGCG
TTCATCAACA CGCTCGCGCA CCCGTCCGCG GTGATGATTT CGCCGGCCGC GCTGAAGAAG
TGGGGGCGTG ACGTGTCGCT GCATCCGGTC GGCACCGGCC CGTTCGAGTT CGTCGAATGG
AAGCAGACCG ACGACATGAA GGTGAAGAAA TTCGCCGGCT ACTGGAAGAA GGGCTATCCG
AAGGTCGATG CGATCGACTG GAAACCGGTG GTCGACAACA ACACGCGCGC CGCGCTGATC
AAGACCGGCG AGGCCGATTT CGCGTTCACG ATTCCGTTCG AGCAGGCGAC CGATCTGAAG
AGCAATCCGA AGGTGGACTT GATCGAGGCG CCGTCGATCA TCCAGCGCTA CATTTCGCTG
AACACGCGGC AAAAGCCGTT CGATAACCCG AAGGTGCGCG AAGCGCTGAA CTACGCGGTC
AACAAGGAGG CGCTCGCGAA GGTCGTGTTC GCCGGTTACG CGACGCCGCA GACGGGCGTG
GCGCCGACGG GCGTCGAATA CGCAACGAAA CTCGGGCCCT GGCCGTATGA CCCGGCGAAG
GCGCGCGCGC TGCTGAAGGA GGCCGGCTAT CCGAACGGCT TCGAATCGAC GCTGTGGTCC
GCCTACAATC ACACGACGGC GCAGAAGGTG ATCCAGTTCG TCCAGCAGCA GCTCGCGCAG
GTCGGCGTGA AGGTGCAGGT GCAGGCGCTC GAGGCCGGCG AACGGGTTGC CCGGGTGGAG
AGCGCCCAGG ATGCGGCGAA GGCGCCGGTG CGGATGTACT ACAGCGGCTG GTCGGCGTCG
ACGGGCGAGG CGAACTGGGC CCTGTCGCCG CTGCTTGCGT CGGAGTCGGC GCCGCCGAAG
TTGTACAACA CGGCGTACTA CAAGAACGGT CTGGTCGACG ACGATCTCGC GCAGGCACTC
TCCACGACCG ATCGCGCGAA GAAGGCCAGC CTCTACGCCG ATGCGCAGAA GCAGATCTGG
GCCGACGCGC CGTGGATCTT CCTCGTGCAG GAGAAGATCG TCTACGCACG CAGCAAGCGC
CTGCAGGGCA TGTACGTGAT GCCGGACGGC TCGTTCAACT TCGACGAAAT CTCGCTGAAA
TGA
 
Protein sequence
MNKSYSFPMF RPRALFAAGA GALALSVAVP AFAQQNVVVA VYSTFTTMDP YDANDTVSQA 
VVKSFYEGLF GFDRNMKLVN VLATSYTASP DAKVYTVKLR QGVKFHDGTD FNAAAVKANF
DRVTDPANKL KRYGLFRVIE KTEVVDPNTV RFTLREPFSA FINTLAHPSA VMISPAALKK
WGRDVSLHPV GTGPFEFVEW KQTDDMKVKK FAGYWKKGYP KVDAIDWKPV VDNNTRAALI
KTGEADFAFT IPFEQATDLK SNPKVDLIEA PSIIQRYISL NTRQKPFDNP KVREALNYAV
NKEALAKVVF AGYATPQTGV APTGVEYATK LGPWPYDPAK ARALLKEAGY PNGFESTLWS
AYNHTTAQKV IQFVQQQLAQ VGVKVQVQAL EAGERVARVE SAQDAAKAPV RMYYSGWSAS
TGEANWALSP LLASESAPPK LYNTAYYKNG LVDDDLAQAL STTDRAKKAS LYADAQKQIW
ADAPWIFLVQ EKIVYARSKR LQGMYVMPDG SFNFDEISLK