Gene BamMC406_4052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamMC406_4052 
Symbol 
ID6180941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria MC40-6 
KingdomBacteria 
Replicon accessionNC_010552 
Strand
Start bp1075804 
End bp1077366 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content64% 
IMG OID641683822 
Productextracellular solute-binding protein 
Protein accessionYP_001810733 
Protein GI172063082 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.544759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAGC CGCATTCGTT CCCGGTGTTC CGTCCACGCG CACTTTTCGC CGCGGGGGCA 
GGCGCGCTCG CGCTGTCGGT CGCGGCACCC GCGTTCGCGC AACAGAACGT CGTGGTCGCC
GTGTACTCGA CGTTCACGAC GATGGACCCG TACGACGCGA ACGACACGGT GTCGCAAGCG
GTCGTCAAGT CGTTCTACGA AGGGCTGTTC GGCTTCGACA AGGACATGAA GCTCGTCAAC
GTGCTCGCGA CGGGCTATCA GGCGTCGCCC GATGCGAAGG TCTATACGGT CAAGCTGCGC
GACGGCGTGA AGTTCCAGGA CGGCACCGAC TTCAACGCCG CCGCGGTGAA GGCGAACTTC
GACCGCGTGA CCGATCCGGC GAACAAGCTG AAGCGTTACG GGCTGTTCCG CGTGATCGAG
AAGACCGAGG TGGTCGATCC GATGACGGTG AAGTTCACGC TGCGCGAGCC GTTCTCCGCG
TTCATCAACA CGCTCGCGCA CCCGTCGGCG GTGATGATTT CGCCGGCCGC GCTGAAGAAG
TGGGGGCGCG ACGTATCGCT GCATCCGGTC GGCACGGGCC CGTTCGAGTT CGTCGAATGG
AAGCAGACCG ACGACATGAA GGTGAAGAAG TTTGCGGGCT ACTGGAAGAA GGGCTATCCG
AAGGTCGATT CGATCGACTG GAAGCCGGTG GTCGACAACA ACACGCGCGC CGCGCTGCTC
AAGACCGGCG AGGCGGATTT CGCGTTCACG ATTCCGTTCG AGCAGGCCGC CGACCTGAAG
AGCAACCCGA AGGTCGAGCT GATCGAGCGG CCGTCGATCA TCCAGCGCTA CATCTCGCTG
AACACGCAGA AGAAGCCGTT CGACAATCCG AAGGTGCGTG AGGCGCTGAA CTACGCGGTC
AACAAGGAAG CGCTGGCGAA GGTCGTGTTC GCCGGCTATG CGACGCCGCA GACGGGAGTC
GCACCGATCG GCGTCGAATA CGCGACCAAG CTCGGGCCGT GGCCGTACGA TCCGGCGAAG
GCGCGCGCGT TGCTGAAGGA GGCCGGCTAT CCGAACGGCT TCGAATCGAC GCTCTGGTCC
GCGTACAACC ATTCGACGGC GCAGAAGCTG ATCCAGTTCG TGCAGCAGCA GCTCGCGCAG
GTCGGCGTGA AGGTGCAGGT GCAGGCGCTG GAAGCCGGCG AGCGGGTCGC GAAGGTCGAG
AGCGCGCAGG ATCCGGCCGC GGCGCCGGTG CGGATGTACT ACAGCGGCTG GTCGGCATCG
ACGGGCGAGG CGAACTGGGC GCTGTCGCCG CTGCTCGCGT CCACGTCGGC GCCGCCGAAG
CTCTACAACA CGGCGTACTA CAAGAACAGT GCGGTCGACG ATGCGCTCGC GAAGGCGCTC
GAGACGACCG ACCGTGCGAA GAAGGCCGCC CTGTACGCCG ACGCGCAGAA GCAGGTGTGG
GCCGACGCGC CGTGGATTTT CCTGGTGCAG GAGAAGATCG TTTATGCGCG TAACAAGCGC
CTGCACGGCA TGTACGTCAT GCCGGACGGT TCGTTCAACT TCGACGAAAT CTCGGTGAAA
TGA
 
Protein sequence
MNKPHSFPVF RPRALFAAGA GALALSVAAP AFAQQNVVVA VYSTFTTMDP YDANDTVSQA 
VVKSFYEGLF GFDKDMKLVN VLATGYQASP DAKVYTVKLR DGVKFQDGTD FNAAAVKANF
DRVTDPANKL KRYGLFRVIE KTEVVDPMTV KFTLREPFSA FINTLAHPSA VMISPAALKK
WGRDVSLHPV GTGPFEFVEW KQTDDMKVKK FAGYWKKGYP KVDSIDWKPV VDNNTRAALL
KTGEADFAFT IPFEQAADLK SNPKVELIER PSIIQRYISL NTQKKPFDNP KVREALNYAV
NKEALAKVVF AGYATPQTGV APIGVEYATK LGPWPYDPAK ARALLKEAGY PNGFESTLWS
AYNHSTAQKL IQFVQQQLAQ VGVKVQVQAL EAGERVAKVE SAQDPAAAPV RMYYSGWSAS
TGEANWALSP LLASTSAPPK LYNTAYYKNS AVDDALAKAL ETTDRAKKAA LYADAQKQVW
ADAPWIFLVQ EKIVYARNKR LHGMYVMPDG SFNFDEISVK