Gene BMA10247_A1065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10247_A1065 
Symbol 
ID4890788 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10247 
KingdomBacteria 
Replicon accessionNC_009079 
Strand
Start bp1004670 
End bp1006262 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID640147339 
Productsolute-binding family 5 protein 
Protein accessionYP_001078258 
Protein GI126445727 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCACA TGCTGTCCAA GCTCGCGGCG AGCGCCGCAC TCGCCGCGCT GGCCCCGGTG 
CTGGCCCCCG CGCACGCGGC CACGCCGCCC GGCATCTTCG TGATCGCGAC GCAGCTCGGC
GAATTCACGA CGCTCGACCC GAGCGAAATC TACGAGCTCG TGCCGTCCGA ATACGTCGCG
AACACGTACG AGCGGCTCGT GCGCGTCGAC CTGCGCGAAC CGTCGAAATT CGAAGGCCGG
ATCGCGCAAT CGTGGAGCGT CGGCGCGGAC GGCCTCACCT ACACGTTCAA GCTGCGCACC
GGCCTGAAGT TCCACTCGGG CAATCCGGTG ACGGCCGACG ACGTGGCGTG GTCGCTGCAG
CGCACGGTGC TGCTCGACAA AGGGCCGGCC GGCGTGCTCG CGGACCTCGG CCTGACCAAG
GACAACGTCG CGCGGAAGGT ACGCAAGCTC GACGACACGA CCGTGTCGAT CGAGACCGAC
CGCCGGTACG CGCCGAGCTT CGTGCTGAAC GTGCTGAGCG CGGACCCGGC ATCGATCGTC
GACAAGCAGT TGCTGCTCTC GCACGAGAAG AACGGCGACT TCGGCAATGC ATGGCTGAAG
AACGCGGATG CCGGCTCGGG CCCGTACCGG CTCGTCAAGT GGACGCCGAA CGAAAGCCTC
GTGCTGCAAC GCTTCGACGG CTACCGCGCG CCGTATCCGA TGAAGCGCAT CGTGTTGCGG
CACGTGCCGG AAGCGTCCGC GCAGCGCCTG CTGCTCGAGA ACGGCGACGT CGACGCCGCG
CGCAACCTGA GCCCCGACAG CCTTGCTGCG CTGTCGAAGG CGGGCAAGAT CCACGTCGCG
TCATGGCCCG TGTCCGCGCT GCTGTACCTG AGCCTGAACA CGAGGAATCC GAATCTCGCG
AAGCCCGAGG TGCAGGAAGC GATGAAGTGG CTCGTCGATT ACGACGGCAT CCAGCGCAAC
ATCGTCAGGA CGACGTACAA GGTGCATCAG ACCTTCCTGC CGGACGGCTT CCTCGGCGCG
CTGGACGCGA ATCCGTACCG GCAGAACGTC GCGAAGGCGA AGGCGCTGCT CGCGAAGGCC
GGCCTGCCGA ACGGCTTCGC GGTAACGATG GACATGCCGA ACGATTACCC GTACGTCGAG
ATCGCGCAGG CGTTGCAGGC GAACTTCGCG CAGGGCGGCA TCCAGGTGAA GCTGATTCCG
GGCGACGCGA AACAGGCGAT CGGCAAGTAC CGTGCGCGCC AGCACGACAT CTTCATCGGC
GAATGGTCGC CGGACTACAT GGACCCGAAC AGCAACGCGC GCGGTTTCGC GTGGAATCCC
GACAATTCGG ACAACGCCAA GCACAAGCTG CTCGCGTGGC GCAACGGCTG GGATGTGCCG
CAACTGACCG CGAAGACCGA TGCGGCGCTC GCCGAGCCGT CGGCCGCGAA GCGCGCGCAG
GACTATCAGG CGCTGCAAAA GGCGGTGCTC GCGAATTCGC CGTTCGTGAT CCTGTTCGAG
AAGGTCGTGC AGGTTGCGAC GCGGCCGGGT GTCACGGGCC CGGAAATCGG GCCGATCAAC
GATCTCGTGT CGTATCGGAC CTTGAAGAAG TAA
 
Protein sequence
MKHMLSKLAA SAALAALAPV LAPAHAATPP GIFVIATQLG EFTTLDPSEI YELVPSEYVA 
NTYERLVRVD LREPSKFEGR IAQSWSVGAD GLTYTFKLRT GLKFHSGNPV TADDVAWSLQ
RTVLLDKGPA GVLADLGLTK DNVARKVRKL DDTTVSIETD RRYAPSFVLN VLSADPASIV
DKQLLLSHEK NGDFGNAWLK NADAGSGPYR LVKWTPNESL VLQRFDGYRA PYPMKRIVLR
HVPEASAQRL LLENGDVDAA RNLSPDSLAA LSKAGKIHVA SWPVSALLYL SLNTRNPNLA
KPEVQEAMKW LVDYDGIQRN IVRTTYKVHQ TFLPDGFLGA LDANPYRQNV AKAKALLAKA
GLPNGFAVTM DMPNDYPYVE IAQALQANFA QGGIQVKLIP GDAKQAIGKY RARQHDIFIG
EWSPDYMDPN SNARGFAWNP DNSDNAKHKL LAWRNGWDVP QLTAKTDAAL AEPSAAKRAQ
DYQALQKAVL ANSPFVILFE KVVQVATRPG VTGPEIGPIN DLVSYRTLKK