Gene BURPS1710b_A2574 details

Gene Information

Locus tag: BURPS1710b_A2574
Symbol:
ID: 3693339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007435
Strand:
Start bp: 3095600
End bp: 3097192
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 66%
IMG OID: 637732828
Product: solute-binding family 5 protein
Protein accession: YP_337724
Protein GI: 76818567
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
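
The coordinate and length fields in this record are consistent with each other, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3095600
END_BP = 3097192


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1593
print(protein_length(length_bp))  # 530
```

Under those assumptions the computed values match the Gene Length (1593 bp) and Protein Length (530 aa) fields.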


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCACA TGCTGTCCAA GCTCGCGGCA AGCGCCGCAC TCGCCGCGCT GGCCCCGGTG 
CTGGCCCCCG CGCACGCGGC CACGCCGCCC GGCATCTTCG TGATCGCGAC GCAGCTCGGC
GAATTCACGA CGCTCGACCC GAGCGAAATC TACGAGCTCG TGCCGTCCGA ATACGTCGCG
AACACGTACG AGCGGCTCGT GCGCGTCGAC CTGCGCGAAC CGTCGAAATT CGAAGGCCGG
ATCGCGCAAT CGTGGAGCGT CGGCGCGGAC GGCCTCACCT ACACGTTCAA GCTGCGCACC
GGCCTGAAGT TCCACTCGGG CAATCCGGTG ACGGCCGACG ACGTGGCGTG GTCGCTGCAG
CGCACGGTGC TGCTCGACAA AGGGCCGGCC GGCGTGCTCG CGGACCTCGG CCTGACCAAG
GACAACGTCG CGCGGAAGGT ACGCAAGCTC GACGACACGA CCGTGTCGAT CGAGACCGAC
CGCCGGTACG CGCCGAGCTT CGTGCTGAAC GTGCTGAGCG CGGACCCGGC ATCGATCGTC
GACAAGCAGT TGCTGCTCTC GCACGAGAAG AACGGCGACT TCGGCAATGC ATGGCTGAAG
AACGCGGATG CCGGCTCGGG CCCGTACCGG CTCGTCAAGT GGACGCCGAA CGAAAGCCTC
GTGCTGCAAC GCTTCGACGG CTACCGCGCG CCGTATCCGA TGAAGCGCAT CGTGTTGCGG
CACGTGCCGG AAGCGTCCGC GCAGCGCCTG CTGCTCGAGA ACGGCGACGT CGACGCCGCG
CGCAACCTGA GCCCCGACAG CCTTGCTGCG CTGTCGAAGG CGGGCAAGAT CCACGTCGCG
TCATGGCCCG TGTCCGCGCT GCTGTACCTG AGCCTGAACA CGAGGAATCC GAATCTCGCG
AAGCCCGAGG TACAGGAAGC GATGAAGTGG CTCGTCGATT ACGACGGCAT CCAGCGCAAC
ATCGTCAGGA CGACGTACAA GGTGCATCAG ACCTTCCTGC CGGACGGCTT CCTCGGCGCG
CTGGACGCGA ATCCGTACCG GCAGAACGTC GCGAAGGCGA AGGCGCTGCT CGCGAAGGCC
GGCCTGCCGA ACGGCTTCGC GGTAACGATG GACATGCCGA ACGATTACCC GTACGTCGAG
ATCGCGCAGG CGTTGCAGGC GAACTTCGCG CAGGGCGGCA TCCAGGTGAA GCTGATTCCG
GGCGACGCGA AACAGGCGAT CGGCAAGTAC CGTGCGCGCC AGCACGACAT CTTCATCGGC
GAATGGTCGC CGGACTACAT GGACCCGAAC AGCAACGCGC GCGGTTTCGC GTGGAATCCC
GACAATTCGG ACAACGCCAA GCACAAGTTG CTCGCGTGGC GCAACGGCTG GGATGTGCCG
CAACTGACCG CGAAGACCGA TGCGGCGCTC GCCGAGCCGT CGGCCGCGAA GCGCGCGCAG
GACTATCAGG CGCTGCAAAA GGCGGTGCTC GCGAATTCGC CGTTCGTGAT CCTGTTCGAG
AAGGTCGTGC AGGTTGCGAC GCGGCCGGGT GTCACGGGCC CGGAAATCGG GCCGATCAAC
GATCTCGTGT CGTATCGGAC CTTGAAGAAG TAA
 
Protein sequence
MKHMLSKLAA SAALAALAPV LAPAHAATPP GIFVIATQLG EFTTLDPSEI YELVPSEYVA 
NTYERLVRVD LREPSKFEGR IAQSWSVGAD GLTYTFKLRT GLKFHSGNPV TADDVAWSLQ
RTVLLDKGPA GVLADLGLTK DNVARKVRKL DDTTVSIETD RRYAPSFVLN VLSADPASIV
DKQLLLSHEK NGDFGNAWLK NADAGSGPYR LVKWTPNESL VLQRFDGYRA PYPMKRIVLR
HVPEASAQRL LLENGDVDAA RNLSPDSLAA LSKAGKIHVA SWPVSALLYL SLNTRNPNLA
KPEVQEAMKW LVDYDGIQRN IVRTTYKVHQ TFLPDGFLGA LDANPYRQNV AKAKALLAKA
GLPNGFAVTM DMPNDYPYVE IAQALQANFA QGGIQVKLIP GDAKQAIGKY RARQHDIFIG
EWSPDYMDPN SNARGFAWNP DNSDNAKHKL LAWRNGWDVP QLTAKTDAAL AEPSAAKRAQ
DYQALQKAVL ANSPFVILFE KVVQVATRPG VTGPEIGPIN DLVSYRTLKK
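
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch of how the two relate, the code below strips the spacing from the first line of the gene sequence and translates it with the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean` and `translate` are illustrative only and do not come from any library.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
# Translation table 11 uses the same amino-acid assignments as table 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAGCACA TGCTGTCCAA GCTCGCGGCA AGCGCCGCAC TCGCCGCGCT GGCCCCGGTG"
print(translate(clean(first_line)))  # MKHMLSKLAASAALAALAPV
```

The 20 residues produced match the start of the protein sequence listed above (MKHMLSKLAA SAALAALAPV).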