Gene BURPS668_A1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1410 
Symbol 
ID4888642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1321471 
End bp1323063 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content66% 
IMG OID640131350 
Productsolute-binding family 5 protein 
Protein accessionYP_001062408 
Protein GI126443233 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.139309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCACA TGCTGTCCAA GCTCGCGGCG AGCGCCGCAC TCGCCGCGCT GGCCCCGGTG 
CCGGCCCCCG CGCACGCGGC CACGCCGCCC GGCATCTTCG TGATCGCGAC GCAGCTCGGC
GAATTCACGA CGCTCGACCC GGGCGAAATC TACGAGCTCG TGCCGTCCGA ATACGTCGCG
AACACGTACG AGCGGCTCGT GCGCGTCGAC CTGCGCGAAC CGTCGAAATT CGAAGGCCGG
ATCGCGCAAT CGTGGAGCGT CGGCGCGGAC GGCCTCACCT ACACGTTCAA GCTGCGCACC
GGCCTGAAGT TCCACTCGGG CAATCCGGTG ACGGCCGACG ACGTGGCGTG GTCGCTGCAG
CGCACGGTGC TGCTCGACAA AGGGCCGGCC GGCGTGCTCG CGGACCTCGG CCTGACCAAG
GACAACGTCG CGCGGAAGGT ACGCAAGCTC GACGACACGA CCGTGTCGAT CGAGACCGAC
CGCCGGTACG CGCCGAGCTT CGTGCTGAAC GTGCTGAGCG CGGACCCGGC ATCGATCGTC
GACAAGCAGT TGCTGCTCTC GCACGAGAAG AACGGCGACT TCGGCAATGC ATGGCTGAAG
AACGCGGATG CCGGCTCGGG CCCGTACCGG CTCGTCAAGT GGACGCCGAA CGAAAGCCTC
GTGCTGCAAC GCTTCGACGG CTACCGCGCG CCGTATCCGA TGAAGCGCAT CGTGTTGCGG
CACGTGCCGG AAGCGTCCGC GCAGCGCCTG CTGCTCGAGA ACGGCGACGT CGACGCCGCG
CGCAACCTGA GCCCCGACAG CCTTGCTGCG CTGTCGAAGG CGGGCAAGAT CCACGTCGCG
TCGTGGCCCG TGTCCGCGCT GCTGTACCTG AGCCTGAACA CGAGGAATCC GAATCTCGCG
AAGCCCGAGG TGCAGGAAGC GATGAAGTGG CTCGTCGATT ACGACGGCAT CCAGCGCAAC
ATCGTCAGGA CGACGTACAA GGTGCATCAG ACCTTCCTGC CGGACGGCTT CCTCGGCGCG
CTGGACGCGA ATCCGTACCG GCAGAACGTC GCGAAGGCGA AGGCGCTGCT CGCGAAGGCC
GGCCTGCCGA ACGGCTTCGC GGTAACGATG GACATGCCGA ACGATTACCC GTACGTCGAG
ATCGCGCAGG CGTTGCAGGC GAACTTCGCG CAGGGCGGCA TCCAGGTGAA GCTGATTCCG
GGCGACGCGA AACAGGCGAT CGGCAAGTAC CGTGCGCGCC AGCACGACAT CTTCATCGGC
GAATGGTCGC CGGACTACAT GGACCCGAAC AGCAACGCGC GCGGTTTCGC GTGGAATCCC
GACAATTCGG ACAACGCCAA GCACAAGCTG CTCGCGTGGC GCAACGGCTG GGATGTGCCG
CAACTGACCG CGAAGACCGA TGCGGCGCTC GCCGAGCCGT CGGCCGCGAA GCGCGCGCAG
GACTATCAGG CGCTGCAAAA GGCGGTGCTC GCGAATTCGC CGTTCGTGAT CCTGTTCGAG
AAGGTCGTGC AGGTTGCGAC GCGGCCGGGC GTCACGGGCC CGGAAATCGG GCCGATCAAC
GATCTCGTGT CGTATCGGAC CTTGAAGAAG TAA
 
Protein sequence
MKHMLSKLAA SAALAALAPV PAPAHAATPP GIFVIATQLG EFTTLDPGEI YELVPSEYVA 
NTYERLVRVD LREPSKFEGR IAQSWSVGAD GLTYTFKLRT GLKFHSGNPV TADDVAWSLQ
RTVLLDKGPA GVLADLGLTK DNVARKVRKL DDTTVSIETD RRYAPSFVLN VLSADPASIV
DKQLLLSHEK NGDFGNAWLK NADAGSGPYR LVKWTPNESL VLQRFDGYRA PYPMKRIVLR
HVPEASAQRL LLENGDVDAA RNLSPDSLAA LSKAGKIHVA SWPVSALLYL SLNTRNPNLA
KPEVQEAMKW LVDYDGIQRN IVRTTYKVHQ TFLPDGFLGA LDANPYRQNV AKAKALLAKA
GLPNGFAVTM DMPNDYPYVE IAQALQANFA QGGIQVKLIP GDAKQAIGKY RARQHDIFIG
EWSPDYMDPN SNARGFAWNP DNSDNAKHKL LAWRNGWDVP QLTAKTDAAL AEPSAAKRAQ
DYQALQKAVL ANSPFVILFE KVVQVATRPG VTGPEIGPIN DLVSYRTLKK