Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1410 |
Symbol | |
ID | 4888642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1321471 |
End bp | 1323063 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640131350 |
Product | solute-binding family 5 protein |
Protein accession | YP_001062408 |
Protein GI | 126443233 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.139309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCACA TGCTGTCCAA GCTCGCGGCG AGCGCCGCAC TCGCCGCGCT GGCCCCGGTG CCGGCCCCCG CGCACGCGGC CACGCCGCCC GGCATCTTCG TGATCGCGAC GCAGCTCGGC GAATTCACGA CGCTCGACCC GGGCGAAATC TACGAGCTCG TGCCGTCCGA ATACGTCGCG AACACGTACG AGCGGCTCGT GCGCGTCGAC CTGCGCGAAC CGTCGAAATT CGAAGGCCGG ATCGCGCAAT CGTGGAGCGT CGGCGCGGAC GGCCTCACCT ACACGTTCAA GCTGCGCACC GGCCTGAAGT TCCACTCGGG CAATCCGGTG ACGGCCGACG ACGTGGCGTG GTCGCTGCAG CGCACGGTGC TGCTCGACAA AGGGCCGGCC GGCGTGCTCG CGGACCTCGG CCTGACCAAG GACAACGTCG CGCGGAAGGT ACGCAAGCTC GACGACACGA CCGTGTCGAT CGAGACCGAC CGCCGGTACG CGCCGAGCTT CGTGCTGAAC GTGCTGAGCG CGGACCCGGC ATCGATCGTC GACAAGCAGT TGCTGCTCTC GCACGAGAAG AACGGCGACT TCGGCAATGC ATGGCTGAAG AACGCGGATG CCGGCTCGGG CCCGTACCGG CTCGTCAAGT GGACGCCGAA CGAAAGCCTC GTGCTGCAAC GCTTCGACGG CTACCGCGCG CCGTATCCGA TGAAGCGCAT CGTGTTGCGG CACGTGCCGG AAGCGTCCGC GCAGCGCCTG CTGCTCGAGA ACGGCGACGT CGACGCCGCG CGCAACCTGA GCCCCGACAG CCTTGCTGCG CTGTCGAAGG CGGGCAAGAT CCACGTCGCG TCGTGGCCCG TGTCCGCGCT GCTGTACCTG AGCCTGAACA CGAGGAATCC GAATCTCGCG AAGCCCGAGG TGCAGGAAGC GATGAAGTGG CTCGTCGATT ACGACGGCAT CCAGCGCAAC ATCGTCAGGA CGACGTACAA GGTGCATCAG ACCTTCCTGC CGGACGGCTT CCTCGGCGCG CTGGACGCGA ATCCGTACCG GCAGAACGTC GCGAAGGCGA AGGCGCTGCT CGCGAAGGCC GGCCTGCCGA ACGGCTTCGC GGTAACGATG GACATGCCGA ACGATTACCC GTACGTCGAG ATCGCGCAGG CGTTGCAGGC GAACTTCGCG CAGGGCGGCA TCCAGGTGAA GCTGATTCCG GGCGACGCGA AACAGGCGAT CGGCAAGTAC CGTGCGCGCC AGCACGACAT CTTCATCGGC GAATGGTCGC CGGACTACAT GGACCCGAAC AGCAACGCGC GCGGTTTCGC GTGGAATCCC GACAATTCGG ACAACGCCAA GCACAAGCTG CTCGCGTGGC GCAACGGCTG GGATGTGCCG CAACTGACCG CGAAGACCGA TGCGGCGCTC GCCGAGCCGT CGGCCGCGAA GCGCGCGCAG GACTATCAGG CGCTGCAAAA GGCGGTGCTC GCGAATTCGC CGTTCGTGAT CCTGTTCGAG AAGGTCGTGC AGGTTGCGAC GCGGCCGGGC GTCACGGGCC CGGAAATCGG GCCGATCAAC GATCTCGTGT CGTATCGGAC CTTGAAGAAG TAA
|
Protein sequence | MKHMLSKLAA SAALAALAPV PAPAHAATPP GIFVIATQLG EFTTLDPGEI YELVPSEYVA NTYERLVRVD LREPSKFEGR IAQSWSVGAD GLTYTFKLRT GLKFHSGNPV TADDVAWSLQ RTVLLDKGPA GVLADLGLTK DNVARKVRKL DDTTVSIETD RRYAPSFVLN VLSADPASIV DKQLLLSHEK NGDFGNAWLK NADAGSGPYR LVKWTPNESL VLQRFDGYRA PYPMKRIVLR HVPEASAQRL LLENGDVDAA RNLSPDSLAA LSKAGKIHVA SWPVSALLYL SLNTRNPNLA KPEVQEAMKW LVDYDGIQRN IVRTTYKVHQ TFLPDGFLGA LDANPYRQNV AKAKALLAKA GLPNGFAVTM DMPNDYPYVE IAQALQANFA QGGIQVKLIP GDAKQAIGKY RARQHDIFIG EWSPDYMDPN SNARGFAWNP DNSDNAKHKL LAWRNGWDVP QLTAKTDAAL AEPSAAKRAQ DYQALQKAVL ANSPFVILFE KVVQVATRPG VTGPEIGPIN DLVSYRTLKK
|
| |