Gene Information |
Locus tag | BURPS1106A_4031 |
Symbol | |
ID | 4899811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3934379 |
End bp | 3935518 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640137257 |
Product | putative periplasmic substrate-binding protein |
Protein accession | YP_001068250 |
Protein GI | 126451657 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
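The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the terminal stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3934379
END_BP = 3935518


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1140, matching "Gene Length"
print(protein_length(length_bp))  # 379, matching "Protein Length"
```

Under these assumptions, the 1140 bp gene corresponds to 380 codons, that is, 379 residues plus one stop codon.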
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACACA CGATGAAAAA GCTGGCAGGC GCGACGTTCG TCGCGGTCAT GTCGCTCGCG GGGACGGCGC ACGCGGATGA CGTCAAGATC GGCTTTGCCG CGCCGATGAC GGGCGCGCAG GCGCACTACG GCAAGGATAT GCAGAACGGG ATCGTGCTCG CGATCGAGGA CATCAACGCG ACGAAGCCGA CGATCGGCGG CAAGCCGGTG AAGTTCGTGC TCGACACGCA GGACGATCAG GCCGACCCGC GCACCGGCAC GACGGTCGCG CAGAAGCTCG TCGACGACGG CATCAAGGGC ATGCTCGGCC ACTTCAACTC GGGCACGACG ATTCCGGCTT CGCGCATCTA CGCGAACGCG GGCATCCCGC AGATCGCGAT GGCGACGGCG CCCGAGTACA CGACGCAGGG CTACAAGACG ACCTTCCGGA TGATGACGTC CGACACGCAG CAGGGCTCGG TCGCGGGCAC GTTCGCGGTG AAGGATCTCG GCATGAAGAA GATCGCGATC GTCGACGATC GCACCGCTTA CGGCCAGGGC CTTGCCGACC AGTTCGAGAA GGCGGCGAAG GCGGCGGGCG CGACGATCGT CGATCGTGAA TTCACGAACG ACAAGGCTGT CGACTTCAAG GCGATCCTGA CGAAGCTCAA GGCGACGAAG CCGGACCTCG TCTACTACGG CGGCGCGGAT TCGCAGGCCG CGCCGATGGC CAAGCAGATG AAGTCGCTCG GCGTCACGGC GCCGCTGATG GGCGGCGAGA TGGTGCACAC GCCGACCTTC CTGAAGATCG CGGGCGACGC GGCCGAAGGC TCGATCGCTT CGCTCGCCGG CCTGCCGCTC GCCGAAATGC CCGGCGGCAA GGCGTACGCG GACAAGTACA AGAAGCGCTT CGGCGAAGAC GTGCAGACGT ACTCGCCGTA TGCGTACGAC GGCGCGATGG CGATGTTCAG CGCGATGAAG AAGGCAAACT CGACGGACCC GGCGAAGTAT CTGCCGCTGC TCGCGAAGAC CGACATGGCG GGCGTGACGT CGACGCACAT CGCGTATGAC GCGAAGGGCG ACCTGAGGAA CGGCGGCATC ACGATGTACA AGGTCGAGAA GGGCGAATGG AAGCCCCTGA AGAGCATCGG CGGCAAGTAA
Protein sequence | MQHTMKKLAG ATFVAVMSLA GTAHADDVKI GFAAPMTGAQ AHYGKDMQNG IVLAIEDINA TKPTIGGKPV KFVLDTQDDQ ADPRTGTTVA QKLVDDGIKG MLGHFNSGTT IPASRIYANA GIPQIAMATA PEYTTQGYKT TFRMMTSDTQ QGSVAGTFAV KDLGMKKIAI VDDRTAYGQG LADQFEKAAK AAGATIVDRE FTNDKAVDFK AILTKLKATK PDLVYYGGAD SQAAPMAKQM KSLGVTAPLM GGEMVHTPTF LKIAGDAAEG SIASLAGLPL AEMPGGKAYA DKYKKRFGED VQTYSPYAYD GAMAMFSAMK KANSTDPAKY LPLLAKTDMA GVTSTHIAYD AKGDLRNGGI TMYKVEKGEW KPLKSIGGK
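The relationship between the two sequences above can be sketched in plain Python. This is a minimal illustration, not the method used to generate the record. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA even though the gene is on the minus strand). It uses the standard amino acid assignments, which translation table 11 shares with table 1 for all 64 codons. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCAACACA CGATGAAAAA GCTGGCAGGC"
print(translate(prefix))  # MQHTMKKLAG, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the Protein sequence and GC content fields to be compared against the record.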