Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A1714 |
Symbol | |
ID | 4888526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1663660 |
End bp | 1664661 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640131652 |
Product | L-arabinose-binding periplasmic protein |
Protein accession | YP_001062709 |
Protein GI | 126443509 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGATTGC GCTGGCCCCA AGCCGCCCTC GTCTGCGCGA GCCTCGCCGC CGGTTTGTCG GCGGCGGCGC CCGCGCATGC GCAAGGCGCG GCCCCGGTGA AGATCGGCTT CGTCGTCAAG CAGCCCGACG ACCCGTGGTT TCAGGACGAA TGGCGCTTCG CCGAGCAGGC GGCGAAGGAC AAGCACTTCA CGCTCGTGAA GATCGCCGCG CCGAGCGGCG AGAAGGTGTC GACCGCGCTC GACAGCCTCG CCGCGCAAAA GGCGCAGGGC GTGATCATCT GCGCGCCCGA CGTGAAGCTC GGCCCCGGCA TCGCCGCGAA GGCGAGGCGC TACGGGATGA AGCTGATGTC GGTCGACGAT CAGCTCGTCG ACGGGCGCGG CGCGCCGCTC GCCGACGTGC CGCACATGGG CATTTCCGCA TACCGGATCG GCCGGCAGGT CGGCGACGCG ATCGCCGCCG AGGCGAAGCG GCGCGGCTGG AATCCGGCCG AGGTCGGCGT GCTGCGCCTC GCGTACGACC AGTTGCCGAC CGCGCGCGAG CGCACGACGG GCGCGGTCGA TGCGCTGAAG GCCGCGGGCT TCGCGGCGGC GAACGTCGTC GACGCGCCGG AGATGACGGC CGATACCGAA GGCGCGTTCA ACGCCGCGAA CATCGCGTTC ACCAAGCATC GGAACTTCAA GCACTGGGTG GCGTTCGGAT CGAATGACGA CACGACGGTC GGCGCGGTGC GCGCGGGCGA GGGGCGCGGC ATCGGGGCGG ACGACATGAT CGCGGTCGGC ATCAACGGCA GCCAGGTCGC GCTGAACGAA TTCGCGAAAC CGAAGCCGAC GGGCTTTTTC GGCTCGATCC TGCTGAATCC GCGGCTGCAC GGCTACGACA CGTCGGTCAA CATGTACGAC TGGATCACGC AGAACCGCGC GCCGCCGCCG GTCGTGCTGA CGTCCGGCAC GCTGATCACG CGCGCGAACG AAAAGACGGC GCGCGCGCAG CTCGGGCTGT GA
|
Protein sequence | MGLRWPQAAL VCASLAAGLS AAAPAHAQGA APVKIGFVVK QPDDPWFQDE WRFAEQAAKD KHFTLVKIAA PSGEKVSTAL DSLAAQKAQG VIICAPDVKL GPGIAAKARR YGMKLMSVDD QLVDGRGAPL ADVPHMGISA YRIGRQVGDA IAAEAKRRGW NPAEVGVLRL AYDQLPTARE RTTGAVDALK AAGFAAANVV DAPEMTADTE GAFNAANIAF TKHRNFKHWV AFGSNDDTTV GAVRAGEGRG IGADDMIAVG INGSQVALNE FAKPKPTGFF GSILLNPRLH GYDTSVNMYD WITQNRAPPP VVLTSGTLIT RANEKTARAQ LGL
|
| |