Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1907 |
Symbol | |
ID | 4884221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1868955 |
End bp | 1869902 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640127835 |
Product | carbohydrate ABC transporter periplasmic sugar-binding protein |
Protein accession | YP_001058942 |
Protein GI | 126440180 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0516898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCACA CCGCCTTGTC GACGCCGGCT TCCGGCCGCC GGGCCGCCCG CGCGCTGCGC GCCGCCGCGC TTTCGCTCGC GCTCGGCGCG GCGAGCGCCG CGCACGCGGC GCCGCTGAAG ATCGGCATGA CGTTCCAGGA ACTGAACAAC CCGTACTTCG TGACGATGCA AAAGGCGCTC GACGAGGCGG CCGCGTCGAT CGGCGCGCAG GTGATCGTCA CCGACGCGCA TCACGACGTG AGCAAGCAGG TGAGCGACGT CGAGGACATG CTGCAGAAGA AGATCGACAT CCTGCTCGTG AACCCGACCG ATTCGACGGG CATCCAGTCG GCCGTCGTGT CGGCGAAGAA GGCGGGCGCC GTCGTCGTCG CGGTGGACGC GAACGCGAAC GGCCCGGTCG ACGCGTTCGT CGGCTCGAAG AATTTCGACG CGGGCGCGAT GTCGTGCGAC TACCTCGCGA AGGCGATCGG CGGCGGCGGC GAAGTCGCGA TCCTCGACGG CATCCCGGTC GTGCCGATTC TCGAGCGCGT GCGCGGCTGC CGCGCGGCGC TCGCGAAATT CCCGAACGTG AGGATCGTCG ACGTGCAGAA CGGCAGGCAG GAGCGCGCGA GCGCGCTCGC CGTCACCGAG AACATGATCC AGGCGCACCC GTTGCTCAAG GGCGTCTTCA GCGTCAACGA CGGCGGCTCG ATGGGCGCGC TGTCCGCGAT CGAGGCGTCG GGCCGCGACA TCAAGCTCAC GAGCGTCGAC GGCGCGCCGG AGGCGATCGC GGCGATGCAG AAGCCGAACT CGAAGTTCAT CGAGACGTCC GCGCAGTTCC CGCGCGACCA GATTCGCCTC GCGATCGGCA TCGGGCTCGC GAAGAAGTGG GGCGCCAATG TGCCGAAGGC GATTCCGGTC GACGTGAAGC GGATCGACAA GGGCAACGCG AAGACGTTCA GTTGGTGA
|
Protein sequence | MMHTALSTPA SGRRAARALR AAALSLALGA ASAAHAAPLK IGMTFQELNN PYFVTMQKAL DEAAASIGAQ VIVTDAHHDV SKQVSDVEDM LQKKIDILLV NPTDSTGIQS AVVSAKKAGA VVVAVDANAN GPVDAFVGSK NFDAGAMSCD YLAKAIGGGG EVAILDGIPV VPILERVRGC RAALAKFPNV RIVDVQNGRQ ERASALAVTE NMIQAHPLLK GVFSVNDGGS MGALSAIEAS GRDIKLTSVD GAPEAIAAMQ KPNSKFIETS AQFPRDQIRL AIGIGLAKKW GANVPKAIPV DVKRIDKGNA KTFSW
|
| |