Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0448 |
Symbol | |
ID | 4886337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 409836 |
End bp | 410924 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640130389 |
Product | putative ABC transporter, periplasmic substrate-binding protein |
Protein accession | YP_001061454 |
Protein GI | 126443162 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGAAA ACCGGCCATT GCTGACGGCG CTGCGCCGCG CCGCGCTCGC CTTCGGCATG TGCGCGACGC TCGTCGCGAA CGGCGCATCC GCCGAGCCGC TTTACGCGGG CGAAGACGCG CTCTATGCGA AGGCCGCCGA CGAAGGGCTC GTCGTGTCGT TCGACACGGG CCCCGAATGG GCGAACTGGA AGGCGCTGTT CGCGGCGTTC CGCAAGCGCT ATCCGAAGGT GGAGCTCACG TACAACGACA TCGGCTCGGC CGCGACGGTC GTCGCGCTCG ACAAGTCACG CCGCCGTCCG CAGGCGGACA CCGCGTACTA CTTCGCGGCA TCGGCGCTCG ACGCGGCTGG CAAGGACGTC GTCGCGCCGT TCAAGCCGGT CAACTTCGAC AAGCTCCCGC CCGTGTTTCG CGAAGCCGAC GGCCGCTGGT TCACCGTGCA TTCGCTGAAT GTCGCGTTCC TCGTCAACCG CAAGCTCGTG AAGAACGTGC CGCGCCGCTG GTCCGATCTG TTGAAGCCCG AGTACAGGAA CGCGGTCGTC TATCTCGACC CGCGTTCGAC CGGCCAGGGG CAGGTGGTCG TGTTCGCGGC GGCGTCTGCG CTCGGCGGCG GCGTCGACGA TCCGAAGCCC GGCGCGGAAT TCTTCGGAAA GCTAAAGCAT GCGGGCAACG TGCTGCGCAT CGAGGGCACG ACGCCGTATG CGAAGTTCGT CAAGGGTGAG ATCCCGATCC TGATCGGCTA CGAGAACGAC GGCCTGAAGG CGAAGTACGC GGACGGCCTG GGCGATGCGG TCGACGTCGT GATTCCGCAG GACGGCAGCG TGTGCGCGCC GTATGCGATG AGCCTCGTGA AGAACGGGCC GAATCCTGCT GCCGCGCAGC TATGGTTGAA CTTCGTGATG AGCGATGCCG GCCAGGCGCT GTTCGCGCAC GGCTACGTGC GGCCCGCAGT GCCGGGCGTC GCGCTCGCGC CCGACGTCGC GGCGAAGATG CCGAACGCGC CACAGGTGCG TGCGCTCGAC GTCGCGAAGG CCGCCGCGCG CAAGGCCGAA GTCGACCGGC TATGGTCGCA GGCGGCGCTT GGCCAGTAA
|
Protein sequence | MSENRPLLTA LRRAALAFGM CATLVANGAS AEPLYAGEDA LYAKAADEGL VVSFDTGPEW ANWKALFAAF RKRYPKVELT YNDIGSAATV VALDKSRRRP QADTAYYFAA SALDAAGKDV VAPFKPVNFD KLPPVFREAD GRWFTVHSLN VAFLVNRKLV KNVPRRWSDL LKPEYRNAVV YLDPRSTGQG QVVVFAAASA LGGGVDDPKP GAEFFGKLKH AGNVLRIEGT TPYAKFVKGE IPILIGYEND GLKAKYADGL GDAVDVVIPQ DGSVCAPYAM SLVKNGPNPA AAQLWLNFVM SDAGQALFAH GYVRPAVPGV ALAPDVAAKM PNAPQVRALD VAKAAARKAE VDRLWSQAAL GQ
|
| |