Gene BURPS668_A1714 details

Gene Information

Locus tag: BURPS668_A1714
Symbol:
ID: 4888526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1663660
End bp: 1664661
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 70%
IMG OID: 640131652
Product: L-arabinose-binding periplasmic protein
Protein accession: YP_001062709
Protein GI: 126443509
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
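
The coordinates and lengths listed above can be cross-checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length; both assumptions are consistent with the reported values but are not stated in the record itself. Variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1663660
end_bp = 1664661

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1002 bp
print(protein_length, "aa")  # 333 aa
```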


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGATTGC GCTGGCCCCA AGCCGCCCTC GTCTGCGCGA GCCTCGCCGC CGGTTTGTCG 
GCGGCGGCGC CCGCGCATGC GCAAGGCGCG GCCCCGGTGA AGATCGGCTT CGTCGTCAAG
CAGCCCGACG ACCCGTGGTT TCAGGACGAA TGGCGCTTCG CCGAGCAGGC GGCGAAGGAC
AAGCACTTCA CGCTCGTGAA GATCGCCGCG CCGAGCGGCG AGAAGGTGTC GACCGCGCTC
GACAGCCTCG CCGCGCAAAA GGCGCAGGGC GTGATCATCT GCGCGCCCGA CGTGAAGCTC
GGCCCCGGCA TCGCCGCGAA GGCGAGGCGC TACGGGATGA AGCTGATGTC GGTCGACGAT
CAGCTCGTCG ACGGGCGCGG CGCGCCGCTC GCCGACGTGC CGCACATGGG CATTTCCGCA
TACCGGATCG GCCGGCAGGT CGGCGACGCG ATCGCCGCCG AGGCGAAGCG GCGCGGCTGG
AATCCGGCCG AGGTCGGCGT GCTGCGCCTC GCGTACGACC AGTTGCCGAC CGCGCGCGAG
CGCACGACGG GCGCGGTCGA TGCGCTGAAG GCCGCGGGCT TCGCGGCGGC GAACGTCGTC
GACGCGCCGG AGATGACGGC CGATACCGAA GGCGCGTTCA ACGCCGCGAA CATCGCGTTC
ACCAAGCATC GGAACTTCAA GCACTGGGTG GCGTTCGGAT CGAATGACGA CACGACGGTC
GGCGCGGTGC GCGCGGGCGA GGGGCGCGGC ATCGGGGCGG ACGACATGAT CGCGGTCGGC
ATCAACGGCA GCCAGGTCGC GCTGAACGAA TTCGCGAAAC CGAAGCCGAC GGGCTTTTTC
GGCTCGATCC TGCTGAATCC GCGGCTGCAC GGCTACGACA CGTCGGTCAA CATGTACGAC
TGGATCACGC AGAACCGCGC GCCGCCGCCG GTCGTGCTGA CGTCCGGCAC GCTGATCACG
CGCGCGAACG AAAAGACGGC GCGCGCGCAG CTCGGGCTGT GA
 
Protein sequence
MGLRWPQAAL VCASLAAGLS AAAPAHAQGA APVKIGFVVK QPDDPWFQDE WRFAEQAAKD 
KHFTLVKIAA PSGEKVSTAL DSLAAQKAQG VIICAPDVKL GPGIAAKARR YGMKLMSVDD
QLVDGRGAPL ADVPHMGISA YRIGRQVGDA IAAEAKRRGW NPAEVGVLRL AYDQLPTARE
RTTGAVDALK AAGFAAANVV DAPEMTADTE GAFNAANIAF TKHRNFKHWV AFGSNDDTTV
GAVRAGEGRG IGADDMIAVG INGSQVALNE FAKPKPTGFF GSILLNPRLH GYDTSVNMYD
WITQNRAPPP VVLTSGTLIT RANEKTARAQ LGL
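
As a sanity check, the gene sequence above can be translated and compared with the listed protein sequence. The following is a minimal Python sketch, not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code; the tables differ in their permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in 10-mers."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
head = clean("ATGGGATTGC GCTGGCCCCA AGCCGCCCTC")
print(translate(head))  # MGLRWPQAAL
```

Passing the full gene sequence block through `clean` and then `translate` should give a string that can be compared with the protein sequence (after applying `clean` to it as well), and `gc_fraction` can be compared against the reported GC content.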