Gene BURPS1710b_0050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_0050 
Symbol 
ID3690854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp52510 
End bp53628 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content72% 
IMG OID637726506 
Productamino acid ABC transporter, permease protein, putative 
Protein accessionYP_331467 
Protein GI76811966 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTT TCTTCGTCGC CGGCATGATG TACGCGTCAT GGGGCGTGCA CGTGCCGACG 
GTGCGCGACA AGTTCGCGCT GTCGCCCGCG CTGCTGTCGT TCGCACTCCT CGCGGTCGCG
ATCGGCTCGA TCGTCGCGAT GACGACCAAC GCGCGCTGGA TCGCGCGCGT CGGCTCGCGC
ACCGCGTGCC TCGCCGGCGG CCTCGCGATG TCCGCGAGCG GCGCGCTGAT CCTCGTCGTG
CCGACCTACC CGCTGCTGCT CGCGGTGCTC GCGCTGTTCG GCTTCTCGAT GGCGACGCTC
GACGTGGCGA TGAACGCCGA GGCGAGCGCC GTCGAATCGG CATTCGGCAA ACCGATCATG
TCGATGCTGC ACGGCATGTT CAGCGTCGGC GGGATGGCGG GCGCAGCGGC GGGCGGCGCG
CTGCTGTCGG CCGGCATGGC GAGCGCCGTG CATCTCGGGC TCGCCGCGCT CGCGAGCGCG
CTCGTGCTCG CGCTTGCGTG CCCGGCCGTG CTGCCGCATG TCCCGCACAC GGCCGCCGCG
GACGGCGCCC CGCGCGTGAA CCGCTGGCGC TCGCCCGCGC TGTGGGCGCT CGGCGCCATC
GCGCTCGTCG CGTTGATCGC CGAAGGCGCG ATGTACGACT GGGCGACTGT TTACATGCGC
GACGTCGTCC TCGCGTCGCC GGCGTTCTCG AGCGCCGCCT ATGCGGCGTT CTCGGGCGGC
ATGGCCGCCG CGCGCTTCGC GGGCGATGCG GTTCGCGCGC GCTTCGGCGC CCCGCAGCTC
GTCTGCGCGA GCGCGACGCT CGCGTGCGTC GGCATGATCG CCGCGCTCGC GCTGCCGTCC
CCCTTCGTCG CGCTCGCGGG CTTCACGCTG ATGGGTCTCG GCCTCGCCAA CATGATGCCC
GTGCTGTTCG CCGCCGCCGC GCGGATCGAC GGCATTCACG CGGCCGAAGG GCTCGCGCAC
GTCGCGGGAC TCGCGTACTT CGGGCTGCTG TTCGGTCCGG TCGCGATCGG CGCGGTCACG
CAGGCGGCCA ACCTGTCGGT CGGGCTGTCG ATCGTCGCTC TGTGCGCGGC GCTCGTCGCA
ATCGTCGCGC CGAAGGTGCT GAGCCGGCTG AAAATCTGA
 
Protein sequence
MALFFVAGMM YASWGVHVPT VRDKFALSPA LLSFALLAVA IGSIVAMTTN ARWIARVGSR 
TACLAGGLAM SASGALILVV PTYPLLLAVL ALFGFSMATL DVAMNAEASA VESAFGKPIM
SMLHGMFSVG GMAGAAAGGA LLSAGMASAV HLGLAALASA LVLALACPAV LPHVPHTAAA
DGAPRVNRWR SPALWALGAI ALVALIAEGA MYDWATVYMR DVVLASPAFS SAAYAAFSGG
MAAARFAGDA VRARFGAPQL VCASATLACV GMIAALALPS PFVALAGFTL MGLGLANMMP
VLFAAAARID GIHAAEGLAH VAGLAYFGLL FGPVAIGAVT QAANLSVGLS IVALCAALVA
IVAPKVLSRL KI