Gene BURPS1106A_2578 details


Gene Information

Locus tag: BURPS1106A_2578
Symbol:
ID: 4900936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2540285
End bp: 2541586
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 72%
IMG OID: 640135805
Product: major facilitator family transporter
Protein accession: YP_001066832
Protein GI: 126455052
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
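
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive 1-based coordinates and that the reported protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2540285
end_bp = 2541586
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are inclusive, so both end bases count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # length matches coordinates
print(computed_gene_length % 3 == 0)                 # whole number of codons
print(computed_protein_length == protein_length_aa)  # codons minus stop codon
```

Under these assumptions the three fields agree: 1302 bp is 434 codons, of which 433 encode amino acids.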


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.909928
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACA GCATGCAGGA CAGCCTCGAC GAAGCCGCGC GGCCCGCGCG CGTCTCGTGG 
CTCGCGGCGC TGCGCGGCCC GTTCGCTTAC CGCACGTTCG CGGCGATCTG GGTCGCGAGC
CTCGTCGGCA ATATCGGCGG ATCGATTCAG ACCGTCGCCG CGTCGTGGCT GATGACGTCG
ATGGCGCCGT CGCCGACGAT GGTCTCGCTC GTGCAGACGG CGTTCACCTT GCCGATCGCG
CTGTTCGCGC TGCTGTCGGG CGTCGCCGCC GACGCGTGGG ATCGCCGCAC GGTGATGCTG
CTGTCGCAGG CGCTGATGTT CTCGGTTGCG CTGTGTCTCG TCGCGCTCGC CGCCGCGGGC
GCGATGACGC CGGCGCGCCT GCTCGTCTGC ATGTTCGTCG GCGGCTGCGC GGGCGCGATG
TTCCAGCCCG CGTGGCAGTC CGCCGTGACC GAGCAGGTGC CCGCGCGCGA GCTGTCCGCG
GCGATCGCGC TCGACAGCTT CTCGATGAAC TTCGCGCGCA CCGCCGGGCC CGCGCTGGGC
GGCTTCATCG TCGCTTCCGT GTCGCCGAAC GCGGCGTTCG TTCTCAGCGG GCTGTCGTAC
GCGGGGCTCA TCTACGCGCT GTCGCGCTCG ATTCGCGGCG CGGCGGCGCG CCCGCCCGTG
CGCGAGCGCC TCGCGACGAT GCTCGTTCAA GGCGTTCGCT ATTGCGGCCG TGCGCGCGGC
ATTCGCGGCA CGTTGATCCG CAGCAGCCTG TTCGGGTTTC TCGGCAGTCC CGTCTGGGCG
CTGCTGCCGC TCTTCGCGAA AACGCAATTC GGCGGCGAGG CGCGCACCTA CGGCGTGCTG
CTCGCGTCGT TCGGCGCGGG CGCGGCGTCC GGCGCGCTGG GCGGCGCGGC GGGGCGCGCG
CGACTCGGCC GCGAGGCGCT CGTGCGGCTG TGCACGCTCA CGTTCGCCGC CGGCATGCTG
GCGACCGCGT GGAGCCCATG CCAGGCCGTC GCGATGCTGG GCCTCGCCGT CGCGGGCGGT
AGCTGGGTCG TGGTCGTCTC GACTTACAAC CTGACGATCC AGACGGCATC GCCGGCCTGG
GTGGCCGGGC GCTCGCTGTC GCTGTTTCAT TCGTTCATCG TCGGCGGGCT GTCGATCGGC
AGCTATCTCT GGGGCGTCGC CGCGCAGGGC AGCTCGATCA ACTCGGCGTT CGCGGTATCG
GCGCTGATGA TGGCGGCGTC GGCGTGTCTC GCGGCATGGC TGCCGCTGCC CACGCACGAG
GCGCTCGGCG AGCGGACGCA CGGCGAGCCG CGGCGGACAT GA
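
The record reports a GC content of 72%. A figure like this could be recomputed from the gene sequence with the sketch below. It assumes GC content means the fraction of G and C bases among all bases, which is the usual definition but is not stated by the source. The helper name `gc_content` is hypothetical, and the whitespace stripping handles the 10-base grouping used in the listing.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G/C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Demonstration on the first line of the gene sequence only;
# paste the full listing to recompute the value for the whole gene.
first_line = "ATGACCGACA GCATGCAGGA CAGCCTCGAC GAAGCCGCGC GGCCCGCGCG CGTCTCGTGG"
print(f"{gc_content(first_line):.0%}")
```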
 
Protein sequence
MTDSMQDSLD EAARPARVSW LAALRGPFAY RTFAAIWVAS LVGNIGGSIQ TVAASWLMTS 
MAPSPTMVSL VQTAFTLPIA LFALLSGVAA DAWDRRTVML LSQALMFSVA LCLVALAAAG
AMTPARLLVC MFVGGCAGAM FQPAWQSAVT EQVPARELSA AIALDSFSMN FARTAGPALG
GFIVASVSPN AAFVLSGLSY AGLIYALSRS IRGAAARPPV RERLATMLVQ GVRYCGRARG
IRGTLIRSSL FGFLGSPVWA LLPLFAKTQF GGEARTYGVL LASFGAGAAS GALGGAAGRA
RLGREALVRL CTLTFAAGML ATAWSPCQAV AMLGLAVAGG SWVVVVSTYN LTIQTASPAW
VAGRSLSLFH SFIVGGLSIG SYLWGVAAQG SSINSAFAVS ALMMAASACL AAWLPLPTHE
ALGERTHGEP RRT
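
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes translation table 11, as listed in the record. That table assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are illustrative. Alternative start codons are not handled specially, which does not matter here because the gene starts with ATG.

```python
# NCBI-style compact codon table: bases in TCAG order, '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring any trailing partial codon.
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGACCGACA GCATGCAGGA CAGCCTCGAC"))  # MTDSMQDSLD
print(translate("CGGCGGACAT GA"))                     # RRT
```

The two fragments match the first ten and last three residues of the protein sequence listed above.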