Gene BURPS668_2524 details

Gene Information

Locus tag: BURPS668_2524
Symbol:
ID: 4882389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2495107
End bp: 2496408
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 72%
IMG OID: 640128452
Product: major facilitator family transporter
Protein accession: YP_001059551
Protein GI: 126439132
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
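
The length fields above are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2495107, 2496408)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both results match the Gene Length and Protein Length values listed in the record.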


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGACA GCATGCAGGA CAGCCTCGAC GAAGCCGCGC GGCCCGCGCG CGTCTCGTGG 
CTCGCGGCGC TGCGCGGCCC GTTCGCTTAC CGCACGTTCG CGGCGATCTG GGTCGCGAGC
CTCGTCGGCA ATATCGGCGG ATCGATTCAG ACCGTCGCCG CGTCGTGGCT GATGACGTCG
ATGGCGCCGT CGCCGACGAT GGTCTCGCTC GTGCAGACGG CGTTCACCTT GCCGATCGCG
CTGTTCGCGC TGCTGTCGGG CGTCGCCGCC GACGCGTGGG ATCGCCGCAC GGTGATGCTG
CTGTCGCAGG CGCTGATGTT CTCGGTTGCG CTGTGTCTCG TCGCGCTCGC CGCCGCGGGC
GCGATGACGC CGGCGCGCCT GCTCGTCTGC ATGTTCGTCG GCGGCTGCGC GGGCGCGATG
TTCCAGCCCG CGTGGCAGTC CGCCGTGACC GAGCAGGTGC CCGCGCGCGA GCTGTCCGCG
GCGATCGCGC TCGACAGCTT CTCGATGAAC TTCGCGCGCA CCGCCGGGCC CGCGCTGGGC
GGCTTCATCG TCGCTTCCGT GTCGCCGAAC GCGGCGTTCG TTCTCAGCGG GCTGTCGTAC
GCGGGGCTCA TCTACGTGCT GTCGCGCTCG ATTCGCGGCG CGGCGGCGCG CCCGCCCGTG
CGCGAGCGCC TCGCGACGAT GCTCGTTCAA GGCGTTCGCT ATTGCGGCCG TGCGCGCGGC
ATTCGCGGCA CGTTGATCCG CAGCAGCCTG TTCGGGTTTC TCGGCAGTCC CGTCTGGGCG
CTGCTGCCGC TCTTCGCGAA AACGCAATTC GGCGGCGAGG CGCGCACCTA CGGCGTGCTG
CTCGCGTCGT TCGGCGCGGG CGCGGCGTCC GGCGCGCTGG GCGGCGCGGC GGGGCGCGCG
CGACTCGGCC GCGAGGCGCT CGTGCGGCTG TGCACGCTCA CGTTCGCCGC CGGCATGCTG
GCGACCGCGT GGAGCCCATG CCAGGCCGTC GCGATGCTGG GCCTCGCCGT CGCGGGCGGT
AGCTGGGTCG TGGTCGTCTC GACTTACAAC CTGACGATCC AGACGGCATC GCCGGCCTGG
GTGGCCGGGC GCTCGCTGTC GCTGTTTCAT TCGTTCATCG TCGGCGGGCT GTCGATCGGC
AGCTATCTCT GGGGCGTCGC CGCGCAGGGC AGCTCGATCA ACTCGGCGTT CGCGGTATCG
GCGCTGATGA TGGCGGCGTC GGCGTGTCTC GCGGCATGGC TGCCGCTGCC CACGCACGAG
GCGCTCGGCG AGCGGACGCA CGGCGAGCCG CGGCGGACAT GA
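
The GC content value listed under Gene Information can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied as plain text in the 10-base grouped layout used here. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs on only the first 30 bases of the gene, not the full sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base groups of the gene sequence, for illustration only.
prefix = "ATGACCGACA GCATGCAGGA CAGCCTCGAC"
print(round(gc_percent(prefix), 1))
```

Passing the complete gene sequence to `gc_percent` should give a value that can be compared against the GC content field.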
 
Protein sequence
MTDSMQDSLD EAARPARVSW LAALRGPFAY RTFAAIWVAS LVGNIGGSIQ TVAASWLMTS 
MAPSPTMVSL VQTAFTLPIA LFALLSGVAA DAWDRRTVML LSQALMFSVA LCLVALAAAG
AMTPARLLVC MFVGGCAGAM FQPAWQSAVT EQVPARELSA AIALDSFSMN FARTAGPALG
GFIVASVSPN AAFVLSGLSY AGLIYVLSRS IRGAAARPPV RERLATMLVQ GVRYCGRARG
IRGTLIRSSL FGFLGSPVWA LLPLFAKTQF GGEARTYGVL LASFGAGAAS GALGGAAGRA
RLGREALVRL CTLTFAAGML ATAWSPCQAV AMLGLAVAGG SWVVVVSTYN LTIQTASPAW
VAGRSLSLFH SFIVGGLSIG SYLWGVAAQG SSINSAFAVS ALMMAASACL AAWLPLPTHE
ALGERTHGEP RRT
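
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own procedure. It uses the standard codon assignments, which translation table 11 shares for every codon; table 11 differs in the set of permitted start codons, and this sketch does not handle alternative starts (this gene begins with ATG, so that is not needed here). The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop display spaces and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate_cds("ATGACCGACA GCATGCAGGA CAGCCTCGAC"))  # MTDSMQDSLD
```

The output matches the first ten residues of the protein sequence. Translating the final four codons of the gene (`CGGCGGACATGA`) gives `RRT`, which matches the end of the protein sequence.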