Gene BURPS668_A1591 details


Gene Information

Locus tag: BURPS668_A1591
Symbol:
ID: 4887142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1525627
End bp: 1527030
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 73%
IMG OID: 640131530
Product: major facilitator transporter
Protein accession: YP_001062587
Protein GI: 126445559
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
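
The length fields in this record can be cross-checked from the coordinates. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length // 3 - 1


length = gene_length_bp(1525627, 1527030)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Both results agree with the Gene Length (1404 bp) and Protein Length (467 aa) fields.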


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCCA TCACCGCCCC TTCGTGGGCA GAGCTGCTGT CCGGCCGCAA CGGCTTGCGC 
TCGATCGCGC TCGCAGGCGG GGTCGCGCTG CATGCGATCA ACATCTACAT CGCGACGACC
ATCCTGCCTT CGGTCGTGCG CGACATCGGC GGGCTCGAAT ATTACGCATG GAACACCACG
CTGTTCATGG CCGCCTCGAT CGTCGGCGCG CCGCTGTCCG CGAATGTCCT GAGCCGGTTC
GGGCCGCGCG CCGCGTATCT CGTCGCGCTC GTCGTGTTCT GCGCGGGCAC GCTCGCGTGC
GCGGGCGCGA AGGACATGCC GTGGATGCTC GTCGGCCGGG CCGCGCAAGG CTTCGGCGGC
GGCATCCTGT TCGCGCTCAG CTACGCGCTG ATCCGCATCG TGTTCGACGA GCGGCTGTGG
TCGCGCGCGA TGGCGATGGT CTCCGGCATG TGGGGCGTCG CGACGCTGTG CGGGCCCGCG
ATCGGCGGCG TGTTCGCGCA ATCGGGCACG TGGCGGCTCG CGTTCGTCGC GCTCGTGCCC
GTCGCCGCGG TGCTCGCGCT GATCGTGATC GTTCAGTTGC CCGCGCGCGA AGCATCGGGG
GCGCGGGCCG CGCGGCCCGC GATCGGCAAG ATCCTGCTGC TCGCGGTGTC GGTACTCGTC
GTGTCGGTCG CGAGCCTGTC CAAGGCGATC GTCGCGAACG TCACGGGCGT CGCCGCGGGC
CTCGCGGTCG CGCTGCTGAT CGCGCGCCTC GAGCGCGGCG CGACGCGCCG GCTGCTGCCG
ACGGGCGCCT ACGACGTGCG CGCGCCGCTC GGCGCGATCT ACGCGTGCAT GAGCCTGCTC
GTGATCGGCA TGACGACCGA GATCTTCGTG CCGTACTTCC TGCAGATCAT CCACGGCTAC
CCGCCGCTTC TCGCCGGCTA CCTGACCGCG CTGATGGCGG CCGGCTGGAC CGCCGGCTCG
CTGTTCAGCT CGGGGCGCAG CGGCGCGGCC GCGCAGGCGC TCGTGCGCGG CGGGCCGCTC
GTCGTTGTGA TCGCGCTCGT CGCGCTCGCG CTCGTCGTGC CGCCGCAGCA CCTGCTCGCG
GGCGGCGCCG GCCTCGCCGC GCTGTGCGCG GCGCTCGCGG CGGTGGGCGT CGGCATCGGC
GTGGGCTGGC CGCATCTGCT CACGCAGGTG CTGACGAACG CGCCGGCGGG CCAGGAAGAT
CTCGCGTCGA CGTCGATCAC GACCGTCCAG CTCTATGCGA CCGCGATCGG CTCCGCGCTC
GCGGGCCTTG TCGCGAACCT CGCCGGCTTC TCCGCGCCCG GCGGCCTCGC CGGCGCGCAG
CATGCGGCCG CGTGGCTGTT CGCGGTGTTC GCGGCGGCGC CCGTGCTCGC CGCGATCGTC
GCGCGCCGCG TGCGCGCGCG ATGA
 
Protein sequence
MSSITAPSWA ELLSGRNGLR SIALAGGVAL HAINIYIATT ILPSVVRDIG GLEYYAWNTT 
LFMAASIVGA PLSANVLSRF GPRAAYLVAL VVFCAGTLAC AGAKDMPWML VGRAAQGFGG
GILFALSYAL IRIVFDERLW SRAMAMVSGM WGVATLCGPA IGGVFAQSGT WRLAFVALVP
VAAVLALIVI VQLPAREASG ARAARPAIGK ILLLAVSVLV VSVASLSKAI VANVTGVAAG
LAVALLIARL ERGATRRLLP TGAYDVRAPL GAIYACMSLL VIGMTTEIFV PYFLQIIHGY
PPLLAGYLTA LMAAGWTAGS LFSSGRSGAA AQALVRGGPL VVVIALVALA LVVPPQHLLA
GGAGLAALCA ALAAVGVGIG VGWPHLLTQV LTNAPAGQED LASTSITTVQ LYATAIGSAL
AGLVANLAGF SAPGGLAGAQ HAAAWLFAVF AAAPVLAAIV ARRVRAR
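
The sequences above are printed in blocks of ten characters, so the whitespace must be removed before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be normalized, translated, and checked for GC content. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
dna = clean_sequence("ATGAGTTCCA TCACCGCCCC TTCGTGGGCA")
print(translate(dna))  # MSSITAPSWA
```

The translated fragment matches the first block of the protein sequence. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be compared in the same way.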