Gene BURPS668_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1404 
Symbol 
ID4881806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1373574 
End bp1374770 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content73% 
IMG OID640127332 
Productmajor facilitator family transporter 
Protein accessionYP_001058447 
Protein GI126442269 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTCGC AGCCTCGCCA CTCCGCCGGC ACGCCCGGCC ACTATTCACG CAGCCTGTTG 
CTGCTGCTCG CGACGATCGC CGGCGTCTCC GTCGCGAATA TCTATTACAA CCAGCCGCTG
CTCGACGCAT TCCGCGCATC GTTCCCGGGC AGCGCGTCAT GGATCGGCGT CGTGCCGACC
GCGACGCAGC TCGGCTACGC AACCGGCATG CTCGTCCTCG CGCCGCTCGG CGACCGCTTC
GACCGGCGCA CGCTGATCCT GCTGCAGATC GCCGGGCTGT CGGCCGCGCT CGTCGTCGCG
GCGGCCGCGC CGACGCTCGG CGTGCTCGCC GCGGCAAGCC TCGCGATCGG CATCCTCGCG
ACGATCGCGC AGCAGGCGGT GCCGTTCGCC GCCGAGATCG CGCCGCCCGC CGCGCGCGGG
CAGGCGGTCG GCACCGTGAT GAGCGGCCTG CTGCTCGGCA TCCTGCTCGC GCGCACGGCG
GCGGGCTTCG TCGCCGAATA CTTCGGCTGG CGCGCGGTGT TCGCCGTATC GGTCGCGGCG
CTCGCCGCGC TCGCGGCCGT GATCGTCGCG CGCCTGCCGC GCAGCTCGCC GACATCGACG
CTGCCGTACG GCAAGCTGCT CGCATCGATG TGGCAGCTCG TGCGCGAGTT GCGCGGACTG
CGCGAGGCGT CGATGACGGG CGGCGCGATC TTCGCCGCGT TCAGCGCGTT CTGGCCGGTG
CTCACGCTGC TGCTCGCGGG CGCGCCGTTT CATCTGGGCC CGCAGGCGGC GGGGCTCTTC
GGGATCGTCG GCGCGGCGGG CGCGCTCGCC GCGCCGTACG CGGGCCGCTT CGCCGACAAG
CGCGGCCCGC GCGCGATCAT CTCGCTCGCG ATCGCGCTGA TCGCCGCGTC GTTCGCGATC
TTCGCGCTGT CGGGCGCGAG CCTCATCGGG CTCGTGATCG GCGTGATCGT GCTCGACGTC
GGCGTGCAGG CCGCGCAGAT CTCGAACCAG TCGCGCATCT ACGCGCTGAA GCCGGACGCG
CGCAGCCGCG TGAACACGGT GTTCATGGTC TGCTACTTCA TCGGCGGCGC GATCGGCTCG
TCCGCGGGCG TCGCCGCATG GCGCGCGACG GGCTGGCTCG GCATGTGCGC GGTCGGCCTG
CTGTTCTCGA TCGTCGCGGC GATCGTGCAT TTCCGCGGCG GCGCGGGCGC GCGATAA
 
Protein sequence
MSSQPRHSAG TPGHYSRSLL LLLATIAGVS VANIYYNQPL LDAFRASFPG SASWIGVVPT 
ATQLGYATGM LVLAPLGDRF DRRTLILLQI AGLSAALVVA AAAPTLGVLA AASLAIGILA
TIAQQAVPFA AEIAPPAARG QAVGTVMSGL LLGILLARTA AGFVAEYFGW RAVFAVSVAA
LAALAAVIVA RLPRSSPTST LPYGKLLASM WQLVRELRGL REASMTGGAI FAAFSAFWPV
LTLLLAGAPF HLGPQAAGLF GIVGAAGALA APYAGRFADK RGPRAIISLA IALIAASFAI
FALSGASLIG LVIGVIVLDV GVQAAQISNQ SRIYALKPDA RSRVNTVFMV CYFIGGAIGS
SAGVAAWRAT GWLGMCAVGL LFSIVAAIVH FRGGAGAR