Gene BURPS1106A_A1997 details


Gene Information

Locus tag: BURPS1106A_A1997
Symbol:
ID: 4903384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1965078
End bp: 1966454
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 69%
IMG OID: 640145103
Product: major facilitator superfamily permease
Protein accession: YP_001076031
Protein GI: 126455858
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
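
The coordinate and length fields in this record are mutually consistent, which a short check can illustrate. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1965078, 1966454)
print(length)                     # 1377
print(protein_length_aa(length))  # 458
```

Both values match the Gene Length and Protein Length fields listed above.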


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCG GAGACATCGG CATGAAGGAG ACAGGCATGC ATCAGAGAAC GATGGGGTGG 
GTCACGGTAT TCCTGCTGTT CGTCGTGTAC GGCATCAATT ATCTCGACCG CGTCGCGCTG
TCGATCGTCG CGCCGATGGT GCAGCGGGAT CTCGGGATCG ACGCCGCGCA GATGGGCATC
GTCTTCAGCA CGTTCTTCGT CGGCTATGCG CTGTTCAACT TCATCGGCGG TCTCGCGTCG
GACCGGCTCG GGCCGAAGCG CGTCTATGTG ATCGCGGTGG GCCTCTGGTC GATCTTCTGC
GGGATGACGG CGATCACGAT CGGCTTCGTC AGCCTGCTGA TCGTGCGCCT GCTGTTCGGC
ATGGCCGAGG GGCCGCTCTG CTCGGCCGCG AACAAGATGG TCAACAACTG GCTGCCGCGC
GAATCGGCGG CCACGGCGAT GGGGCTGCTG AGCGCCGGCT CGCCGCTCGG CGGCGCGCTC
GCCGGGCCGA TCGTCGGGGT GCTCGCCGCG CAGCTCGGCT GGCGGCCGGC GTTCTGGATC
GTGTGCGCGA TCGGCCTCGC GTGGGTGCTC GTGTGGATCG CGACGACGTC GGACCGGCCC
GCGCCGCAGG CGTCGGCGAT GCCGGCCGGC GGCTCGGGCG CCGCCGCCGC GGCGGCGCGC
GCTGCGGCGG CGCCGCGCGC GTGCGCCAGC GGCGGCCGCA CCGTCGATGC CGCCCATGCC
TCCGAGACCG CCGACGTGCC GCCGCTGCGC GATTACCTGA AGCAGCCGCG CATACTGGCG
ACGGGCGTCG CGTTCTTCGG CTACAACTAC GTGCTGTTCT TCTTCCTGAG CTGGTTTCCG
AGCTATCTCG TGCAGGCGCA TCACCTGAAC ATCCGCGAGA TGAGCGTCGC GACGGTCGTG
CCGTGGCTCG TCGGCACGAT CGGTCTCGCG TGCGGCGGCG TGATCTCGGA CGGAATCTAC
AAGCTCACCG GCAACGCGAT GCTGTCGCGC CGGATCGTGC TCGTCGGGTG CCTGCTCGGC
GCGGGGGTCT GCGTCGCGAT CGCGGGCTCG GTGCGCTCGA CGCAGAGCGC GATCGCGCTG
ATGTCGGTAT CGCTCTTTTT CCTTTACGCG ACGGGTGCGA TCTACTGGGC GATCGTTCAG
GACATCGTCG CGCCGGGGCG CGTCGGCGCG GTCAGCGGCT GCCTGCACTG CATGGGAAGC
CTGTCGGGCG TCATCGGGCC CGCCGTCACC GGATTCATCG TCGAGCGCAG CGGCTCGTTC
GTGTCCGCGT TCGTGCTTGC CGGCGCGATC GCGCTCGCGG GCGCGGTGCT GTCCGCGCTC
TTCCTGCGCA ACCGTGCGTG CGATGCGCGT GTGCTGCGCG AAAGTCCGCT TCTGTAG
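
The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs to be joined before any analysis. The following is a minimal sketch of how one might strip the layout and compute a GC percentage for comparison with the GC content field of the record. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGAACGCCG GAGACATCGG CATGAAGGAG ACAGGCATGC ATCAGAGAAC GATGGGGTGG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
```

Passing the full 1377 bp sequence through `clean_sequence` and then `gc_percent` would give a figure to compare against the reported value.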
 
Protein sequence
MNAGDIGMKE TGMHQRTMGW VTVFLLFVVY GINYLDRVAL SIVAPMVQRD LGIDAAQMGI 
VFSTFFVGYA LFNFIGGLAS DRLGPKRVYV IAVGLWSIFC GMTAITIGFV SLLIVRLLFG
MAEGPLCSAA NKMVNNWLPR ESAATAMGLL SAGSPLGGAL AGPIVGVLAA QLGWRPAFWI
VCAIGLAWVL VWIATTSDRP APQASAMPAG GSGAAAAAAR AAAAPRACAS GGRTVDAAHA
SETADVPPLR DYLKQPRILA TGVAFFGYNY VLFFFLSWFP SYLVQAHHLN IREMSVATVV
PWLVGTIGLA CGGVISDGIY KLTGNAMLSR RIVLVGCLLG AGVCVAIAGS VRSTQSAIAL
MSVSLFFLYA TGAIYWAIVQ DIVAPGRVGA VSGCLHCMGS LSGVIGPAVT GFIVERSGSF
VSAFVLAGAI ALAGAVLSAL FLRNRACDAR VLRESPLL
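
The protein sequence corresponds to a translation of the gene sequence under translation table 11. As a rough sketch, the code below builds a codon table and translates up to the first stop codon. It relies on table 11 sharing its codon-to-amino-acid assignments with the standard genetic code, and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAACGCCG GAGACATCGG CATGAAGGAG"))  # MNAGDIGMKE
```

The output matches the first ten residues of the protein sequence listed above.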