Gene BURPS668_3175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3175 
Symbol 
ID4881882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3115933 
End bp3117288 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content72% 
IMG OID640129103 
Product4-hydroxybenzoate transporter 
Protein accessionYP_001060187 
Protein GI126439396 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00895] benzoate transport 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.902434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGG CGGCGAATCC CGCGCGCGTG CTCGAAATCG AGCGCGTGAT CGACGACACG 
CACCGGCCCG CGTTTCACGC GATGCTGCTC GCGCTTTGCG GGCTGTGCCT CGTGATCGAC
GGTTTCGACG CGCAGGCGAT GGGCTACGTC GCACCGAGCG TGATCGCCGA ATGGGGTGTG
AAGAAGCAGG CGCTCGGGCC CGTCTTCAGC GCGAGCCTGT TCGGCATGCT GCTCGGCGCG
CTCGGCCTGT CGGTGCTCGC CGATCGGATC GGCCGGCGGC CCGTGCTGAT CGGCGCGACG
CTGTTCTTCG CGCTCGCGAT GCTCGCGACG CCGTTCGCGA CGTCGATCCC GATATTGATC
GCGCTGCGCT TCGTCACGGG CCTGGGGCTC GGCTGCATCA TGCCGAACGC GATGGCGCTC
GTCGGCGAAT GCAGCCCGGG CGCGCACCGC GTGAAGCGGA TGATGATCGT GTCGTGCGGC
TTCACGCTCG GCGCGGCGCT GGGCGGGTTC GTCAGCGCCG CGCTGATTCC CGCGTTCGGC
TGGCGCGCGG TGTTCTTCGT CGGCGGCGCG GTGCCGCTCG CGCTCGCGGC CGCGATGGCC
GCGAGCCTGC CCGAATCGCC GCAGTCGCTC GTGCTGCGCG GCCGGCACGA CGCGGCGCGC
GCGTGGCTCG CGAAGTTCGC GCCGCGGCTC GCGGTCCCGC CCGATACGCG GCTTGTCGTG
CGCGAAGCGG GACCCCGGGG CGCGCCCGTC GCCGAGCTGT TCCGCTCGGG ACGCGCGCGC
GTCACGCTGC TGTTGTGGGC GATCAACTTC ATGAACCTGA TCGACCTGTA CTTCCTGTCG
AACTGGCTGC CGACCGTGAT GCGCGACGCG GGCTACGCGA GCGGCACGGC CGTCATCGTC
GGCACGGTGC TGCAGACGGG CGGCGTGATC GGCACGCTGT CGCTCGGCTG GTTCATCGAA
CGGCATGGTT TCGCGCGCGT GCTGTTCGCG TGCTTCGCGT GCGCGACGAT CGCGATCGGC
CTGATCGGCC CGGTCGCGCA CGCGTTCGTC TGGCTGCTCG CAGCCGTGTT CGTCGGCGGC
TTTTGCGTCG TCGGCGGACA GCCCGCGGTC AATGCGCTCG CGGGCCATTA TTACCCGACG
TCGCTGCGCT CGACGGGCAT CGGCTGGAGT CTCGGCGTGG GCCGCGTCGG CTCCGTGCTC
GGGCCGCTCG TCGGCGGGCA ACTGATCGCG CTCGGCTGGT CGAACGACGC GCTGTTTCAC
GCGGCGGCCG TGCCGGTGCT GTGCTCGGCC GTCTTCGTGA TCGGCCTCGC GAGCGTGACG
CGGCGGCGCG GCACGGCCGC GCCGAACGTC GCTTGA
 
Protein sequence
MSAAANPARV LEIERVIDDT HRPAFHAMLL ALCGLCLVID GFDAQAMGYV APSVIAEWGV 
KKQALGPVFS ASLFGMLLGA LGLSVLADRI GRRPVLIGAT LFFALAMLAT PFATSIPILI
ALRFVTGLGL GCIMPNAMAL VGECSPGAHR VKRMMIVSCG FTLGAALGGF VSAALIPAFG
WRAVFFVGGA VPLALAAAMA ASLPESPQSL VLRGRHDAAR AWLAKFAPRL AVPPDTRLVV
REAGPRGAPV AELFRSGRAR VTLLLWAINF MNLIDLYFLS NWLPTVMRDA GYASGTAVIV
GTVLQTGGVI GTLSLGWFIE RHGFARVLFA CFACATIAIG LIGPVAHAFV WLLAAVFVGG
FCVVGGQPAV NALAGHYYPT SLRSTGIGWS LGVGRVGSVL GPLVGGQLIA LGWSNDALFH
AAAVPVLCSA VFVIGLASVT RRRGTAAPNV A