Gene BMA10229_A2687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A2687 
Symbol 
ID4791050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp2714034 
End bp2715389 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content71% 
IMG OID 
Productputative 4-hydroxyphenylacetate transporter 
Protein accessionYP_001028639 
Protein GI124385969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGG CGGCGAATCC CGCGCGCGTG CTCGAGATCG AGCGCGTGAT CGACGACACG 
CACCGGCCCG CGTTTCACGC GATGCTGCTC GCGCTTTGCG GGCTGTGCCT CGTGATCGAC
GGTTTCGACG CGCAGGCGAT GGGCTACGTC GCACCGAGCG TGATCGCCGA ATGGGGTGTG
AAGAAGCAGG CGCTCGGGCC CGTCTTCAGC GCGAGCCTGT TCGGCATGCT GCTCGGCGCG
CTCGGCCTGT CGGTGCTCGC CGATCGGATC GGCCGGCGGC CCGTGCTGAT CGGCGCGACG
CTGTTCTTCG CGCTCGCGAT GCTCGCGACG CCGTTCGCGA CGTCGATCCC GATATTGATC
GCGCTGCGCT TCGTCACGGG CCTGGGGCTC GGCTGCATCA TGCCGAACGC GATGGCGCTC
GTCGGCGAAT GCAGCCCGGG CGCGCACCGC GTGAAGCGGA TGATGATCGT GTCGTGCGGC
TTCACGCTCG GTGCGGCGCT GGGCGGGTTC GTCAGCGCCG CGCTGATTCC CGCGTTCGGC
TGGCGCGCGG TGTTCTTCGT CGGCGGCGCG GTGCCGCTCG CGCTCGCGGC CGCGATGGCC
GCGAGCCTGC CCGAATCGCC GCAGTTGCTC GTGCTGCGCG GCCGGCACGA CGCGGCGCGC
GCGTGGCTCG CGAAGTTCGC GCCGCGGCTC GCGGTCCCGC CCGATACGCG GCTTGTCGTG
CGCGAAGCGG GACCCCGGGG CGCGCCCGTC GCCGAGCTGT TCCGCTCGGG ACGCGCGCGC
GTCACGCTGC TGTTGTGGGC GATCAACTTC ATGAACCTGA TCGACCTGTA CTTCCTGTCG
AACTGGCTGC CGACCGTGAT GCGCGACGCG GGCTACGCGA GCGGCACGGC CGTCATCGTC
GGCACGGTGC TGCAGACGGG CGGCGTGATC GGCACGCTGT CGCTCGGCTG GTTCATCGAA
CGGCATGGTT TCGCGCGCGT GCTGTTCGCG TGCTTCGCGT GCGCGACGAT CGCGATCGGC
CTGATCGGCT CGGTCGCGCA CGCGTTCGTC TGGCTGCTCG CAGCCGTGTT CGTCGGCGGC
TTTTGCGTCG TCGGCGGACA GCCCGCGGTC AATGCGCTCG CGGGCCATTA TTACCCGACG
TCGCTGCGCT CGACGGGCAT CGGCTGGAGT CTCGGCGTGG GCCGCGTCGG CTCCGTGCTC
GGGCCGCTCG TCGGCGGGCA ACTGATCGCG CTCGGCTGGT CGAACGACGC GCTGTTTCAC
GCGGCGGCCG TGCCGGTGCT GTGCTCGGCC GTCTTCGTGA TCGGCCTCGC GAGCGTGACG
CGGCGGCGCG GCATGGCCGC GCCGAACGTC GCTTGA
 
Protein sequence
MSAAANPARV LEIERVIDDT HRPAFHAMLL ALCGLCLVID GFDAQAMGYV APSVIAEWGV 
KKQALGPVFS ASLFGMLLGA LGLSVLADRI GRRPVLIGAT LFFALAMLAT PFATSIPILI
ALRFVTGLGL GCIMPNAMAL VGECSPGAHR VKRMMIVSCG FTLGAALGGF VSAALIPAFG
WRAVFFVGGA VPLALAAAMA ASLPESPQLL VLRGRHDAAR AWLAKFAPRL AVPPDTRLVV
REAGPRGAPV AELFRSGRAR VTLLLWAINF MNLIDLYFLS NWLPTVMRDA GYASGTAVIV
GTVLQTGGVI GTLSLGWFIE RHGFARVLFA CFACATIAIG LIGSVAHAFV WLLAAVFVGG
FCVVGGQPAV NALAGHYYPT SLRSTGIGWS LGVGRVGSVL GPLVGGQLIA LGWSNDALFH
AAAVPVLCSA VFVIGLASVT RRRGMAAPNV A