Gene BMA10229_A2547 details


Gene Information

Locus tag: BMA10229_A2547
Symbol:
ID: 4793240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei NCTC 10229
Kingdom: Bacteria
Replicon accession: NC_008836
Strand:
Start bp: 2591737
End bp: 2592894
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: glycosyl transferase, group 1 family protein
Protein accession: YP_001028505
Protein GI: 124385756
COG category:
COG ID:
TIGRFAM ID:
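
The length fields above are consistent with the coordinates: with 1-based inclusive coordinates the gene spans End - Start + 1 bases, and the protein length is the number of codons minus the stop codon. A minimal sketch of that arithmetic follows; the function names are hypothetical, and the inclusive-coordinate convention is an assumption that happens to reproduce the listed values.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(2591737, 2592894)
    print(n_bp, "bp")                   # 1158 bp
    print(protein_length(n_bp), "aa")   # 385 aa
```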


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTTTT TCCAGTCCGA CATGAGTTCC GCTTCCGCCC CGCGCATCGT TCTCGTCTGC 
AATACCGCCT GGGCGATCTA TACGTACCGG CAAGGCCTGC TTCGCATGCT GATCGCGCGC
GGCGCGCAGG TGACCGTGCT CGCGCCGCGC GACCGCACCG TCGAGCCGCT CGTGCGCATG
GGCTGCCGCT ACGCGGAGCT GCCCGTCGCC TCGAAAGGCA CGAGCCCGCG CGAGGACCTG
CGCACGCTCA TCGCGCTGTA TCGGCACTAC CGCGCGATCC GGCCCGACCT CGTGTTCCAT
TACACGATCA AGCCGAACAT CTACGGCTCG ATCGCCGCGT GGCTCGCGCG CGTGCCGTCG
ATCGCGGTGA CGACGGGCCT CGGCTACGTG TTCATCCAGC AGAGCCACGC CGCACGCGTC
GCGAAGCAGC TGTACCGCTT CGCGTTGCGC TTTCCGCGCG AGGTCTGGTT CCTGAACCGC
GACGATCTGC ACACGTTCAC GCACGAGCAG CTCCTCGCGC ATCCGGCGCG CGCGCGCCTG
CTGCACGGCG AGGGCGTCGA CCTCGAGCAG TTCGCGCTCG CGCCGCTGCC CGCGCGCGAC
ACGTTCACCT TCGTGCTGAT CGGCCGGCTG CTGTGGGACA AGGGCGTGCG CGAATACGTC
GATGCGGCGC GCATGCTGCG CGCGCGCTAT CCGCACGCGC GCTTCGCGCT GCTCGGCCCC
GTCGGCGTCG ACAATCCGAG CGCGATCTCG CAGGCCGACG TCGACGCGTG GGTGCGCGAA
GGCGTGATCG ATTACCTCGG CGAGGCGCAC GACGTACGGC CGCACATCGC CCGCGCCGAT
TGCGTCGTGC TGCCGTCCTA TCGCGAGGGC GTGCCGCGCA CGCTGATGGA GGCCTCCGCG
ATGGGCCGGC CGATCGTCGC GACCGACGTG CCGGGCTGCC GCGACGTCGT CGCCGACGGC
AGCACGGGGC TGCTGTGCGC CGCGCGCGAC AGCGCGAGCC TCGCCGCGCA GCTCGCGCGG
ATGCTCGACA TGAGCGCGGC CGAGCGGCGC GCGATGGGCG AGCGCGGCCG GAGAAAGATC
GTCGCGGAAT TCGACGAGGC GAAGGTCGTC GAGCGTTATC ATCAGACCAT TTCGGCCCTG
ACGGGCATCA CACTTTGA
 
Protein sequence
MIFFQSDMSS ASAPRIVLVC NTAWAIYTYR QGLLRMLIAR GAQVTVLAPR DRTVEPLVRM 
GCRYAELPVA SKGTSPREDL RTLIALYRHY RAIRPDLVFH YTIKPNIYGS IAAWLARVPS
IAVTTGLGYV FIQQSHAARV AKQLYRFALR FPREVWFLNR DDLHTFTHEQ LLAHPARARL
LHGEGVDLEQ FALAPLPARD TFTFVLIGRL LWDKGVREYV DAARMLRARY PHARFALLGP
VGVDNPSAIS QADVDAWVRE GVIDYLGEAH DVRPHIARAD CVVLPSYREG VPRTLMEASA
MGRPIVATDV PGCRDVVADG STGLLCAARD SASLAAQLAR MLDMSAAERR AMGERGRRKI
VAEFDEAKVV ERYHQTISAL TGITL
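
The gene and protein sequences above are related through translation table 11, and the GC content is the fraction of G and C bases in the gene sequence. The sketch below shows one way to check both from the space-separated blocks. It is an illustration, not the database's own pipeline: the helper names are hypothetical, and the codon table is built from the standard NCBI amino-acid ordering, which table 11 shares with table 1. Table 11's alternative start codons are not handled, since this gene starts with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    head = clean("ATGATTTTTT TCCAGTCCGA CATGAGTTCC")
    print(translate(head))  # MIFFQSDMSS
```

Passing the full gene sequence through `clean` and then `translate` or `gc_fraction` allows a comparison against the listed protein sequence and GC content.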