Gene BMA10229_A2551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A2551 
SymbolwaaC 
ID4793462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp2596777 
End bp2597844 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content71% 
IMG OID 
Productheptosyltransferase I 
Protein accessionYP_001028509 
Protein GI124386486 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGGCCG CCCGGCGCGA TAAAATCCGT CCTTTCGGTC TGTGCCGGCC ACCGCCGGCC 
CTTTTTTTCA GCGTGCAAAA AATTCTGATC GTGCGCGTGT CGTCGCTCGG CGATGTCGTG
CATAACATGC CGGTGATCGC CGATATCCGC CGGCGTCACC CCGATGCGCA GATCGACTGG
CTCGTCGAGG AAGGCTTCGC CGATCTCGTG CGGCTCGTCG ACGGTGTGCG CGACGTGCTG
CCGTTCTCGC TGCGGCGCTG GCGCAAGCGC TTGAGCGCGT CGCAAACGTG GCGCGAGATC
CGCGCGTTCC GGCGGCGCCT CGCCGAGGAG CGCTACGACC TCGTGATCGA CTGCCAGGGG
CTCATCAAGA CCGCGTGGGT CGCGAGCTGG GCGCGCGGGC CGCTTGTCGG CCTCGGCAAC
CGCACCGACG GCGCCGGCTA CGAGTGGCCG GTGCGTTTCT TCTACGACAG GCGGGTGCCG
ATCGCGCCGC GCACGCACGT CGTCGAGCGC TCGCGGCAGC TCGTCGCGGC GGCGCTGGGA
GACCCCGCGC CGGCGCCCGG CGATCCGATC GATTTCGGCC TCGACACGCA TGGCGCGGCG
CGCGCGCTCG CGGCGCTCGA TTTGAATCTG CCGGTGCCCT ACGTGGTATT CGTGCACGCG
ACCTCGCGCG CCGACAAGCA GTGGCCCGAC GAAGCGTGGA CCGGCCTCGG CGAGGCGCTC
GTGCGGCGCG GCGCGTCGCT CGTGCTGCCG TGGGGCAGCG ACGCCGAGCG CGCGACGAGC
GAGCGCCTCG CGAAGGCGTT CGGCGCGGCG GCGATCGTGC CGCCGAAGCT GTCGCTGCCC
GTGGTCGTCG GCCTCGTCGA CGGCGCGGCG GCGACGGTCG GCGTCGATAC CGGCCTCGTC
CACATCGCGG CGGCGCTTAA GCGTCCGACC GTCGAACTGT ACAATTTCGC GACAGCCTGG
CGCACGGGCG GCTACTGGTC GCCCAACGTC GTCAATCTCG GCACCGCCGG CGCGCCGCCG
TCCCTTTCGC AGGCGAAGGA CGCACTCGCG TCGTTCGGCC TCTTGTAA
 
Protein sequence
MSAARRDKIR PFGLCRPPPA LFFSVQKILI VRVSSLGDVV HNMPVIADIR RRHPDAQIDW 
LVEEGFADLV RLVDGVRDVL PFSLRRWRKR LSASQTWREI RAFRRRLAEE RYDLVIDCQG
LIKTAWVASW ARGPLVGLGN RTDGAGYEWP VRFFYDRRVP IAPRTHVVER SRQLVAAALG
DPAPAPGDPI DFGLDTHGAA RALAALDLNL PVPYVVFVHA TSRADKQWPD EAWTGLGEAL
VRRGASLVLP WGSDAERATS ERLAKAFGAA AIVPPKLSLP VVVGLVDGAA ATVGVDTGLV
HIAAALKRPT VELYNFATAW RTGGYWSPNV VNLGTAGAPP SLSQAKDALA SFGLL