Gene BMA10229_1116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_1116 
SymbolhmuS 
ID4789693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008835 
Strand
Start bp1165174 
End bp1166325 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content72% 
IMG OID 
Producthemin transport protein HmuS 
Protein accessionYP_001024919 
Protein GI124381831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.95887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAACA CCGCCGCCCC CGCCGCTTCG CCCGCCCGCG CGCTCGCGCC CGACGAGCTG 
CGCGACGCGT TCCTGCACCT GAAAGAAACC CGCAAGCTGC GCAACCGCGA CGTCGCGCAA
CTGCTCGGCG TGAGCGAAGG CGAGGCGCTC GCCGCCTTCG CGGGCGAGCG CGTCGTGCGG
CTCGAATCGA GCTTCGTCGA GCTGTTCGAG GAGATGCCGC GCTTAGGCGG CGTGATGGCG
CTCACGCGCA ACGCGGCCGC CGTGCACGAG AAGGACGGCG CGTTCGAGCA GATGAGCCAC
GACGGCCCGG TCGGCCTCGC GCTCGGCGCG ATCGACCTGC GCATCTTCTA CCGCAACTGG
GCGGCCGGGT TCGCCGTCTA CGAGCCGACC GCGCACGGCG TGATGAAGAG CCTGCAGTTC
TTCGACGCGC AGGGCGACGC GGTGCACAAG GTCTACCTGC GCAAGCACAG CGATCACGCC
GCGTTCGACG CGTTCGTGTC GCGCTGGCGG ATGCCCGTGC AATCGCCGGC GTTCGCGGTC
GAGCCCGCGC CGCCCGCGCA TGTCGAACGG CCCGACGGCG AGATCGACGC CGCGGGGCTG
CGCGCCGCGT GGGACGCGAT GACGGATACG CACCAGTTCC ACGGCGTCGT GCGCCGCCAC
GGCGTGTCGC GCACGCAGGC GCTGCGGCTC GCCGGCGCGT CGCGCGCGCA TCGCGTCGCG
ACCGACGCGG CGCGGCGCGT GCTGGAGCGC GCCGCGCAGA CGCGGCTGCC GATCATGGTG
TTCGTCGGCA ACCGCGGCAT GATCCAGATC CACACCGGCG CCGTGACGAA CATCCGCCGC
ATGGGCACGT GGATCAACGT GCTCGACGAG GATTTCAACC TGCATCTGCG CGAGGATCTC
GTCGCGTCCG CGTGGGTCGT GAGAAAGCCG ACGAGCGACG GCGCCGTCAC GTCGGTCGAG
CTGTTCGACG CGGCGGGCGA CAACATCGCG ATGTTGTTCG GCGCGCGCAA GCCCGGACAG
CCGGAACTCG CGGGCTGGCG CGAACTGGCG GGCGCGCTGC CGAGGCTCGA CACGGCGGAT
GCGGCGGATG CGGCGACCGT CGCGCATGCC GCCGACGTCC CCGTCGCGAC CGACGCCGGA
GCCGCGCGAT GA
 
Protein sequence
MMNTAAPAAS PARALAPDEL RDAFLHLKET RKLRNRDVAQ LLGVSEGEAL AAFAGERVVR 
LESSFVELFE EMPRLGGVMA LTRNAAAVHE KDGAFEQMSH DGPVGLALGA IDLRIFYRNW
AAGFAVYEPT AHGVMKSLQF FDAQGDAVHK VYLRKHSDHA AFDAFVSRWR MPVQSPAFAV
EPAPPAHVER PDGEIDAAGL RAAWDAMTDT HQFHGVVRRH GVSRTQALRL AGASRAHRVA
TDAARRVLER AAQTRLPIMV FVGNRGMIQI HTGAVTNIRR MGTWINVLDE DFNLHLREDL
VASAWVVRKP TSDGAVTSVE LFDAAGDNIA MLFGARKPGQ PELAGWRELA GALPRLDTAD
AADAATVAHA ADVPVATDAG AAR