Gene BMA10229_A1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_A1446 
SymbolnagE 
ID4792033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008836 
Strand
Start bp1472831 
End bp1474597 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content70% 
IMG OID 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_001027429 
Protein GI124383552 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGAA ATCCGTTTCT GAAAATACAG AGCCTCGGCA GGGCGCTGAT GCTGCCGATC 
GCGGTGCTGC CGGTGGCGGG CATCCTGCTG CGCCTCGGGC AGCAGGACGT GCTCAACATC
AAGATGATCG CCGACGCGGG CGGCGCGATC TTCGAGAACC TGCCGCTGCT GTTCGCGATC
GGCGTCGCGG TCGGCTTCGC GAAGGACAAC AACGGCGTGG CGGCGCTCGC GGGCGCGATC
GGCTATCTGA TCGAAGTCGC GATCATGAAG GACATCGATC CGAAGCTGAA CATGGGCGTG
CTGTCCGGGA TCATCGCGGG CGTCGTCGCG GGGCTGCTGT ACAACCGCTA CAAGGACATC
AAGCTGCCCG ACTACCTCGC GTTCTTCGGC GGCAAGCGCT TCGTGCCGAT CATCACGGGG
CTCGCGTGCG TCGTGCTCGG GATCGTGTTC GGCTACGTAT GGCAGCCGGT GCAGCACGCG
ATCGACGCGG TCGGCCAGTG GCTGCTGACG GCGGGCGCGA TCGGCACGTT CGTCTACGGG
TTCCTGAACC GCCTGTTGCT CGTCACGGGG CTGCACCACA TCATCAATTC GCTCGTCTGG
TTCGTGTTCG GCACGTTCAC GCCGGCGGGC GGCGCCGCGG TGACGGGCGA TCTGCATCGC
TTCTTCGCGG GCGATCCGAG CGCGGGCGGC TTCATGGCGG GCTTCTTCCC GATCATGATG
TTCGGCCTGC CGGCCGCGTG CCTCGCGATG TTTCACGAGG CGCCGAAGGC GCGCCGCGCG
ATCGTCGGCG GCCTGCTGTT CTCGATGGCG CTCACCTCGT TCCTGACGGG CGTGACCGAG
CCGATCGAGT TCAGCTTCAT GTTCCTCGCG CCGGTGCTGT ACGTGATCCA CGCGGTGCTC
ACGGGCCTTT CGCTCGCGAT CTGCCAGTTG CTCGGCGTGA AGCTCGGCTT CACGTTCTCG
GCGGGCGCGA TCGACTATGT GCTGAACTAC GGGCTGTCGA CGAAGGGCTG GATCGCGATC
CCGCTCGGCC TCGCGTACGG TCTCGCCTAC TACGGCCTCT TCCGCTTCTT CATCCGCAAG
TTCAACATGG CGACGCCGGG CCGCGAGCCC GCGGGCGCGG ACGCGCAGGC GCAGTCGTTC
GCGTCGGGCG GTTTCGTCGC GCCGACGGCG GGCGCATCGG TGCCGCGCGC GCAGCGCTAC
ATCGCGGCGC TCGGCGGCGC GGCGAACCTG TCGGTCGTCG ATGCGTGCAC GACTCGGCTG
CGTCTTTCCG TCGTCGATCC CGAGAAGGTG TCCGAAGCGG ATCTGCGCAC GATCGGCGCG
CGCGGCGTGC TCAAGCGCGG CGGCAGCAGC GTGCAGGTGA TCATCGGGCC GGAGGCGGAC
CTCATCGCCG ATGAGATTCG CGCGACGCTC GGCAGCGGCG CGGCGGCGCC CGCGGCTGCG
GCTGCCGCGG CGCCTGCGGC GGCGGCAACG GCAACGGCGG CGGGCGCGCA GTCGGGCCCG
CTCGATCCGG AGCCGACGCG CTGGCTCGCG GTGTTCGGCG GCGCGACGAA CGTCGCTTCG
CTCGACGCGG TCGCGGCGAC GCGCCTGCGC GTCGTCGTAC GCGATCCGTC GGCGGTCGAT
CGCCAGCGCC TCGCGACGCT TGACGTCGCC TGGGTCGCGA GCGACACGTT CCATATCGTC
TGCGGCCAGT CGGCGCCGCG CTATGCGCAG CAGCTCGCCG CGCGCCTGCC GTCGTCCGAC
GGCGGCACGG CGGCCCAGCC CGCCTGA
 
Protein sequence
MDGNPFLKIQ SLGRALMLPI AVLPVAGILL RLGQQDVLNI KMIADAGGAI FENLPLLFAI 
GVAVGFAKDN NGVAALAGAI GYLIEVAIMK DIDPKLNMGV LSGIIAGVVA GLLYNRYKDI
KLPDYLAFFG GKRFVPIITG LACVVLGIVF GYVWQPVQHA IDAVGQWLLT AGAIGTFVYG
FLNRLLLVTG LHHIINSLVW FVFGTFTPAG GAAVTGDLHR FFAGDPSAGG FMAGFFPIMM
FGLPAACLAM FHEAPKARRA IVGGLLFSMA LTSFLTGVTE PIEFSFMFLA PVLYVIHAVL
TGLSLAICQL LGVKLGFTFS AGAIDYVLNY GLSTKGWIAI PLGLAYGLAY YGLFRFFIRK
FNMATPGREP AGADAQAQSF ASGGFVAPTA GASVPRAQRY IAALGGAANL SVVDACTTRL
RLSVVDPEKV SEADLRTIGA RGVLKRGGSS VQVIIGPEAD LIADEIRATL GSGAAAPAAA
AAAAPAAAAT ATAAGAQSGP LDPEPTRWLA VFGGATNVAS LDAVAATRLR VVVRDPSAVD
RQRLATLDVA WVASDTFHIV CGQSAPRYAQ QLAARLPSSD GGTAAQPA