Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_A1446 |
Symbol | nagE |
ID | 4792033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008836 |
Strand | - |
Start bp | 1472831 |
End bp | 1474597 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | PTS system, N-acetylglucosamine-specific IIBC component |
Protein accession | YP_001027429 |
Protein GI | 124383552 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGAA ATCCGTTTCT GAAAATACAG AGCCTCGGCA GGGCGCTGAT GCTGCCGATC GCGGTGCTGC CGGTGGCGGG CATCCTGCTG CGCCTCGGGC AGCAGGACGT GCTCAACATC AAGATGATCG CCGACGCGGG CGGCGCGATC TTCGAGAACC TGCCGCTGCT GTTCGCGATC GGCGTCGCGG TCGGCTTCGC GAAGGACAAC AACGGCGTGG CGGCGCTCGC GGGCGCGATC GGCTATCTGA TCGAAGTCGC GATCATGAAG GACATCGATC CGAAGCTGAA CATGGGCGTG CTGTCCGGGA TCATCGCGGG CGTCGTCGCG GGGCTGCTGT ACAACCGCTA CAAGGACATC AAGCTGCCCG ACTACCTCGC GTTCTTCGGC GGCAAGCGCT TCGTGCCGAT CATCACGGGG CTCGCGTGCG TCGTGCTCGG GATCGTGTTC GGCTACGTAT GGCAGCCGGT GCAGCACGCG ATCGACGCGG TCGGCCAGTG GCTGCTGACG GCGGGCGCGA TCGGCACGTT CGTCTACGGG TTCCTGAACC GCCTGTTGCT CGTCACGGGG CTGCACCACA TCATCAATTC GCTCGTCTGG TTCGTGTTCG GCACGTTCAC GCCGGCGGGC GGCGCCGCGG TGACGGGCGA TCTGCATCGC TTCTTCGCGG GCGATCCGAG CGCGGGCGGC TTCATGGCGG GCTTCTTCCC GATCATGATG TTCGGCCTGC CGGCCGCGTG CCTCGCGATG TTTCACGAGG CGCCGAAGGC GCGCCGCGCG ATCGTCGGCG GCCTGCTGTT CTCGATGGCG CTCACCTCGT TCCTGACGGG CGTGACCGAG CCGATCGAGT TCAGCTTCAT GTTCCTCGCG CCGGTGCTGT ACGTGATCCA CGCGGTGCTC ACGGGCCTTT CGCTCGCGAT CTGCCAGTTG CTCGGCGTGA AGCTCGGCTT CACGTTCTCG GCGGGCGCGA TCGACTATGT GCTGAACTAC GGGCTGTCGA CGAAGGGCTG GATCGCGATC CCGCTCGGCC TCGCGTACGG TCTCGCCTAC TACGGCCTCT TCCGCTTCTT CATCCGCAAG TTCAACATGG CGACGCCGGG CCGCGAGCCC GCGGGCGCGG ACGCGCAGGC GCAGTCGTTC GCGTCGGGCG GTTTCGTCGC GCCGACGGCG GGCGCATCGG TGCCGCGCGC GCAGCGCTAC ATCGCGGCGC TCGGCGGCGC GGCGAACCTG TCGGTCGTCG ATGCGTGCAC GACTCGGCTG CGTCTTTCCG TCGTCGATCC CGAGAAGGTG TCCGAAGCGG ATCTGCGCAC GATCGGCGCG CGCGGCGTGC TCAAGCGCGG CGGCAGCAGC GTGCAGGTGA TCATCGGGCC GGAGGCGGAC CTCATCGCCG ATGAGATTCG CGCGACGCTC GGCAGCGGCG CGGCGGCGCC CGCGGCTGCG GCTGCCGCGG CGCCTGCGGC GGCGGCAACG GCAACGGCGG CGGGCGCGCA GTCGGGCCCG CTCGATCCGG AGCCGACGCG CTGGCTCGCG GTGTTCGGCG GCGCGACGAA CGTCGCTTCG CTCGACGCGG TCGCGGCGAC GCGCCTGCGC GTCGTCGTAC GCGATCCGTC GGCGGTCGAT CGCCAGCGCC TCGCGACGCT TGACGTCGCC TGGGTCGCGA GCGACACGTT CCATATCGTC TGCGGCCAGT CGGCGCCGCG CTATGCGCAG CAGCTCGCCG CGCGCCTGCC GTCGTCCGAC GGCGGCACGG CGGCCCAGCC CGCCTGA
|
Protein sequence | MDGNPFLKIQ SLGRALMLPI AVLPVAGILL RLGQQDVLNI KMIADAGGAI FENLPLLFAI GVAVGFAKDN NGVAALAGAI GYLIEVAIMK DIDPKLNMGV LSGIIAGVVA GLLYNRYKDI KLPDYLAFFG GKRFVPIITG LACVVLGIVF GYVWQPVQHA IDAVGQWLLT AGAIGTFVYG FLNRLLLVTG LHHIINSLVW FVFGTFTPAG GAAVTGDLHR FFAGDPSAGG FMAGFFPIMM FGLPAACLAM FHEAPKARRA IVGGLLFSMA LTSFLTGVTE PIEFSFMFLA PVLYVIHAVL TGLSLAICQL LGVKLGFTFS AGAIDYVLNY GLSTKGWIAI PLGLAYGLAY YGLFRFFIRK FNMATPGREP AGADAQAQSF ASGGFVAPTA GASVPRAQRY IAALGGAANL SVVDACTTRL RLSVVDPEKV SEADLRTIGA RGVLKRGGSS VQVIIGPEAD LIADEIRATL GSGAAAPAAA AAAAPAAAAT ATAAGAQSGP LDPEPTRWLA VFGGATNVAS LDAVAATRLR VVVRDPSAVD RQRLATLDVA WVASDTFHIV CGQSAPRYAQ QLAARLPSSD GGTAAQPA
|
| |