Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10229_1866 |
Symbol | metZ |
ID | 4790095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10229 |
Kingdom | Bacteria |
Replicon accession | NC_008835 |
Strand | + |
Start bp | 1912427 |
End bp | 1913698 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | O-succinylhomoserine sulfhydrylase |
Protein accession | YP_001025663 |
Protein GI | 124383049 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGGC TTTTTGTTGG TCTGCGCATT CGCCGCGCGA CGCGCACGGC GGCGATGCGC TTGAAAACGG ATGAACGGAA CATGGACGAC TCTCTCAACT TCGACACGCT CGCCGTCCGC TCGGGCACGC TGCGCAGTGA ATTCAACGAG CATTCGGAAG CGCTCTTCCT CACGTCGAGC TTCTGCTACG CGAGCGCGGC CGAGGCGGCC GAGCGCTTCA AGCATTCGGA AGACTACTAC ACGTACTCGC GCTTCACGAA TCCGACCGTC ACGATGTTCC AGGATCGCCT CGCGGCGCTC GAGGGCGGCG AGGCGTGCAT CGCGACCGCG TCCGGAATGG CGGCGATCAC GTCGGTCGTG ATGGCCGCGC TGCAGGTGGG CGATCACCTC GTCAGCTCGC GCAGCCTGTT CGGCTCGACG CTCGGGCTGT TCGGGCAGAT CTTCGCGAAG TTCGGCATCG AGACGACGTT CGTCGATCCG GCCGATCTCG ACGCATGGCG CGCCGCCGTG CGCCCCGAGA CGAAGATGTT CTTCCTCGAG ACGCCGTCGA ACCCGATGAC CGAGCTCGCC GACATCGAGG CGGTGGGCAG GATCGCGAAG GCGGCCCATG CGCTGTTCGT CGTCGACAAC TGTTTCTGCA GCCCGGTGCT GCAGCAGCCG CTCAAGCTCG GCGCGGACGT CGTGATGCAT TCGGCCACCA AGTTCCTCGA CGGTCAGGGG CGCGTGCTCG GCGGCGCGCT CGTCGGCTCG AAGGCGTTCA TCATGGAGAA GGTGTTCCCG TTCGTGCGCA CCGCGGGGCC GACGCTGTCC GCGTTCAATG CGTGGGTGCT GCTCAAGGGC ATGGAGACGC TGTCGCTGCG CGTGAACCGG CAATCGGAGA ACGCGCTTGA GATCGCCCGC TGGCTCGAGG CGCATCCGGC CGTGAAGCGC GTGTTCTATC CGGGGCTCGA ATCGCATCCG CAGCATGCGC TCGCGAAGCG CCAGCAGAAA TCGGGCGGCT CGGTCGTGTC GTTCGAGCTG AACGGCGATA CGCCGGAGCG GCAGCGCGCG AACGCGTGGC GTGTGATCGA CGGCACGAAG ATCGTATCGA TCACCGCGAA CCTCGGCGAC ACGCGCACGA CGATCACGCA TCCGGCGACG ACGACGCACA GCCGGATCGC GCCTGAAGCG CGCGAGGCGG CGGGGATCAC CGAAGGGCTG ATCCGGCTCG CGGTCGGGCT CGAGGATCCG GCCGACATCC GCGACGATCT CGCGCGCGGG CTCGCCGGCT GA
|
Protein sequence | MSGLFVGLRI RRATRTAAMR LKTDERNMDD SLNFDTLAVR SGTLRSEFNE HSEALFLTSS FCYASAAEAA ERFKHSEDYY TYSRFTNPTV TMFQDRLAAL EGGEACIATA SGMAAITSVV MAALQVGDHL VSSRSLFGST LGLFGQIFAK FGIETTFVDP ADLDAWRAAV RPETKMFFLE TPSNPMTELA DIEAVGRIAK AAHALFVVDN CFCSPVLQQP LKLGADVVMH SATKFLDGQG RVLGGALVGS KAFIMEKVFP FVRTAGPTLS AFNAWVLLKG METLSLRVNR QSENALEIAR WLEAHPAVKR VFYPGLESHP QHALAKRQQK SGGSVVSFEL NGDTPERQRA NAWRVIDGTK IVSITANLGD TRTTITHPAT TTHSRIAPEA REAAGITEGL IRLAVGLEDP ADIRDDLARG LAG
|
| |