Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2299 |
Symbol | metZ |
ID | 4903701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2277461 |
End bp | 2278732 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640145404 |
Product | O-succinylhomoserine sulfhydrylase |
Protein accession | YP_001076332 |
Protein GI | 126456415 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases |
TIGRFAM ID | [TIGR01325] O-succinylhomoserine sulfhydrylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.21935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGGC TTTTTGTTGG TCTGCGCATT CGCCGCGCGA CGCGCACGGC GGCGATGCGC TTGAAAACGG ATGAACGGAA CATGGACGAC TCTCTCAACT TCGACACGCT CGCCGTCCGC TCGGGCACGC TGCGCAGTGA ATTCAACGAG CATTCGGAAG CGCTCTTCCT CACGTCGAGC TTCTGCTACG CGAGCGCGGC CGAGGCGGCC GAGCGCTTCA AGCATTCGGA AGACTACTAC ACGTACTCGC GCTTCACGAA TCCGACCGTC ACGATGTTCC AGGATCGCCT CGCGGCGCTC GAGGGCGGCG AGGCGTGCAT CGCGACCGCG TCCGGAATGG CGGCGATCAC GTCGGTCGTG ATGGCCGCGC TGCAGGCGGG CGATCACCTC GTCAGCTCGC GCAGCCTGTT CGGCTCGACG CTCGGGCTGT TCGGGCAGAT CTTCGCGAAG TTCGGCATCG AGACGACGTT CGTCGATCCG GCCGATCTCG ACGCATGGCG CGCCGCCGTG CGCCCCGAGA CGAAGATGTT CTTCCTCGAG ACGCCGTCGA ACCCGATGAC CGAGCTCGCC GACATCGAGG CGGTGGGCAG GATCGCGAAG GCGGCCCATG CGCTGTTCGT CGTCGACAAC TGTTTCTGCA GCCCGGTGCT GCAGCAGCCG CTCAAGCTCG GCGCGGACGT CGTGATGCAT TCGGCCACCA AGTTCCTCGA CGGTCAGGGG CGCGTGCTCG GCGGCGCGCT CGTCGGCTCG AAGGCGTTCA TCATGGAGAA GGTGTTCCCG TTCGTGCGCA CCGCGGGGCC GACGCTGTCC GCGTTCAATG CGTGGGTGCT GCTCAAGGGC ATGGAGACGC TGTCGCTGCG CGTGAACCGG CAATCGGAGA ACGCGCTCGA GATCGCCCGC TGGCTCGAGG CGCATCCGGC CGTGAAGCGC GTGTTCTATC CGGGGCTCGA ATCGCATCCG CAGCATGCGC TCGCGAAGCG CCAGCAGAAA TCGGGCGGCT CGGTCGTGTC GTTCGAGCTG AACGGCGATA CGCCGGAGCA GCAGCGCGCG AACGCGTGGC GTGTGATCGA CGGCACGAAG ATCGTATCGA TCACCGCGAA CCTCGGCGAC ACGCGCACGA CGATCACGCA TCCGGCGACG ACGACGCACA GCCGGATCGC GCCGGAAGCG CGCGAGGCGG CGGGGATCAC CGAAGGGCTG ATCCGGCTCG CGGTCGGGCT CGAGGATCCG GCCGACATCC GCGACGATCT CGCGCGCGGG CTCGCCGGCT GA
|
Protein sequence | MSGLFVGLRI RRATRTAAMR LKTDERNMDD SLNFDTLAVR SGTLRSEFNE HSEALFLTSS FCYASAAEAA ERFKHSEDYY TYSRFTNPTV TMFQDRLAAL EGGEACIATA SGMAAITSVV MAALQAGDHL VSSRSLFGST LGLFGQIFAK FGIETTFVDP ADLDAWRAAV RPETKMFFLE TPSNPMTELA DIEAVGRIAK AAHALFVVDN CFCSPVLQQP LKLGADVVMH SATKFLDGQG RVLGGALVGS KAFIMEKVFP FVRTAGPTLS AFNAWVLLKG METLSLRVNR QSENALEIAR WLEAHPAVKR VFYPGLESHP QHALAKRQQK SGGSVVSFEL NGDTPEQQRA NAWRVIDGTK IVSITANLGD TRTTITHPAT TTHSRIAPEA REAAGITEGL IRLAVGLEDP ADIRDDLARG LAG
|
| |