Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2001 |
Symbol | |
ID | 4883303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 1978932 |
End bp | 1980059 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640127929 |
Product | cysteine synthase |
Protein accession | YP_001059036 |
Protein GI | 126439027 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.469808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA TCATGACCCA GACATTCGAC AGGCCCGCCA TCCCGGTGTA CGACAGCGTA TCCGGCTCGC TCGACCATCC CGACCTGATC CGGCTCGCGC CGGGGCTCGT CGCGGCGGCG TTCCGGCTGA TGAAGCTCGT GCCCGCGAAG TACATCATCG AGAACGCGAT CGCGAGCGGG CAGTTGAATC CCGGGATGCC GGTGCTCGAG ACGTCGAGCG GCACGTTCGC GATGGGCATC GGGATCGTCT GCGCGGAAAA GCGCATTCCG TTTCACATCG TCAGCGACGC GGCGATCGAC GAACGGCTGC AGGCGCGCCT GCGGCAGTTG GGCGGGCGCG TGCAGATCGT CGGGGCGAAC GCGACCGGCT CCAACGTCCA GGTGCTGCGG CTCGAAGCGC TGCAGGAGCG GTTGCGCGAG AACCCCGGCG GCTTCTGGCC GCGGCAGTAC GACAATCCGG ACAATCAGCA CGCGTACCGC GCGTTCGCCG CGCAACTGAT CCGCACGTTC GGCACGAATC TCACGATCGT CGGCACGGTC GGCTCCGGCG CGTCGACGTG CGGCACGATC CGGGCGCTGC GCGAGGTCGA TCCGTCGATT CCGCTCGTCG GCGTCGATAC GTTCGGCAGC GTGCTGTTCG GGCTGCCCGT CGGCCCGCGC GCGCTGCGCG GCCTCGGCAA TTCGATCTAT CCGAACAACC TCGACCACAC GTGCTTCGAC CAGGTGCACT GGGTCGCGCC CGACGCAGCC TTCGGGAGCA CGCGCCGGCT GCACCGGCAA CACGGGCTGT ATTGCGGGCC GACGTCCGGC GCGGCGTTCA TGGTCGCCGA ATGGCTGCGG GCGCAGCGCG ACGACGGCAG GACGATCGTG TTCATCGCGC CCGACGAAGG GCACCGCTAC GCCGACACGG TCTACGACGA CGCATGGCTG CGCGGGCAAG GGTACGCCGG CGCGCACGCC GCGCCGGCCG CCGCTCCCGT GCGGGCGGTG AGCCCGAACG CCGCGTCCGG CCCGTGGGCG TATGTCGAAT GGGGGCGCCG CACGTTCGAG CAGGCGAGCG GCCGCCCGCG TCCGGCGGGC AGCGCGCTCG AGCAGATCCG CGACGTCCGG CCCGCGCCCG TCGCATGA
|
Protein sequence | MNDIMTQTFD RPAIPVYDSV SGSLDHPDLI RLAPGLVAAA FRLMKLVPAK YIIENAIASG QLNPGMPVLE TSSGTFAMGI GIVCAEKRIP FHIVSDAAID ERLQARLRQL GGRVQIVGAN ATGSNVQVLR LEALQERLRE NPGGFWPRQY DNPDNQHAYR AFAAQLIRTF GTNLTIVGTV GSGASTCGTI RALREVDPSI PLVGVDTFGS VLFGLPVGPR ALRGLGNSIY PNNLDHTCFD QVHWVAPDAA FGSTRRLHRQ HGLYCGPTSG AAFMVAEWLR AQRDDGRTIV FIAPDEGHRY ADTVYDDAWL RGQGYAGAHA APAAAPVRAV SPNAASGPWA YVEWGRRTFE QASGRPRPAG SALEQIRDVR PAPVA
|
| |