Gene Information |
Locus tag | BURPS668_A1904 |
Symbol | |
ID | 4888849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 1850978 |
End bp | 1852960 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640131842 |
Product | cysteine desulphurases, SufS |
Protein accession | YP_001062899 |
Protein GI | 126443732 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
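The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1850978
end_bp = 1852960

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not appear
# in the protein, so the protein has one residue fewer than codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1983 0 660
```

Under these assumptions the result agrees with the reported Gene Length (1983 bp) and Protein Length (660 aa).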
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.256271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCCG CACCCGTCGC GCTGCCGGAC GCGGCGCCGC CCGCCGGGCT GCCCGATCCG GCGACGCTTG CGCGCCTTGC GTCGGAGTTT CTCGCCGCGC TGCCCGGGCA GCCCGCCGCG CCGAATGCCG GCGCGGGCAG CGGCGCCGTC GGCGGTGTGC CGTCGGCGTT GCCGGCCGCC GCGCCGATGC TTGCGTCGGT TTCGAATCCC GCGCCGCCGG GCTCGCCGCT TGCCGGGCCG GGCGGCACCG GCACGGGCGT GCCGGGCATC GGGGCGCCGC CGGGCAAGGT GCCCGGCGCG AACCTCGTAC CCGCGCCGAC ACATGTGCTG TCGCTCGGCA ATCGCACGCC CGCGCTCGTT GGGCACGCGG CCGCGCAAAA CGGATGGCCG GACAGCGCGG TTGCGATCGC GCCGGCGCTC GAGCCGCGCG CGGGCGGCGT CGCGCTCGGC GTGCCGCCCG TGCCGGAACC CGATGCCGTC CGTCGCGCGG GCGATGCGTC CGCGGCGGCG GCGCCTTCGC CTTGGTCGTA TTACTTTGTC GAGCCCGCCT CGGATGATTG GTGGCGCGAC GCCGCGCGCA CGCCGATCGA CGTGCCGCGC GACGGCGTCG CGTCGCCGCG CGCGTTCGGC CTGCCCGACG AAAACGCGTG GCGCGATCTG CTGTCGATCG GACGGCCGGC CGCCGATCGG CATCGCGCGT CGCGCTATTT CGTCGACGAC GCGCAGCCCA CGAATGCGCA TGCGCCTGGC GCCGGCGCGC ATCCGCCGTT CGACGTCGCC GCGATTCGCC GCGATTTCCC GATACTCGCC GAGCGGGTGA ACGGCAAGCC GCTCGTCTGG TTCGACAACG CGGCGACGAC GCACAAGCCG CAGGCGGTGA TCGATCGTCT CGCGCACTTC TATGCACACG AGAATTCGAA CATCCATCGC GCGGCGCATG CGCTCGCCGC GCGCGCGACC GACGCGTACG AGCACGCGCG CGCGACCGTG CAGCGCTTCA TCGGCGCGGC GTCGCCGGAC GAGATCGTGT TCGTGCGCGG CGCGACGGAG GCGATCAATC TGATCGCGAA AACATGGGGT GTCGGCAACG TCGGGGAAGG CGACGAGATC GTCGTGTCGC ATCTCGAGCA TCACGCGAAC ATCGTGCCGT GGCAGCAGCT CGCCGCGTCG GTGGGCGCCG CGCTGCGCGT GATTCCCGTC GACGATGCCG GCCAGGTCTT GCTCGGCGAG TACCGGAAGC TGCTCAACGA TCGCACGAAG ATCGTCTCCG TCACGCAGGT ATCGAACGCG CTCGGCACGG TCGTGCCGGT GAAGGAGATC GTCGAGCTCG CGCATCGCGC GGGCGCGAAG GTGCTCGTCG ACGGCGCACA GTCGATTTCG CACATGCGCG TCGACGTGCA GGCGCTCGAC GCCGATTTCT TCGTGTTCTC CGGCCACAAG ATCTACGGCC CGACGGGAAT CGGCGTCGTC TATGGCAAGC GCGCGCTGCT CGACGGCATG CCGCCGTGGC AAGGCGGCGG CAACATGATC GCGGACGTGA CGTTCGAGCG CACCGTATTC CAGCCGCCGC CGAACCGTTT CGAGGCGGGA ACGGGCAACA TCGCCGATGC GGTCGGGCTC GGTGCAGCGC TCGATTACGT GGCGCGGATC GGCATCGAGC GGATCGCGCG CTACGAGCAC GATCTGCTCG CCTATGCGGC GGGCGTGCTC GCGCCGGTGC CGGGTGTGCG GCTGATCGGC ACCGCGCGCG ATAAGGCGAG CGTGCTGTCG TTCGTGCTGA AGGGCTATGA GACGGAAGAA 
GTCGGGCGAG CGCTGAATGC GGCCGGCATC GCCGTGCGGT CCGGGCACCA CTGCGCGCAG CCGATTCTGC GCCGCTTCGG GCTCGAAGCG ACCGTGCGTG CGTCGCTCGC GTTCTACAAC ACGCGCGACG AGGTCGATGC GATGGTCGAC GTCGTGCGCG AGCTTGCGGC GCGGCGCATC TAG
|
Protein sequence | MSAAPVALPD AAPPAGLPDP ATLARLASEF LAALPGQPAA PNAGAGSGAV GGVPSALPAA APMLASVSNP APPGSPLAGP GGTGTGVPGI GAPPGKVPGA NLVPAPTHVL SLGNRTPALV GHAAAQNGWP DSAVAIAPAL EPRAGGVALG VPPVPEPDAV RRAGDASAAA APSPWSYYFV EPASDDWWRD AARTPIDVPR DGVASPRAFG LPDENAWRDL LSIGRPAADR HRASRYFVDD AQPTNAHAPG AGAHPPFDVA AIRRDFPILA ERVNGKPLVW FDNAATTHKP QAVIDRLAHF YAHENSNIHR AAHALAARAT DAYEHARATV QRFIGAASPD EIVFVRGATE AINLIAKTWG VGNVGEGDEI VVSHLEHHAN IVPWQQLAAS VGAALRVIPV DDAGQVLLGE YRKLLNDRTK IVSVTQVSNA LGTVVPVKEI VELAHRAGAK VLVDGAQSIS HMRVDVQALD ADFFVFSGHK IYGPTGIGVV YGKRALLDGM PPWQGGGNMI ADVTFERTVF QPPPNRFEAG TGNIADAVGL GAALDYVARI GIERIARYEH DLLAYAAGVL APVPGVRLIG TARDKASVLS FVLKGYETEE VGRALNAAGI AVRSGHHCAQ PILRRFGLEA TVRASLAFYN TRDEVDAMVD VVRELAARRI
|
| |
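The sequences above are printed in blocks of 10 characters separated by spaces. The sketch below shows one possible way to strip that spacing, compute a GC fraction, and translate a coding sequence. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAG even though the gene lies on the minus strand). Translation table 11 uses the same codon assignments as the standard code and differs only in the permitted start codons, so a standard codon table is used here. The helper names (clean_sequence, gc_fraction, translate) are hypothetical and only the first 30 and last 24 bases of the gene are used as samples.

```python
# Standard codon table built from the conventional TCAG ordering.
# Table 11 assigns the same amino acids to codons as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as printed in the listing.
head = clean_sequence("ATGAGTGCCG CACCCGTCGC GCTGCCGGAC")
# Final 24 bases of the gene sequence, with block spacing removed.
tail = clean_sequence("GAGCTTGCGGCGCGGCGCATCTAG")

print(translate(head))  # MSAAPVALPD
print(translate(tail))  # ELAARRI*
print(round(gc_fraction(head), 3))
```

The two translated fragments match the first ten and last seven residues of the protein sequence listed above. Applying gc_fraction to the full cleaned gene sequence would give a value to compare against the reported GC content.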