Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I1794 |
Symbol | |
ID | 3848969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 2008914 |
End bp | 2010152 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637841463 |
Product | cysteine desulfurase SufS |
Protein accession | YP_442326 |
Protein GI | 83720072 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.885674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGACA TGCTGCGTTC GACCACGATG GCGAGCGCCT TCCCGGCGCT CGCGCAGCGC GTGAACGGCG CGCCGCTCGC GTATCTCGAC AACGCGGCGA CGACGCACGT GCCGCAGCCC GTTCTCGCCG CGATCCGCGG CTTCGACGAG CACGATCGCG CCAACATCCA CCGCGGCGTC CACACGCTCA GCCAACGCGC CACCGATGCC TACGAGCGCT CGCGCGACAC GCTCGCGCGC TTCGTCGGTG CAAGCGACGA TCACCTGCTC GTCTTCACGT CGGGCACGAC CGATTCGCTG AATCTCGTCG CGCACGGGCT GTCGCTCGCG GGCCACACGC GATCCGTGCT GCGGGAAGGC GACGAGATCG TCGTCAGCGC ACTCGAGCAT CACGCGAACC TCGTCCCGTG GCAGATGGCC GCGCGTCGTT GCGGCGCGAC ACTGCGAATC CTGCATCCCG ATTCGCACGG CCGCCTGCAT GTGCAGGATC TCGAACGATT GCTGACGCCG CGCACGCGCG TGTTCGCGGT CACGGCGTGC TCGAACGCGA CGGGCGAGCG GCCGCCCTAC GAGGCGCTGC TCGCCGCCGC ACGGGCGGCC GGCGCGCTGA CGGTGCTCGA CGCCGCTCAG GCGGTAGGCC ATGAAGTGCC GGACCTGTCA AAGCTCGCGT GCGATTTCAT GGCGTTCTCC GGCCACAAGA TGTACGGGCC GATGGGAACG GGCGCGCTCG TCGGCCGCCG GGACGCGCTC GAGCGGCTCG TCCCGCTGCG CTTCGGCGGC GACATGGTGA GCTGGGTCGG CGAGACGGAC GCGACGTTCG CCGCGCTGCC CGCGCGGCTC GAAGGCGGCA CGCCGAACGT CGCCGGCGCG GTCGGAATCG CGGCCGCCGC CGACTATATC GACGCGATCG GCCGCACCCT GATCGATGCC CACGTGCGCG CGCTGCGCGA TCACGCCGCA GCGGGTCTCG CGGCGCTCGA CGGCGTGACC GTGCTCGCGC CGCAGTCGCC TTCGGCGATC GTGTCCTTCG TCGCCGACGA CGCGCATCCG CACGATATCG GCACATTACT CGACGAGCGC GGCATCGCCG TGCGCACGGG TTTCCACTGC GCGCAGCCGC TTCTCGACAG GCTCGGCTGC GGGCCGACGA CGCGCGCGTC GTTCGCGCTC TACAACACGC ACGACGAGGT CGAGCGCCTC GTCGCGGGCG TCGCGCAAGC ATTGAAGGTA CTGAGATGA
|
Protein sequence | MGDMLRSTTM ASAFPALAQR VNGAPLAYLD NAATTHVPQP VLAAIRGFDE HDRANIHRGV HTLSQRATDA YERSRDTLAR FVGASDDHLL VFTSGTTDSL NLVAHGLSLA GHTRSVLREG DEIVVSALEH HANLVPWQMA ARRCGATLRI LHPDSHGRLH VQDLERLLTP RTRVFAVTAC SNATGERPPY EALLAAARAA GALTVLDAAQ AVGHEVPDLS KLACDFMAFS GHKMYGPMGT GALVGRRDAL ERLVPLRFGG DMVSWVGETD ATFAALPARL EGGTPNVAGA VGIAAAADYI DAIGRTLIDA HVRALRDHAA AGLAALDGVT VLAPQSPSAI VSFVADDAHP HDIGTLLDER GIAVRTGFHC AQPLLDRLGC GPTTRASFAL YNTHDEVERL VAGVAQALKV LR
|
| |