Gene BURPS1106A_A1818 details


Gene Information

Locus tag: BURPS1106A_A1818
Symbol:
ID: 4905147
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1784748
End bp: 1786730
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 71%
IMG OID: 640144924
Product: SufS family cysteine desulfurase
Protein accession: YP_001075852
Protein GI: 126458040
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
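
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1784748
END_BP = 1786730
GENE_LENGTH_BP = 1983
PROTEIN_LENGTH_AA = 660


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode the protein plus its stop codon(s)."""
    return 3 * (protein_aa + stop_codons)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 1983
    print(coding_length(PROTEIN_LENGTH_AA))  # 1983
```

Both values agree with the listed gene length of 1983 bp: 660 codons for the protein plus one stop codon.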


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.295345
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGCCG CACCTGTCGC GCTGCCGGAC GCGGCGCCGC CCGCCGGGCT GCCCGATCCG 
GCGACGCTTG CGCGCCTTGC GTCGGAGTTT CTCGCCGCGC TGCCCGGGCA GCCCGCCGCG
CCGAATGCCG GCGCGGGCAG CGGCGCCGTC GGCGGTGTGC CGTCGGCGTT GCCGGCCGCC
GCGCCGATGC TTGCGTCGGT TTCGAATCCC GCGCCGCCGG GCTCGCCGCT TGCCGGGCCC
GGCGGCACCG GCACGGGCGT GCCGGGCATC GGGGTGCCGC CGGGCAAGGT GCCCGGCGCG
AACCTCGTAC CCGCGCCGAC ACATGTGCTG TCGCTCGGCA ACCGCACGCC CGCGCTCGTT
GGGCACGCGG CCGCGCAAAA CGGATGGCCG GACAGCGCGG TTGCGATCGC GCCGGCGCTC
GAGCCGCGCG CGGGCGGCGT CGCGCTCGGC GTGCCGCCCG TGCCGGAACC CGATGCCGTC
CGTCGCGCGG GCGATGCGTC CGCGGCGGCG GCGCCTTCGC CTTGGTCGTA TTACTTTGTC
GAGCCCGCCT CGGATGATTG GTGGCGCGAC GCCGCGCGCA CGCCGATCGA CGTGCCGCGC
GACGGCGTCG CGTCGCCGCG CGCGTTCGGC CTGCCCGACG AAAACGCGTG GCGCGATCTG
CTGTCGATCG GATGGCCGGC CGCCGATCGG CATCGCGCGT CGCGCTATTT CGTCGACGAC
GCGCAGCCCA CGAATGCGCA TGCGCCTGGC GCCGGCGCGC ATCCGCCGTT CGACATCGCC
GCGATTCGCC GCGATTTCCC GATACTCGCC GAGCGGGTGA ACGGCAAGCC GCTCGTCTGG
TTCGACAACG CGGCGACGAC GCACAAGCCG CAGGCGGTGA TCGATCGTCT CGCGCACTTC
TATGCACACG AGAATTCGAA CATCCATCGC GCGGCGCATG CGCTCGCCGC GCGCGCGACC
GACGCGTACG AGCACGCGCG CGCGACCGTG CAGCGCTTCA TCGGCGCGGC GTCGCCGGAC
GAGATCGTGT TCGTGCGCGG CGCGACGGAG GCGATCAATC TGATTGCGAA AACATGGGGT
GTCGGCAACG TCGGGGAAGG CGACGAGATC GTCGTGTCGC ATCTCGAGCA TCACGCGAAC
ATCGTGCCGT GGCAGCAGCT CGCCGCGTCG GTGGGCGCCG CGCTGCGCGT GATTCCCGTC
GACGATGCCG GCCAGGTCTT GCTCGGCGAG TACCGGAAGC TGCTCAACGA TCGCACGAAG
ATCGTCTCCG TCACGCAGGT ATCGAACGCG CTCGGCACGG TCGTGCCGGT GAAGGAGATC
GTCGAGCTCG CGCATCGCGC GGGCGCGAAG GTGCTCGTCG ACGGCGCACA GTCGATTTCG
CACATGCGCG TCGACGTGCA GGCGCTCGAC GCCGATTTCT TCGTGTTCTC CGGCCACAAG
ATCTACGGCC CGACGGGAAT CGGCGTCGTC TATGGCAAGC GCGCGCTGCT CGACGGCATG
CCGCCGTGGC AAGGCGGCGG CAACATGATC GCGGACGTGA CGTTCGAGCG CACCGTATTC
CAGCCGCCGC CGAACCGTTT CGAGGCGGGA ACGGGCAACA TCGCCGATGC GGTCGGGCTC
GGTGCGGCGC TCGATTACGT GGCGCGGATC GGCATCGAGC GGATCGCGCG CTACGAGCAC
GATCTGCTCG CCTATGCGGC GGGCGTGCTC GCGCCGGTGC CGGGTGTGCG GCTGATCGGC
ACCGCGCGCG ATAAGGCGAG TGTGCTGTCG TTCGTGCTGA AGGGCTATGA GACGGAAGAA
GTCGGGCGAG CGCTGAATGC GGCCGGCATC GCCGTGCGGT CCGGGCACCA CTGCGCGCAG
CCGATTCTGC GCCGCTTCGG GCTCGAAGCG ACCGTGCGTG CGTCGCTCGC GTTCTACAAC
ACGCGCGACG AGGTCGATGC GATGGTCGAC GTCGTGCGCG AGCTTGCGGC GCGGCGCATC
TAG
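
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation such as the GC content field. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the exact method used by the source database is not stated.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full sequence
    # to compare against the reported GC content.
    first_line = (
        "ATGAGCGCCG CACCTGTCGC GCTGCCGGAC GCGGCGCCGC CCGCCGGGCT GCCCGATCCG"
    )
    print(f"{gc_fraction(first_line):.0%}")
```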
 
Protein sequence
MSAAPVALPD AAPPAGLPDP ATLARLASEF LAALPGQPAA PNAGAGSGAV GGVPSALPAA 
APMLASVSNP APPGSPLAGP GGTGTGVPGI GVPPGKVPGA NLVPAPTHVL SLGNRTPALV
GHAAAQNGWP DSAVAIAPAL EPRAGGVALG VPPVPEPDAV RRAGDASAAA APSPWSYYFV
EPASDDWWRD AARTPIDVPR DGVASPRAFG LPDENAWRDL LSIGWPAADR HRASRYFVDD
AQPTNAHAPG AGAHPPFDIA AIRRDFPILA ERVNGKPLVW FDNAATTHKP QAVIDRLAHF
YAHENSNIHR AAHALAARAT DAYEHARATV QRFIGAASPD EIVFVRGATE AINLIAKTWG
VGNVGEGDEI VVSHLEHHAN IVPWQQLAAS VGAALRVIPV DDAGQVLLGE YRKLLNDRTK
IVSVTQVSNA LGTVVPVKEI VELAHRAGAK VLVDGAQSIS HMRVDVQALD ADFFVFSGHK
IYGPTGIGVV YGKRALLDGM PPWQGGGNMI ADVTFERTVF QPPPNRFEAG TGNIADAVGL
GAALDYVARI GIERIARYEH DLLAYAAGVL APVPGVRLIG TARDKASVLS FVLKGYETEE
VGRALNAAGI AVRSGHHCAQ PILRRFGLEA TVRASLAFYN TRDEVDAMVD VVRELAARRI
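
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch in plain Python, assuming the codon assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which this sketch does not model); `translate` is a hypothetical helper, not a function of the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; spaces and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon reached; do not emit it
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAGCGCCG CACCTGTCGC GCTGCCGGAC"))  # MSAAPVALPD
```

The first ten codons give MSAAPVALPD, which matches the start of the protein sequence listed above.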