Gene BURPS668_A1904 details


Gene Information

Locus tag: BURPS668_A1904
Symbol:
ID: 4888849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1850978
End bp: 1852960
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 71%
IMG OID: 640131842
Product: cysteine desulphurases, SufS
Protein accession: YP_001062899
Protein GI: 126443732
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
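
The coordinate and length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1850978, 1852960)
print(length_bp)                  # 1983
print(protein_length(length_bp))  # 660
```

Both results agree with the Gene Length and Protein Length fields of this record.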


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.256271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGCCG CACCCGTCGC GCTGCCGGAC GCGGCGCCGC CCGCCGGGCT GCCCGATCCG 
GCGACGCTTG CGCGCCTTGC GTCGGAGTTT CTCGCCGCGC TGCCCGGGCA GCCCGCCGCG
CCGAATGCCG GCGCGGGCAG CGGCGCCGTC GGCGGTGTGC CGTCGGCGTT GCCGGCCGCC
GCGCCGATGC TTGCGTCGGT TTCGAATCCC GCGCCGCCGG GCTCGCCGCT TGCCGGGCCG
GGCGGCACCG GCACGGGCGT GCCGGGCATC GGGGCGCCGC CGGGCAAGGT GCCCGGCGCG
AACCTCGTAC CCGCGCCGAC ACATGTGCTG TCGCTCGGCA ATCGCACGCC CGCGCTCGTT
GGGCACGCGG CCGCGCAAAA CGGATGGCCG GACAGCGCGG TTGCGATCGC GCCGGCGCTC
GAGCCGCGCG CGGGCGGCGT CGCGCTCGGC GTGCCGCCCG TGCCGGAACC CGATGCCGTC
CGTCGCGCGG GCGATGCGTC CGCGGCGGCG GCGCCTTCGC CTTGGTCGTA TTACTTTGTC
GAGCCCGCCT CGGATGATTG GTGGCGCGAC GCCGCGCGCA CGCCGATCGA CGTGCCGCGC
GACGGCGTCG CGTCGCCGCG CGCGTTCGGC CTGCCCGACG AAAACGCGTG GCGCGATCTG
CTGTCGATCG GACGGCCGGC CGCCGATCGG CATCGCGCGT CGCGCTATTT CGTCGACGAC
GCGCAGCCCA CGAATGCGCA TGCGCCTGGC GCCGGCGCGC ATCCGCCGTT CGACGTCGCC
GCGATTCGCC GCGATTTCCC GATACTCGCC GAGCGGGTGA ACGGCAAGCC GCTCGTCTGG
TTCGACAACG CGGCGACGAC GCACAAGCCG CAGGCGGTGA TCGATCGTCT CGCGCACTTC
TATGCACACG AGAATTCGAA CATCCATCGC GCGGCGCATG CGCTCGCCGC GCGCGCGACC
GACGCGTACG AGCACGCGCG CGCGACCGTG CAGCGCTTCA TCGGCGCGGC GTCGCCGGAC
GAGATCGTGT TCGTGCGCGG CGCGACGGAG GCGATCAATC TGATCGCGAA AACATGGGGT
GTCGGCAACG TCGGGGAAGG CGACGAGATC GTCGTGTCGC ATCTCGAGCA TCACGCGAAC
ATCGTGCCGT GGCAGCAGCT CGCCGCGTCG GTGGGCGCCG CGCTGCGCGT GATTCCCGTC
GACGATGCCG GCCAGGTCTT GCTCGGCGAG TACCGGAAGC TGCTCAACGA TCGCACGAAG
ATCGTCTCCG TCACGCAGGT ATCGAACGCG CTCGGCACGG TCGTGCCGGT GAAGGAGATC
GTCGAGCTCG CGCATCGCGC GGGCGCGAAG GTGCTCGTCG ACGGCGCACA GTCGATTTCG
CACATGCGCG TCGACGTGCA GGCGCTCGAC GCCGATTTCT TCGTGTTCTC CGGCCACAAG
ATCTACGGCC CGACGGGAAT CGGCGTCGTC TATGGCAAGC GCGCGCTGCT CGACGGCATG
CCGCCGTGGC AAGGCGGCGG CAACATGATC GCGGACGTGA CGTTCGAGCG CACCGTATTC
CAGCCGCCGC CGAACCGTTT CGAGGCGGGA ACGGGCAACA TCGCCGATGC GGTCGGGCTC
GGTGCAGCGC TCGATTACGT GGCGCGGATC GGCATCGAGC GGATCGCGCG CTACGAGCAC
GATCTGCTCG CCTATGCGGC GGGCGTGCTC GCGCCGGTGC CGGGTGTGCG GCTGATCGGC
ACCGCGCGCG ATAAGGCGAG CGTGCTGTCG TTCGTGCTGA AGGGCTATGA GACGGAAGAA
GTCGGGCGAG CGCTGAATGC GGCCGGCATC GCCGTGCGGT CCGGGCACCA CTGCGCGCAG
CCGATTCTGC GCCGCTTCGG GCTCGAAGCG ACCGTGCGTG CGTCGCTCGC GTTCTACAAC
ACGCGCGACG AGGTCGATGC GATGGTCGAC GTCGTGCGCG AGCTTGCGGC GCGGCGCATC
TAG
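
The GC content field of this record is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as shown above, in whitespace-separated blocks of 10 bases. The function name `gc_content` is illustrative, and how the source database rounds the reported percentage is not stated in the record.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence above: 6 of its 10 bases are G or C.
print(gc_content("ATGAGTGCCG"))  # 0.6
```

Passing the complete gene sequence to the same function gives the gene-wide fraction.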
 
Protein sequence
MSAAPVALPD AAPPAGLPDP ATLARLASEF LAALPGQPAA PNAGAGSGAV GGVPSALPAA 
APMLASVSNP APPGSPLAGP GGTGTGVPGI GAPPGKVPGA NLVPAPTHVL SLGNRTPALV
GHAAAQNGWP DSAVAIAPAL EPRAGGVALG VPPVPEPDAV RRAGDASAAA APSPWSYYFV
EPASDDWWRD AARTPIDVPR DGVASPRAFG LPDENAWRDL LSIGRPAADR HRASRYFVDD
AQPTNAHAPG AGAHPPFDVA AIRRDFPILA ERVNGKPLVW FDNAATTHKP QAVIDRLAHF
YAHENSNIHR AAHALAARAT DAYEHARATV QRFIGAASPD EIVFVRGATE AINLIAKTWG
VGNVGEGDEI VVSHLEHHAN IVPWQQLAAS VGAALRVIPV DDAGQVLLGE YRKLLNDRTK
IVSVTQVSNA LGTVVPVKEI VELAHRAGAK VLVDGAQSIS HMRVDVQALD ADFFVFSGHK
IYGPTGIGVV YGKRALLDGM PPWQGGGNMI ADVTFERTVF QPPPNRFEAG TGNIADAVGL
GAALDYVARI GIERIARYEH DLLAYAAGVL APVPGVRLIG TARDKASVLS FVLKGYETEE
VGRALNAAGI AVRSGHHCAQ PILRRFGLEA TVRASLAFYN TRDEVDAMVD VVRELAARRI
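
The protein sequence above corresponds to translating the gene sequence codon by codon. The sketch below builds the codon table from the standard compact layout (bases in the order T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons. Since this gene starts with ATG, the sketch makes no attempt to handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGTGCCG CACCCGTCGC GCTGCCGGAC"))  # MSAAPVALPD
```

The output matches the first 10 residues of the protein sequence. Each row of the gene sequence holds 60 bases, so every row starts in frame and can be checked against the protein sequence on its own.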