Gene BURPS1106A_A2865 details

Gene Information

Locus tag: BURPS1106A_A2865
Symbol: iscS-2
ID: 4906204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2807186
End bp: 2808307
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 66%
IMG OID: 640145968
Product: cysteine desulfurase
Protein accession: YP_001076894
Protein GI: 126458359
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID:
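
The coordinate and length fields above are mutually consistent, which can be checked with a short Python sketch. It assumes the start and end coordinates are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 2807186
end_bp = 2808307

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values match the listed Gene Length (1122 bp) and Protein Length (373 aa).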


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCATA TCTACCTCGA CTACAACGCA AGCACCCCCA TCGACCCGGC GGTCGCGGAT 
GCGATGCGCC CGCTGCTGCC TGAAGCCTTC GGCAATCCGT CCAGCGGACA TTGGGCAAGC
GTGCCGGCAA AGGTTGCACT GGATCGGGCG CGCGCCCAGG TCGCGACGCT GATCGGCGCC
GCACCGGACG AAATTGTCTT CACGAGCGGC GGCAGTGAAG CGAACAACCT TGCGCTGAAA
GGAACTGTAC TCGCATCGTC GAACGCAGGC GGCCACGTGA TCACCCAGTC GACCGAACAT
CCGGCAATCC TGAAGCCGCT GCAATTCCTT GAGCGACTGG GCACTAAAGT CACCTGCGTT
CCCGTGGACG GCAACGGCCG CGTCGATCCC GACGCCATCC GTCGCGCAAT CACGCCGCGC
ACCACGCTCA TCACGGTCAT GCATGCGAAC AACGAAACCG GCACCATTCA GCCGATCGAG
GATATCGGCA CGATCGCCCG CGAGCACGGC ATCCGGTTTC ACACGGACGC GGCCCAGTCG
ATCGGCAAGA TCCCGGCCGA CGTCGATGCG CTCGGCGTCG ATCTGCTGTC GATCGCCGGT
CACAAGCTCT ACGCGCCGAA GGGCATCGGC GCGCTCTACG TTCGAAAGGG AGTCGGGCTC
GAGCCGCTGA TTCACGGCGC AGGCCATGAA AATGGCCGGC GTGCGGGGAC GGAATCCGCA
CTCCTCGCGG CAGGGTTCGG CGAGGCCTGT GCGCTCGCGC GCGATCTCGC GGCGATGGAC
GGCATTCGCG CGTTGCGCGA CCGGCTGTGG CACGCGTTGC AGGATCGCTT CGACGATCGC
GTTCGCCTGA ACGGACATCT CGAGTACCGC TTGCCGAACA CGGTGAACGT GTCGTTCGTC
GGCCGCGTCG GCGCTGACGT GCTTGCCGCG CTCGACGGCG TCGCCGCGTC CACCGGTTCG
GCCTGCCACG CCGGACGCAT GACGCTGTCG CCGGTGCTGG CCGCGATGGG CATCGCCGAG
CAGGTCGGGA TGGGGGCCAT CCGATTCAGC CTCGGCCGGC ACACGACAGC CGACGAGATC
GATACCGTCG TCGCGAAATT GAGCGCGATC GCCCGTGGCT GA
 
Protein sequence
MTHIYLDYNA STPIDPAVAD AMRPLLPEAF GNPSSGHWAS VPAKVALDRA RAQVATLIGA 
APDEIVFTSG GSEANNLALK GTVLASSNAG GHVITQSTEH PAILKPLQFL ERLGTKVTCV
PVDGNGRVDP DAIRRAITPR TTLITVMHAN NETGTIQPIE DIGTIAREHG IRFHTDAAQS
IGKIPADVDA LGVDLLSIAG HKLYAPKGIG ALYVRKGVGL EPLIHGAGHE NGRRAGTESA
LLAAGFGEAC ALARDLAAMD GIRALRDRLW HALQDRFDDR VRLNGHLEYR LPNTVNVSFV
GRVGADVLAA LDGVAASTGS ACHAGRMTLS PVLAAMGIAE QVGMGAIRFS LGRHTTADEI
DTVVAKLSAI ARG
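
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal Python sketch of that step, together with a simple GC-fraction calculation of the kind summarized by the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers introduced here, not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence as printed in the record.
first_row = "ATGACGCATA TCTACCTCGA CTACAACGCA AGCACCCCCA TCGACCCGGC GGTCGCGGAT"
joined = clean_sequence(first_row)

print(len(joined))                 # number of bases in the first row
print(joined[:3])                  # first codon
print(round(gc_fraction(joined), 2))
```

Passing the full gene sequence (all rows concatenated) through the same two functions would give a value comparable to the listed GC content, assuming that field is computed over the coding sequence alone.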