Gene BURPS1710b_2157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2157 
Symbol 
ID3689433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2361428 
End bp2362555 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID637728613 
Productcysteine synthase 
Protein accessionYP_333552 
Protein GI76811543 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA TCATGACCCA GACATTCGAC AGGCCCGCCA TCCCGGTGTA CGACAGCGTA 
TCCGGCTCGC TCGACCATCC CGACCTGATC CGGCTCGCGC CGGGGCTCGT CGCGGCGGCG
TTCCGGCTGA TGAAGCTCGT GCCCGCGAAG TACATCATCG AGAACGCGAT CGCGAGCGGG
CAGTTGAATC CCGGGATGCC GGTGCTCGAG ACGTCGAGCG GCACGTTCGC GATGGGCATC
GGGATCGTCT GCGCGGAAAA GCGCATTCCG TTTCACATCG TCAGCGACGC GGCGATCGAC
GAACGGCTGC AGGCGCGCCT GCGGCAGTTG GGTGGGCGCG TGCAGATCGT CGGGGCGAAC
GCGACCGGCT CCAACGTCCA GGTGCTGCGG CTCGAAGCGC TGCAGGAGCG GTTGCGCGAG
AACCCCGGCG GCTTCTGGCC GCGGCAGTAC GACAATCCGG ACAATCAGCA CGCGTACCGC
GCGTTCGCCG CGCAACTGAT CCGCACGTTC GGCACGAATC TCACGATCGT CGGCACGGTC
GGCTCCGGCG CGTCGACGTG CGGCACGATC CGGGCGCTGC GCGAGGTCGA TCCGTCGATT
CCGCTCGTCG GCGTCGATAC GTTCGGCAGC GTGCTGTTCG GGCTGCCCGT CGGCCCGCGC
GCGCTGCGCG GCCTCGGCAA TTCGATCTAT CCGAACAACC TCGACCACAC GTGCTTCGAC
CAGGTGCACT GGGTCGCGCC CGACGCAGCC TTCGGGAGCA CGCGCCGGCT GCACCGGCAA
CACGGGCTGT ATTGCGGGCC GACGTCCGGC GCGGCGTTCA TGGTCGCCGA ATGGCTGCGG
GCGCAGCGCG ACGACGGCAG GACGATCGTG TTCATCGCGC CCGACGAAGG GCACCGCTAC
GCCGACACGG TCTACGACGA CGCATGGCTG CGCGGGCAAG GGTACGCCGG CGCGGACGCC
GCGCCGGCCG CCGCTCCCGT GCGGGCGGTG AGCCCGAACG CCGCGTCCGG CCCGTGGGCG
TACGTCGAAT GGGGGCGCCG CACGTTCGAG CAGGCGAGCG GCCGCCCGCG TCCGGCAGGC
AGCGCGCTCG AGCAGATCCG CGACGTCCGG CCCGCGCCCG TTGCATGA
 
Protein sequence
MNDIMTQTFD RPAIPVYDSV SGSLDHPDLI RLAPGLVAAA FRLMKLVPAK YIIENAIASG 
QLNPGMPVLE TSSGTFAMGI GIVCAEKRIP FHIVSDAAID ERLQARLRQL GGRVQIVGAN
ATGSNVQVLR LEALQERLRE NPGGFWPRQY DNPDNQHAYR AFAAQLIRTF GTNLTIVGTV
GSGASTCGTI RALREVDPSI PLVGVDTFGS VLFGLPVGPR ALRGLGNSIY PNNLDHTCFD
QVHWVAPDAA FGSTRRLHRQ HGLYCGPTSG AAFMVAEWLR AQRDDGRTIV FIAPDEGHRY
ADTVYDDAWL RGQGYAGADA APAAAPVRAV SPNAASGPWA YVEWGRRTFE QASGRPRPAG
SALEQIRDVR PAPVA