Gene BURPS1106A_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2021 
Symbol 
ID4901612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1984387 
End bp1985514 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID640135251 
Productcysteine synthase 
Protein accessionYP_001066286 
Protein GI126452738 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA TCATGACCCA GACATTCGAC AGGCCCGCCA TCCCGGTGTA CGACAGCGTA 
TCCGGCTCGC TCGACCATCC GGACCTGATC CGGCTCGCGC CGGGGCTCGT CGCGGCGGCG
TTCCGGCTGA TGAAGCTCGT GCCCGCGAAG TACATCATCG AGAACGCGAT CGCGAGCGGG
CAGTTGAATC CCGGGATGCC GGTGCTCGAG ACGTCGAGCG GCACGTTCGC GATGGGCATC
GGGATCGTCT GCGCGGAAAA GCGCATTCCG TTTCACATCG TCAGCGACGC GGCGATCGAC
GAACGGCTGC AGGCGCGCCT GCGGCAGTTG GGCGGGCGCG TGCAGATCGT CGGGGCGAAC
GCGACCGGCT CCAACGTCCA GGTGCTGCGG CTCGAAGCGC TGCAGGAGCG GTTGCGCGAG
AACCCCGGCG GCTTCTGGCC GCGGCAGTAC GACAATCCGG ACAATCAGCA CGCGTACCGC
GCGTTCGCCG CGCAACTGAT CCGCACGTTC GGCACGAATC TCACGATCGT CGGCACGGTC
GGCTCCGGCG CGTCGACGTG CGGCACGATC CGGGCGCTGC GCGAGGTCGA TCCGTCGATT
CCGCTCGTCG GCGTCGATAC GTTCGGCAGC GTGCTGTTCG GGCTGCCCGT CGGCCCGCGC
GCGCTGCGCG GCCTCGGCAA TTCGATCTAT CCGAACAACC TCGACCACAC GTGCTTCGAC
CAGGTGCACT GGGTCGCGCC CGACGAAGCC TTCGGGAGCA CGCGCCGGCT GCACCGGCAA
CACGGGCTGT ATTGCGGGCC GACGTCCGGC GCGGCGTTCA TGGTCGCCGA ATGGCTGCGG
GCGCAGCGCG ACGACGGCAG GACGATCGTG TTCATCGCGC CCGACGAAGG GCACCGCTAC
GCCGACACGG TCTACGACGA CGCATGGCTG CGCGGGCAAG GGTACGCCGG CGCGGACGCC
GCGCCGGCCG CCGCTCCCGT GCGGGCGGTG AGCCCGAACG CCGCGTCCGG CCCGTGGGCG
TACGTCGAAT GGGGGCGCCG CACGTTCGAG CAGGCGAGCG GCCGCCCGCG TCCGGCAGGC
AGCGCGCTCG AGCAGATCCG CGACGTCCGG CCCGCGCCCG TTGCATGA
 
Protein sequence
MNDIMTQTFD RPAIPVYDSV SGSLDHPDLI RLAPGLVAAA FRLMKLVPAK YIIENAIASG 
QLNPGMPVLE TSSGTFAMGI GIVCAEKRIP FHIVSDAAID ERLQARLRQL GGRVQIVGAN
ATGSNVQVLR LEALQERLRE NPGGFWPRQY DNPDNQHAYR AFAAQLIRTF GTNLTIVGTV
GSGASTCGTI RALREVDPSI PLVGVDTFGS VLFGLPVGPR ALRGLGNSIY PNNLDHTCFD
QVHWVAPDEA FGSTRRLHRQ HGLYCGPTSG AAFMVAEWLR AQRDDGRTIV FIAPDEGHRY
ADTVYDDAWL RGQGYAGADA APAAAPVRAV SPNAASGPWA YVEWGRRTFE QASGRPRPAG
SALEQIRDVR PAPVA