Gene BURPS668_2001 details


Gene Information

Locus tag: BURPS668_2001
Symbol:
ID: 4883303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1978932
End bp: 1980059
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 71%
IMG OID: 640127929
Product: cysteine synthase
Protein accession: YP_001059036
Protein GI: 126439027
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
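
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon (the record states the gene is not spliced and not a pseudogene). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1978932
END_BP = 1980059
GENE_LENGTH_BP = 1128
PROTEIN_LENGTH_AA = 375


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 1980059 - 1978932 + 1 gives 1128 bp, and 1128 / 3 - 1 gives 375 aa, so the listed fields agree with one another under these assumptions.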


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.469808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACA TCATGACCCA GACATTCGAC AGGCCCGCCA TCCCGGTGTA CGACAGCGTA 
TCCGGCTCGC TCGACCATCC CGACCTGATC CGGCTCGCGC CGGGGCTCGT CGCGGCGGCG
TTCCGGCTGA TGAAGCTCGT GCCCGCGAAG TACATCATCG AGAACGCGAT CGCGAGCGGG
CAGTTGAATC CCGGGATGCC GGTGCTCGAG ACGTCGAGCG GCACGTTCGC GATGGGCATC
GGGATCGTCT GCGCGGAAAA GCGCATTCCG TTTCACATCG TCAGCGACGC GGCGATCGAC
GAACGGCTGC AGGCGCGCCT GCGGCAGTTG GGCGGGCGCG TGCAGATCGT CGGGGCGAAC
GCGACCGGCT CCAACGTCCA GGTGCTGCGG CTCGAAGCGC TGCAGGAGCG GTTGCGCGAG
AACCCCGGCG GCTTCTGGCC GCGGCAGTAC GACAATCCGG ACAATCAGCA CGCGTACCGC
GCGTTCGCCG CGCAACTGAT CCGCACGTTC GGCACGAATC TCACGATCGT CGGCACGGTC
GGCTCCGGCG CGTCGACGTG CGGCACGATC CGGGCGCTGC GCGAGGTCGA TCCGTCGATT
CCGCTCGTCG GCGTCGATAC GTTCGGCAGC GTGCTGTTCG GGCTGCCCGT CGGCCCGCGC
GCGCTGCGCG GCCTCGGCAA TTCGATCTAT CCGAACAACC TCGACCACAC GTGCTTCGAC
CAGGTGCACT GGGTCGCGCC CGACGCAGCC TTCGGGAGCA CGCGCCGGCT GCACCGGCAA
CACGGGCTGT ATTGCGGGCC GACGTCCGGC GCGGCGTTCA TGGTCGCCGA ATGGCTGCGG
GCGCAGCGCG ACGACGGCAG GACGATCGTG TTCATCGCGC CCGACGAAGG GCACCGCTAC
GCCGACACGG TCTACGACGA CGCATGGCTG CGCGGGCAAG GGTACGCCGG CGCGCACGCC
GCGCCGGCCG CCGCTCCCGT GCGGGCGGTG AGCCCGAACG CCGCGTCCGG CCCGTGGGCG
TATGTCGAAT GGGGGCGCCG CACGTTCGAG CAGGCGAGCG GCCGCCCGCG TCCGGCGGGC
AGCGCGCTCG AGCAGATCCG CGACGTCCGG CCCGCGCCCG TCGCATGA
 
Protein sequence
MNDIMTQTFD RPAIPVYDSV SGSLDHPDLI RLAPGLVAAA FRLMKLVPAK YIIENAIASG 
QLNPGMPVLE TSSGTFAMGI GIVCAEKRIP FHIVSDAAID ERLQARLRQL GGRVQIVGAN
ATGSNVQVLR LEALQERLRE NPGGFWPRQY DNPDNQHAYR AFAAQLIRTF GTNLTIVGTV
GSGASTCGTI RALREVDPSI PLVGVDTFGS VLFGLPVGPR ALRGLGNSIY PNNLDHTCFD
QVHWVAPDAA FGSTRRLHRQ HGLYCGPTSG AAFMVAEWLR AQRDDGRTIV FIAPDEGHRY
ADTVYDDAWL RGQGYAGAHA APAAAPVRAV SPNAASGPWA YVEWGRRTFE QASGRPRPAG
SALEQIRDVR PAPVA
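
The gene and protein sequences above are related through translation table 11. As a rough illustration, the sketch below translates a nucleotide string codon by codon and computes its GC percentage. It assumes the input is the coding strand in frame, starting at an ATG, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence as listed in this record.
first_codons = "ATGAACGACA TCATGACCCA GACATTCGAC"
print(translate(first_codons))  # MNDIMTQTFD
```

The printed peptide matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.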