Gene BURPS1106A_3059 details


Gene Information

Locus tag: BURPS1106A_3059
Symbol: dcyD
ID: 4900572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2989200
End bp: 2990219
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 72%
IMG OID: 640136285
Product: D-cysteine desulfhydrase
Protein accession: YP_001067298
Protein GI: 126452372
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2515] 1-aminocyclopropane-1-carboxylate deaminase
TIGRFAM ID: [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family
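
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2989200, 2990219)
print(length_bp)                  # 1020
print(protein_length(length_bp))  # 339
```

Under these assumptions the computed values agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields.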


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACGA CGACGCTCTC GACGAAACTG AACGCCTTCG CCCGCCTGAA CCTGATCGAC 
GCGCCGACGC CGCTGCAGTA CTTGCCGCGC CTGTCCGCGC ACGTCGGCCG CGACATCCAC
GTGAAGCGCG ACGACTGCAC GCCGCTCGCG ATGGGCGGTA ACAAGCTGCG CAAGCTCGAA
TTCCTCGCCG CGGACGCGCT CGGCCAAAAC GCGGACGTGC TGGTGACCGC GGGCGCGATC
CAGTCGAATC ACGTCCGGCA GACGGCCGCG CTCGCCGCGC AGCTCGGCCT CGGCTGTGTC
GCGCTGCTCG AAAACCCGAT CGCAGCCGCG CGCGACGATT ATCTGCAAAG CGGCAACCGG
CTGCTGCTCG ATCTGTTCGA CGTGCGCGCG CACGTCGTCG ACGGGCTCGA CGACGTCGAC
CGGCAACTCG AAGCCGCCGC GCGGCGGCTG CGCGACGAAG GGCGGCGCCC GTACGTGATC
CCGATCGGCG GATCGAATCC GCTCGGCGCG CTCGGCTACG TGCGCGCGGG CCTCGAGCTC
GCGCAGCAGA TCCGCGCGGC CGAGCGCGAT TTCGCGGCTG TCGTGCTCGC GTCCGGCAGC
GCCGGCACGC ACGCGGGCCT CGCGTTCGCG CTCGCGCACG CGTTGCCCGG GCTGCCGGTG
ATCGGCGTGA CGGTATCGCG CACCGACGCG CAGCAGCGCC CGAAGGTGCG GCATCTGCTC
GACGGCATGA GCGGGCTGCT CGACGTCGCG TTGCCGGCGG GCGCACGCAT CGATCTGTGG
GACGACTACT TCGCGCCGCG CTACGGCGAG CCCAATCGCG CGGGCATCGA CGCGCTGCGG
CTGCTCGCGC GAACCGAGGG GCTGCTGCTC GATCCGGTGT ACACGGGCAA GGCGATGGCG
GGTCTCATCG ACGGCGTCGC GCGCGGCCGC TTCGACGGAA ACGGCCCGGT GCTGTTCGTG
CATACGGGCG GCGCGCCCGC GCTGTTCGCG TATCAGGACG CGTGCCGCGC GAGCCGATGA
 
Protein sequence
MNTTTLSTKL NAFARLNLID APTPLQYLPR LSAHVGRDIH VKRDDCTPLA MGGNKLRKLE 
FLAADALGQN ADVLVTAGAI QSNHVRQTAA LAAQLGLGCV ALLENPIAAA RDDYLQSGNR
LLLDLFDVRA HVVDGLDDVD RQLEAAARRL RDEGRRPYVI PIGGSNPLGA LGYVRAGLEL
AQQIRAAERD FAAVVLASGS AGTHAGLAFA LAHALPGLPV IGVTVSRTDA QQRPKVRHLL
DGMSGLLDVA LPAGARIDLW DDYFAPRYGE PNRAGIDALR LLARTEGLLL DPVYTGKAMA
GLIDGVARGR FDGNGPVLFV HTGGAPALFA YQDACRASR
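
The relationship between the gene sequence, the protein sequence, and the GC content field can be checked with a short script. This is a minimal sketch, not the database's own procedure. It assumes the sequence is pasted with the 10-base spacing shown above, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ only in which codons are permitted as starts, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence above.
first_row = "ATGAACACGA CGACGCTCTC GACGAAACTG AACGCCTTCG CCCGCCTGAA CCTGATCGAC"
print(translate(first_row))  # MNTTTLSTKLNAFARLNLID
print(round(gc_fraction(first_row), 2))
```

The translated first row matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions would allow the 339 aa length and the 72% GC content fields to be checked in the same way.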