Gene SbBS512_E1047 details

Gene Information

Locus tag: SbBS512_E1047
Symbol: dcyD
ID: 6271005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 960762
End bp: 961748
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 54%
IMG OID: 641725189
Product: D-cysteine desulfhydrase
Protein accession: YP_001879708
Protein GI: 187733033
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2515] 1-aminocyclopropane-1-carboxylate deaminase
TIGRFAM ID: [TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family
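
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record; variable names are illustrative.
start_bp = 960762
end_bp = 961748

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in triplets; the last codon is the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)  # prints: 987 329 328
```

The result agrees with the Gene Length (987 bp) and Protein Length (328 aa) fields.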


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.106931
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGCTGGAGT TTATCGGCGC GCCAACGCCG 
CTCGAATATC TGCCGCGCTT TTCTGATTAT CTTGGACGGG AAATTTTCAT CAAACGGGAT
GACGTCACAC CCATGGCAAT GGGCGGCAAT AAATTACGTA AGCTGGAATT TCTCGCGGCA
GATGCCCTGC GCGAAGGTGC CGATACGCTG ATTACTGCCG GCGCGATCCA GTCTAACCAT
GTGCGGCAGA CTGCCGCAGT TGCGGCGAAA CTCGGCCTGC ACTGCGTGGC GCTGCTGGAA
AATCCTATTG GCACAACCGC AGAAAACTAT TTAACCAACG GCAATCGTTT GTTGCTGAAT
CTGTTCAATA CCCAGATTGA AATGTGCGAC GCACTGACCG ATCCCAATGC CCAACTGGAA
GAGCTGGCGA CGCGAGTCGA AGCACAAGGC TTTCGCCCGT ATGTCATTCC GGTTGGCGGT
TCTAATGCTC TGGGCGCGCT GGGTTATGTG GAGAGTGCGC TGGAAATCGC GCAACAGTGT
GAAGGGGCGG TTAATATTTC GTCGGTGGTG GTCGCATCGG GCAGTGCCGG AACTCACGCC
GGATTGGCTG TTGGGCTGGA ACACCTGATG CCTGAAAGCG AACTGATTGG CGTGACCGTG
TCGCGTTCCG TTGCCGATCA ATTGCCGAAA GTGGTTAACC TACAACAGGC GATTGCGAAA
GAACTGGAGC TGACCGCATC AGCGGAAATT TTACTCTGGG ATGACTATTT TGCACCTGGC
TACGGCGTGC CGAACGACGA AGGCATGGAA GCAGTGAAAT TGCTGGCGCG TCTGGAAGGC
ATTCTGCTTG ATCCTGTGTA TACCGGAAAA GCGATGGCGG GGCTGATTGA CGGTATCAGT
CAGAAACGCT TCAAAGATGA AGGGCCGATT CTGTTTATTC ATACCGGCGG CGCGCCTGCG
CTGTTCGCCT ATCATCCCCA CGTTTAG
 
Protein sequence
MPLHNLTRFP RLEFIGAPTP LEYLPRFSDY LGREIFIKRD DVTPMAMGGN KLRKLEFLAA 
DALREGADTL ITAGAIQSNH VRQTAAVAAK LGLHCVALLE NPIGTTAENY LTNGNRLLLN
LFNTQIEMCD ALTDPNAQLE ELATRVEAQG FRPYVIPVGG SNALGALGYV ESALEIAQQC
EGAVNISSVV VASGSAGTHA GLAVGLEHLM PESELIGVTV SRSVADQLPK VVNLQQAIAK
ELELTASAEI LLWDDYFAPG YGVPNDEGME AVKLLARLEG ILLDPVYTGK AMAGLIDGIS
QKRFKDEGPI LFIHTGGAPA LFAYHPHV
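
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which start codons are permitted, and this gene starts with ATG). Only the first and last sequence blocks from the record are used as examples.

```python
import itertools

# Codon-to-amino-acid assignments shared by the standard code and table 11.
# Codons are enumerated in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)

# First and last blocks of the gene sequence, copied from the record.
first_block = "ATGCCACTGC ATAATTTAAC CCGTTTTCCA CGGCTGGAGT TTATCGGCGC GCCAACGCCG"
last_block = "CTGTTCGCCT ATCATCCCCA CGTTTAG"

print(translate(first_block))  # prints: MPLHNLTRFPRLEFIGAPTP
print(translate(last_block))   # prints: LFAYHPHV
```

Both outputs match the corresponding ends of the protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.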