Gene ECH74115_4075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4075 
SymbolcsdA 
ID6971927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3770134 
End bp3771339 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content56% 
IMG OID643387833 
Productcysteine sulfinate desulfinase 
Protein accessionYP_002272276 
Protein GI209397968 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily
[TIGR03392] cysteine desulfurase, catalytic subunit CsdA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.023337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTTT TTAATCCCGC GCAGTTTCGC GCCCAGTTTC CCGCTCTACA GGATGCGGGC 
GTCTATCTCG ACAGCGCCGC GACCGCGCTT AAACCTGAAG CCGTGGTTGA AGCCACCCGA
CAGTTTTACA GTCTGAGCGC CGGAAACGTC CATCGCAGCC AGTTTGCCGA AGCCCAACGC
CTGACCGCGC GTTATGAAGC TGCACGAGAG AAAGTGGCGC AATTACTGAA TGCACCGGAT
GATAAAACTA TCGTCTGGAC GCGCGGCACC ACTGAATCCA TCAACATGGT GGCACAATGC
TATGCGCGTC CGCGTCTGCA ACCGGGCGAT GAGATTATTG TCAGCGTGGC AGAACACCAC
GCCAACCTCG TCCCCTGGCT GATGGTCGCC CAACAAACTG GAGCCAAAGT GGTCAAATTG
CCGCTTAATG CGCAGCGACT GCCAGATGTC GATTTGCTGC CGGAACTGAT TACTCCCCGT
AGTCGGATTC TGGCGTTGGG TCAGATGTCG AACGTAACTG GCGGTTGCCC GGATCTGGCG
CGAGCGATTA CCTTTGCTCA TTCAGCCGGG ATAGTGGTGA TAGTTGATGG TGCTCAGGGG
GCGGTGCATT TCCCCGCGGA TGTTCAGCAA CTGGATATCG ATTTCTATGC TTTCTCTGGT
CACAAACTGT ATGGCCCGAC GGGTATCGGC GTGCTGTATG GCAAATCAGA ACTGCTGGAA
GCGATGTCGC CCTGGCTGGG CGGCGGCAAA ATGGTTCACG AAGTGAGTTT TGACGGCTTC
ACGACTCAAT CTGCGCCGTG GAAACTGGAA GCAGGAACGC CAAATGTGGC TGGTGTCATA
GGATTAAGCG CGGCGCTGGA ATGGCTGGCA GATTACGATA TCAACCAGGC CGAAAACTGG
AGCCGTAGCT TAGCAACGCT GGCGGAAGAT GCGCTGGCGA AACGTCCAGG CTTTCGTTCA
TTCCGCTGCC AGGATTCCAG CCTGCTGGCC TTTGATTTTG CTGGCGTTCA TCACAGCGAT
ATGGTGACGC TGCTGGCGGA GTACGGTATT GCCTTGCGGG CCGGGCAACA TTGCGCTCAG
CCGCTACTGG CAGAATTAGG CGTGACCGGC ACACTGCGCG CCTCTTTTGC GCCATATAAT
ACAAAGAGTG ATGTGGATGC GCTGGTGAAT GCCGTTGACC GCGCGCTGGA ATTATTGGTG
GATTAA
 
Protein sequence
MNVFNPAQFR AQFPALQDAG VYLDSAATAL KPEAVVEATR QFYSLSAGNV HRSQFAEAQR 
LTARYEAARE KVAQLLNAPD DKTIVWTRGT TESINMVAQC YARPRLQPGD EIIVSVAEHH
ANLVPWLMVA QQTGAKVVKL PLNAQRLPDV DLLPELITPR SRILALGQMS NVTGGCPDLA
RAITFAHSAG IVVIVDGAQG AVHFPADVQQ LDIDFYAFSG HKLYGPTGIG VLYGKSELLE
AMSPWLGGGK MVHEVSFDGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLA DYDINQAENW
SRSLATLAED ALAKRPGFRS FRCQDSSLLA FDFAGVHHSD MVTLLAEYGI ALRAGQHCAQ
PLLAELGVTG TLRASFAPYN TKSDVDALVN AVDRALELLV D