Gene EcSMS35_2951 details


Gene Information

Locus tag: EcSMS35_2951
Symbol: csdA
ID: 6146569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3026637
End bp: 3027842
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 56%
IMG OID: 641617820
Product: cysteine sulfinate desulfinase
Protein accession: YP_001744975
Protein GI: 170680672
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
            [TIGR03392] cysteine desulfurase, catalytic subunit CsdA
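
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3026637
end_bp = 3027842

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1206
print(protein_length_aa)  # 401
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.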


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00101745
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTTT TTAATCCCGC GCAGTTTCGC GCCCAGTTTC CCGCGCTACA GGATGCGGGC 
GTCTATCTCG ACAGCGCCGC GACCGCGCTT AAACCTGAAG CCGTGGTTGA AGCCACCCGA
CAGTTTTATA GCCTGAGCGC CGGAAACGTC CATCGCAGCC AGTTTGCCGA AGCCCAACGC
CTGACCGCGC GTTACGAAGC TGCGCGGGAA AAAGTGGCAC AATTACTGAA TGCACCGGAT
GATAAAACTA TCGTCTGGAC GCGCGGCACC ACTGAATCCA TCAACATGGT GGCACAATGC
TATGCGCGTC CGCGTCTGCA ACCGGGCGAT GAAATTATTG TCAGTGTGGC AGAACACCAC
GCCAACCTCG TCCCCTGGCT GATGGTCGCC CAACAAACGG GGGCCAAAGT GGTGAAATTG
CCGCTTAATG CGCAGCGACT GCCGGATGTC GATTTGTTGC CAGAACTGAT TACTCCCCGT
AGTCGGATTC TGGCGTTGGG CCAGATGTCG AACGTCACTG GCGGTTGCCC GGATCTGGCG
AGAGCGATTA CTTTTGCTCA TTCAGCCGGG ATGGTGGTGA TGGTTGATGG TGCTCAGGGG
GCGGTGCATT TCCCCGCGGA TGTTCAGCAA CTGGATATTG ATTTCTATGC TTTTTCAGGT
CACAAACTGT ATGGCCCGAC AGGCATCGGC GTGCTGTATG GCAAACCAGA ACTGCTGGAA
GCGATGTCGC CCTGGCTGGG CGGCGGCAAA ATGGTTCACG AAGTGAGTTT TGACGGCTTC
ACGACTCAAT CTGCGCCGTG GAAACTGGAA GCAGGAACGC CAAATGTCGC TGGCGTCATA
GGATTAAGCG CGGCGCTGGA ATGGCTGACA GATTACGATA TCAACCAGGC CGAAAGCTGG
AGCCGTAGCT TAGCAACGCT TGCAGAAGAA GCGCTGGCGA AACGTCCAGG CTTTCGTTCA
TTCCGCTGCC AGGATTCCAG CCTGCTGGCC TTTGATTTTG CTGGAGTTCA TCACAGCGAT
ATGGTGACAC TGCTGGCGGA GTACGGTATT GCCCTGCGGG CCGGGCAGCA TTGCGCTCAG
CCGCTACTGG CAGAATTAGG CGTAACCGGC ACACTGCGCG CCTCTTTTGC GCCATATAAT
ACAAAGAGTG ATGTGGATGC GCTGGTGAAT GCCGTTGACC GCGCGCTGGA ATTATTGGTG
GATTAA
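
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-percentage calculation; the function names `clean_sequence` and `gc_percent` are hypothetical helpers, and only the first row of the sequence is used as demo input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, including its block spacing.
first_row = "ATGAACGTTT TTAATCCCGC GCAGTTTCGC GCCCAGTTTC CCGCGCTACA GGATGCGGGC \n"
demo = clean_sequence(first_row)
print(len(demo), round(gc_percent(demo), 1))
```

Running the same two functions over all rows of the sequence would give values to compare against the Gene Length and GC content fields of the record.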
 
Protein sequence
MNVFNPAQFR AQFPALQDAG VYLDSAATAL KPEAVVEATR QFYSLSAGNV HRSQFAEAQR 
LTARYEAARE KVAQLLNAPD DKTIVWTRGT TESINMVAQC YARPRLQPGD EIIVSVAEHH
ANLVPWLMVA QQTGAKVVKL PLNAQRLPDV DLLPELITPR SRILALGQMS NVTGGCPDLA
RAITFAHSAG MVVMVDGAQG AVHFPADVQQ LDIDFYAFSG HKLYGPTGIG VLYGKPELLE
AMSPWLGGGK MVHEVSFDGF TTQSAPWKLE AGTPNVAGVI GLSAALEWLT DYDINQAESW
SRSLATLAEE ALAKRPGFRS FRCQDSSLLA FDFAGVHHSD MVTLLAEYGI ALRAGQHCAQ
PLLAELGVTG TLRASFAPYN TKSDVDALVN AVDRALELLV D
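
The protein sequence corresponds to the gene sequence translated with translation table 11. As an illustration, the sketch below builds the codon table from the compact amino-acid string used for that table (codon assignments are the same as in the standard code; the alternative start codons of table 11 are not handled here) and translates the first row of the gene sequence. The names `CODON_TABLE` and `translate` are assumptions made for this example.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGAACGTTT TTAATCCCGC GCAGTTTCGC GCCCAGTTTC CCGCGCTACA GGATGCGGGC"
print(translate(first_row))  # MNVFNPAQFRAQFPALQDAG
```

The output matches the first 20 residues of the protein sequence, and translating the final row (`GATTAA`) yields the terminal `D` followed by a stop codon.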