Gene EcE24377A_2815 details


Gene Information

Locus tag: EcE24377A_2815
Symbol: iscS
ID: 5586952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 2810390
End bp: 2811604
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 53%
IMG OID: 640926466
Product: cysteine desulfurase
Protein accession: YP_001463853
Protein GI: 157155974
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR02006] cysteine desulfurase IscS; [TIGR03402] cysteine desulfurase NifS
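
The Start bp, End bp, Gene Length and Protein Length values in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The record does not state either convention, so treat both as assumptions; the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2810390
END_BP = 2811604
STOP_CODON_LENGTH = 3  # assumption: the CDS ends with exactly one stop codon


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Amino acids encoded by a CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length - STOP_CODON_LENGTH) // 3


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1215 404
```

Under these assumptions the computed values agree with the listed Gene Length (1215 bp) and Protein Length (404 aa).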


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.00976282
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTAC CGATTTATCT CGACTACTCC GCAACCACGC CGGTGGACCC GCGTGTTGCC 
GAGAAAATGA TGCAGTTTAT GACGATGGAC GGAACCTTTG GTAACCCGGC CTCCCGTTCT
CACCGTTTCG GCTGGCAGGC TGAAGAAGCG GTAGATATCG CCCGTAATCA GATTGCCGAT
CTGGTCGGCG CTGATCCGCG TGAAATCGTC TTTACCTCTG GTGCAACCGA ATCTGACAAC
CTGGCGATCA AAGGTGCAGC CAACTTTTAT CAGAAAAAAG GCAAGCACAT CATCACCAGC
AAAACCGAAC ACAAAGCGGT ACTGGATACC TGCCGTCAGC TGGAGCGCGA AGGTTTTGAA
GTCACCTACC TGGCACCGCA GCGTAACGGC ATTATCGACC TGAAAGAACT TGAAGCAGCG
ATGCGTGACG ACACCATCCT CGTGTCCATC ATGCACGTAA ATAACGAAAT CGGCGTGGTG
CAGGATATCG CGGCTATCGG CGAAATGTGC CGTGCTCGTG GCATTATCTA TCACGTTGAT
GCAACCCAGA GCGTGGGTAA ACTGCCTATC GACCTGAGCC AGTTGAAAGT TGACCTGATG
TCTTTCTCCG GTCACAAAAT CTATGGCCCG AAAGGTATCG GTGCGCTGTA TGTGCGTCGT
AAACCGCGCG TACGCATCGA AGCGCAAATG CACGGCGGCG GTCACGAACG CGGTATGCGT
TCCGGCACTC TGCCTGTTCA CCAGATCGTC GGCATGGGCG AAGCCTATCG CATCGCAAAA
GAAGAGATGG CGACCGAGAT GGAACGTCTG CGCGGCCTGC GTAACCGTCT ATGGAACGGC
ATCAAAGATA TCGAAGAAGT TTACCTGAAC GGTGACCTGG AACACGGTGC GCCGAACATT
CTCAACGTCA GCTTCAACTA CGTTGAAGGT GAGTCGCTGA TTATGGCGCT GAAAGACCTC
GCAGTTTCTT CAGGTTCCGC CTGTACGTCA GCAAGCCTCG AACCGTCCTA CGTGCTGCGC
GCGCTGGGGC TGAACGACGA GCTGGCACAT AGCTCTATCC GTTTCTCTTT AGGTCGTTTT
ACTACTGAAG AAGAGATCGA CTACACCATC GAGTTAGTTC GTAAATCCAT CGGTCGTCTG
CGTGACCTTT CTCCGCTGTG GGAAATGTAC AAGCAGGGCG TGGATCTGAA CAGCATCGAA
TGGGCTCATC ATTAA
 
Protein sequence
MKLPIYLDYS ATTPVDPRVA EKMMQFMTMD GTFGNPASRS HRFGWQAEEA VDIARNQIAD 
LVGADPREIV FTSGATESDN LAIKGAANFY QKKGKHIITS KTEHKAVLDT CRQLEREGFE
VTYLAPQRNG IIDLKELEAA MRDDTILVSI MHVNNEIGVV QDIAAIGEMC RARGIIYHVD
ATQSVGKLPI DLSQLKVDLM SFSGHKIYGP KGIGALYVRR KPRVRIEAQM HGGGHERGMR
SGTLPVHQIV GMGEAYRIAK EEMATEMERL RGLRNRLWNG IKDIEEVYLN GDLEHGAPNI
LNVSFNYVEG ESLIMALKDL AVSSGSACTS ASLEPSYVLR ALGLNDELAH SSIRFSLGRF
TTEEEIDYTI ELVRKSIGRL RDLSPLWEMY KQGVDLNSIE WAHH
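
The sequences above are printed in blocks of ten characters separated by spaces and line breaks, so they need to be joined before any further processing. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool referenced by this record. Only the first and last printed lines of the gene sequence are used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First and last printed lines of the gene sequence, copied from the record.
first_line = "ATGAAATTAC CGATTTATCT CGACTACTCC GCAACCACGC CGGTGGACCC GCGTGTTGCC"
last_line = "TGGGCTCATC ATTAA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first), len(last))                       # prints: 60 15
print(first.startswith("ATG"), last.endswith("TAA"))  # prints: True True
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record.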