Gene Elen_2273 details

Gene Information

Locus tag: Elen_2273
Symbol:
ID: 8416597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 2671235
End bp: 2672563
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 70%
IMG OID: 645025259
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_003182622
Protein GI: 257792016
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
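
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2671235
end_bp = 2672563
gene_length_bp = 1329
protein_length_aa = 442

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)  # 1329 443 0 442
```

Under these assumptions the span matches the listed gene length, and 443 codons minus one stop codon gives the listed 442 aa.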


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCC CAACCACATG GCCGGAAGGC GACGTCCGGG AGCAACGCCG CACGCGAACG 
GCCGACCCGG GCGTCCCCGG GAGCGTCCCC GAAAGCAACC CCCTCGCAAT CGACATAGCC
GCGAACCCCT ACAAGCAGGA CTTCCCCCTG CTGGCGGCCA ACCCCGGCCT CGCCTTCCTC
GACAGCGCAG CCACGGCCCA GCGCCCTGCC GTCGCCCTCG ACGCGCAGCG CCGCTTCTAC
GAGAAGATGA ACGCGAACGC CCTGCGCGGC CTTTATCGCC TGTCGGTGGA CGCCACCGAG
GCCATCGACG AGGCCCGCGC CCACGTCGCG CGCTTCATCG GCGCCGCCGA TGCGCGCGAG
GTCGTGTTCT GCCGCAACGC CAGCGAGGCC CTGAACCTCG TGGCGAAAGC GTTCGCTCCC
ACCGTGCTGG AGCCCGGCGA CGAGGTATGC ATCACCATCA TGGAGCACCA CTCGAACCTC
ATCCCCTGGC AGCAGGCGTG CCGCGCGGCG GGCGCGCGCC TCGTGTACCT GTTCCCCGAC
GAGGACGGCG TAATCGGCGA GGAGGAGCTG GACGCGAAGA TCGGACCGCG CACCAAGATC
GTCGCGGCCG CCCACGTGTC GAACGTCCTC GGCATCGAGA ACCCCATCGA GGCCATGGCC
GAGCGCGTGC ATGCGCACGA CGGCTTCATG GTGGTTGACG GCGCGCAATC GGTGCCGCAC
CTGCCCGTCG ACGTGCGGAA GCTCGGCTGT GACTTCTTCG CGTTCTCAGC GCACAAGGCG
CTGGGGCCCT TCGGCGTGGG CGTGCTGTGG GGCAAGCTCG ACCTGCTGGA GGCCATGCCG
CCGTTCCTCA CGGGCGGCGA GATGATCTCG TCGGTCACGC AGGAAGGGGC CGTGTGGGCG
CCCGTGCCCG AGAAGTTCGA GGCCGGCACG CAGGACGCCG CCGGCATCGT GGCGACGGCC
GCCGCGCTGG GCTACCTCGA GGGCATCGGC TGGGACGCGT TGCAGGCGCG CGAGCAAGCG
CTCGTGCGCG CCGCCATGGA ACGTCTGGCG GCGCTGCCCT ACATCCGCAT CATCGGCCAC
CCCGACCCGG CGCAGCACCA CGGCGCCATC AGCTTCGAGG TGGACGGCAT CCACCCGCAC
GACGTGGCCA GCATCCTCGA CGAGCACGAC GTGGCCATCC GCGCCGGGCA CCATTGCGCC
CAGCCGCTGC TGGCGTGGCA GGGCGTGGAG TCGTGCTGCC GCGCGTCGCT GGCGTTCTAC
AACGACGAGG GCGACATCGA CGCGCTCGTC GACGGCCTGG ACGGCGTTTG GAGGACCTTC
AATGGCTAG
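
A GC content figure like the one in the Gene Information table can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases. `gc_content` is a hypothetical helper, not part of any database tooling, and the input shown is a toy string rather than the gene itself.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 75.0
```

Pasting the full gene sequence into the call gives a value to compare with the listed figure.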
 
Protein sequence
MSTPTTWPEG DVREQRRTRT ADPGVPGSVP ESNPLAIDIA ANPYKQDFPL LAANPGLAFL 
DSAATAQRPA VALDAQRRFY EKMNANALRG LYRLSVDATE AIDEARAHVA RFIGAADARE
VVFCRNASEA LNLVAKAFAP TVLEPGDEVC ITIMEHHSNL IPWQQACRAA GARLVYLFPD
EDGVIGEEEL DAKIGPRTKI VAAAHVSNVL GIENPIEAMA ERVHAHDGFM VVDGAQSVPH
LPVDVRKLGC DFFAFSAHKA LGPFGVGVLW GKLDLLEAMP PFLTGGEMIS SVTQEGAVWA
PVPEKFEAGT QDAAGIVATA AALGYLEGIG WDALQAREQA LVRAAMERLA ALPYIRIIGH
PDPAQHHGAI SFEVDGIHPH DVASILDEHD VAIRAGHHCA QPLLAWQGVE SCCRASLAFY
NDEGDIDALV DGLDGVWRTF NG
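
The protein sequence can be related back to the gene sequence by translating codon by codon. The sketch below builds the codon table by hand. It relies on translation table 11 assigning the same amino acids to codons as the standard code, and it does not handle alternative start codons, which does not matter here because the gene starts with ATG. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codons in TCAG order (first base varies slowest), paired with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as listed above.
print(translate("ATGAGCACCC CAACCACATG GCCGGAAGGC"))  # MSTPTTWPEG
```

The output matches the first ten residues of the listed protein sequence. Translating the last nine bases, `AATGGCTAG`, gives `NG`, which matches the end of the protein.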