Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2273 |
Symbol | |
ID | 8416597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2671235 |
End bp | 2672563 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645025259 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003182622 |
Protein GI | 257792016 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCC CAACCACATG GCCGGAAGGC GACGTCCGGG AGCAACGCCG CACGCGAACG GCCGACCCGG GCGTCCCCGG GAGCGTCCCC GAAAGCAACC CCCTCGCAAT CGACATAGCC GCGAACCCCT ACAAGCAGGA CTTCCCCCTG CTGGCGGCCA ACCCCGGCCT CGCCTTCCTC GACAGCGCAG CCACGGCCCA GCGCCCTGCC GTCGCCCTCG ACGCGCAGCG CCGCTTCTAC GAGAAGATGA ACGCGAACGC CCTGCGCGGC CTTTATCGCC TGTCGGTGGA CGCCACCGAG GCCATCGACG AGGCCCGCGC CCACGTCGCG CGCTTCATCG GCGCCGCCGA TGCGCGCGAG GTCGTGTTCT GCCGCAACGC CAGCGAGGCC CTGAACCTCG TGGCGAAAGC GTTCGCTCCC ACCGTGCTGG AGCCCGGCGA CGAGGTATGC ATCACCATCA TGGAGCACCA CTCGAACCTC ATCCCCTGGC AGCAGGCGTG CCGCGCGGCG GGCGCGCGCC TCGTGTACCT GTTCCCCGAC GAGGACGGCG TAATCGGCGA GGAGGAGCTG GACGCGAAGA TCGGACCGCG CACCAAGATC GTCGCGGCCG CCCACGTGTC GAACGTCCTC GGCATCGAGA ACCCCATCGA GGCCATGGCC GAGCGCGTGC ATGCGCACGA CGGCTTCATG GTGGTTGACG GCGCGCAATC GGTGCCGCAC CTGCCCGTCG ACGTGCGGAA GCTCGGCTGT GACTTCTTCG CGTTCTCAGC GCACAAGGCG CTGGGGCCCT TCGGCGTGGG CGTGCTGTGG GGCAAGCTCG ACCTGCTGGA GGCCATGCCG CCGTTCCTCA CGGGCGGCGA GATGATCTCG TCGGTCACGC AGGAAGGGGC CGTGTGGGCG CCCGTGCCCG AGAAGTTCGA GGCCGGCACG CAGGACGCCG CCGGCATCGT GGCGACGGCC GCCGCGCTGG GCTACCTCGA GGGCATCGGC TGGGACGCGT TGCAGGCGCG CGAGCAAGCG CTCGTGCGCG CCGCCATGGA ACGTCTGGCG GCGCTGCCCT ACATCCGCAT CATCGGCCAC CCCGACCCGG CGCAGCACCA CGGCGCCATC AGCTTCGAGG TGGACGGCAT CCACCCGCAC GACGTGGCCA GCATCCTCGA CGAGCACGAC GTGGCCATCC GCGCCGGGCA CCATTGCGCC CAGCCGCTGC TGGCGTGGCA GGGCGTGGAG TCGTGCTGCC GCGCGTCGCT GGCGTTCTAC AACGACGAGG GCGACATCGA CGCGCTCGTC GACGGCCTGG ACGGCGTTTG GAGGACCTTC AATGGCTAG
|
Protein sequence | MSTPTTWPEG DVREQRRTRT ADPGVPGSVP ESNPLAIDIA ANPYKQDFPL LAANPGLAFL DSAATAQRPA VALDAQRRFY EKMNANALRG LYRLSVDATE AIDEARAHVA RFIGAADARE VVFCRNASEA LNLVAKAFAP TVLEPGDEVC ITIMEHHSNL IPWQQACRAA GARLVYLFPD EDGVIGEEEL DAKIGPRTKI VAAAHVSNVL GIENPIEAMA ERVHAHDGFM VVDGAQSVPH LPVDVRKLGC DFFAFSAHKA LGPFGVGVLW GKLDLLEAMP PFLTGGEMIS SVTQEGAVWA PVPEKFEAGT QDAAGIVATA AALGYLEGIG WDALQAREQA LVRAAMERLA ALPYIRIIGH PDPAQHHGAI SFEVDGIHPH DVASILDEHD VAIRAGHHCA QPLLAWQGVE SCCRASLAFY NDEGDIDALV DGLDGVWRTF NG
|
| |