Gene Information |
Locus tag | Elen_2091 |
Symbol | |
ID | 8416409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2458887 |
End bp | 2460146 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645025074 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003182443 |
Protein GI | 257791837 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
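The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions agree with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information rows above.
START_BP = 2458887
END_BP = 2460146
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```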
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000000201738 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGACG GACGGGCGTG CGCGCCGCTG GAGGACTTCC CCCTCCTCGC CGCGCACGCC GCGGGCGGCC CTCCGCTCGC CTACCTCGAC AACGCGGCAA CCACGCAGAA GCCGTCGTGC GTCCTCGACG CCATGGATGA TTTCTACCGC ACGGCATGCG GAAACCCGCA CCGTTCCGCG CACGCGCTGG CCGCTGCGGC CACCCGGACC TACGAGGACG CGCGCTCGAC GGTCGCGCGC TTCATCGGCG CGTCGCCGGA GGAGACGGCG TTCACGAGCG GGGCCACGCA CGCCCTGAAC ACCGTGGCGC TCTGTTATTG CGCCGAGCGG CTGGAGCCGG GCGACGAGAT CGCGCTCACG CTGCTGGAGC ATCACAGCAA CCTCGTGCCG TGGCAGACCG CGGCGCGGCT TTCCGGGGCG AGGCTCTCCT TCATCGTGCC CGACCGCGAC GGGGTCGTCT CAGACGACGA GATCGACCGG GCAATCGGCA CGCGCACCCG CGTGGTGGCG TTCACCGGCA TGTCGAACGT GCTGGGAACC GTGCCGCCCG TCAAGCGCAT CATCGAAGCG GCGCACGCCT GCGGCGCCGT CGCCGTGCTC GACTGCGCGC AGAGCATCGC GCACGAGCCC CTCGACGTGC ACGACCTCGA CGTCGACTTC GCCGCGTTCT CGGGCCACAA GCTGTACGGC CCTATGGGCA TCGGCGTGCT GTACGGCAAG CGGCGCCTCC TCGAGGAAAC GCCGCCCTTG CTGCGAGGCG GGGGCATGGT GGAGGCGGTG TTCGAGCGCG CGTCCTCGTT CGACGGAACG CCGGGGCGCT TCGAAGCGGG CACGCAGAAC GTGGCAGGAG CCGTGGGGCT GGCAGAAGCC GTGCGCTATC TCGACCGCAT CGGCTTCGAC GCCGTGCGCG CGCACGAGCG CGAGCTGACC CGCGCCCTGG TGAGCGGCCT GGACTCCATC CCCTCCGTCA GGCTCTACGG CCCCGGGCCG AACGCCGAGA CGCCGCGCGG CGGCATCGTC TCGTTCAACG TGAAGGGCGT GGGCGCAGCC GAGGTCGCCC ACGTGCTGGA CCGCCGCGGA GTGGCAGTGC GCGCCGGCGC CCACTGCGCC CAGCCCCTCC TGCGCCACAT CGGCGCGGAA GCCGTCTGCC GCGCCAGCAT AGCCGCCTAC ACCACCATGC ACGACATCGA CCGCCTCCTA GAAGCCGTGG AGTCGTCCCG CAACGAAGCC GTAGCCCTGG CAACATCGCG CATGCTGTGA
|
Protein sequence | MADGRACAPL EDFPLLAAHA AGGPPLAYLD NAATTQKPSC VLDAMDDFYR TACGNPHRSA HALAAAATRT YEDARSTVAR FIGASPEETA FTSGATHALN TVALCYCAER LEPGDEIALT LLEHHSNLVP WQTAARLSGA RLSFIVPDRD GVVSDDEIDR AIGTRTRVVA FTGMSNVLGT VPPVKRIIEA AHACGAVAVL DCAQSIAHEP LDVHDLDVDF AAFSGHKLYG PMGIGVLYGK RRLLEETPPL LRGGGMVEAV FERASSFDGT PGRFEAGTQN VAGAVGLAEA VRYLDRIGFD AVRAHERELT RALVSGLDSI PSVRLYGPGP NAETPRGGIV SFNVKGVGAA EVAHVLDRRG VAVRAGAHCA QPLLRHIGAE AVCRASIAAY TTMHDIDRLL EAVESSRNEA VALATSRML
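Both sequences are printed in space-separated blocks of 10 residues. As a minimal sketch, assuming the spaces are display formatting only, the helpers below strip them and compute a GC percentage. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-residue blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
first_blocks = clean_sequence("ATGGCGGACG GACGGGCGTG")
print(len(first_blocks), gc_percent(first_blocks))
```

Applied to the full gene sequence, the same helpers give a value that can be compared with the GC content and Gene Length rows.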