Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2403 |
Symbol | |
ID | 8416727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2822021 |
End bp | 2823241 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025387 |
Product | Polysulphide reductase NrfD |
Protein accession | YP_003182750 |
Protein GI | 257792144 |
COG category | [C] Energy production and conversion |
COG ID | [COG5557] Polysulphide reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.418724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.657025 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCT CGACAACGTT CAAAGCGACC GCGGGCGTGC TCGCAGCGCT GACCGTGGCC GGCGTCGCCG CCTGGATCTA CCAGCTGGTC AACGGCTTGG GCGTCACCGG CATGAGCAAC TCCACGTCGT GGGGCCTCTA CATCACGTGC TTCATGTTCT TCGTGGGCCT GTCGGCCGGC GGCCTCATCG TGGCTTCGTC GGCCAGCGTG TTCCATGTGG CCGAGTACAA GAAGGTGGCG CTTCCCGCTG TCGTCCTGTC CACGGTGTGC ATCTGCTGCG CCGGCATGTT CGTGCTCATC GACCTTGGCG GCATCGGGCG CGTGTGGCGC ATCCTCACGG GGCCGAACCC GATGTCTCCT CTGTTCTGGG ATATCTGCGT CATCACGTTG TACCTGGTCA TAAACGTCGT GTACCTGTAC TTCATGAAGT CGAAAAAGCC GGGCGCGCAG GGCAAGGTGG CCGTCGTGTC GCGCTTCGCC CTGCCTGTCG CCATCCTCGT GCACTCGGTG ACGGCGTGGA TCTTCGGCCT GGAAATGGCG CGCGAAGGCT GGTACTCGGC AATCATGGCG CCGCTGTTCG TGGTATCGGC CATGGACTCC GGCCTGGCCC TGCTGCTGCT CTCGCTCATG GGTCTCAACA AGTCCGGCCG TTTCGCCACC GACAAGAAGC TGCTGTCGAA CCTGGCCGGC CTGCTGGCCG TGTGCGTCGC TGTCGACGGG TTCCTCGTGG GCTGCGAGGC GCTGACTATG GCGTACCCGG GCGCCGCCGG CGCCGAGACG CTCGCCATCA TGGCGACGGG CGCAACCGCC CCGTTCTTCT GGTTCGAGAT CGTCGTGGGC ATCCTCATCC CGTTCTGCAT CCTCGTGTTC GCGAAGAACC GCGCGCGGAT GGGCTTGGTG GCCGTGGCCA GCGTGTGCGT GGTGACCGGC GTGTTCTTCA AGCGCGTGTG GCTGCTGCTC ACCTCCTTCG TCGGGTTCAA CGTGGCGGGC GCGCCGGGCG TCTCGCTTGG CACCGCGGCG GCCCAGCAGG GCGGCTCGAG CATGTGGGCG CTCACGGGGA CGTACGCTCC CACCTGGGTG GAGATCGTCG TGGTCATCGG CGTGGTGTCC CTCGGCGCGC TCGCGTTCCT CGTTCTCGCG CAGAAGCTGC TGCCGGGACG CGCGGCGCCG CGCGATGCCG AGGCCGCTGC CGTCGAGCGC CCGGCGGGCG AGGCGGCTTA G
|
Protein sequence | MKTSTTFKAT AGVLAALTVA GVAAWIYQLV NGLGVTGMSN STSWGLYITC FMFFVGLSAG GLIVASSASV FHVAEYKKVA LPAVVLSTVC ICCAGMFVLI DLGGIGRVWR ILTGPNPMSP LFWDICVITL YLVINVVYLY FMKSKKPGAQ GKVAVVSRFA LPVAILVHSV TAWIFGLEMA REGWYSAIMA PLFVVSAMDS GLALLLLSLM GLNKSGRFAT DKKLLSNLAG LLAVCVAVDG FLVGCEALTM AYPGAAGAET LAIMATGATA PFFWFEIVVG ILIPFCILVF AKNRARMGLV AVASVCVVTG VFFKRVWLLL TSFVGFNVAG APGVSLGTAA AQQGGSSMWA LTGTYAPTWV EIVVVIGVVS LGALAFLVLA QKLLPGRAAP RDAEAAAVER PAGEAA
|
| |