Gene Information |
Locus tag | Elen_2966 |
Symbol | |
ID | 8417298 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3440924 |
End bp | 3442243 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645025943 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003183298 |
Protein GI | 257792692 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
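The coordinate and length fields in the Gene Information table are related by simple arithmetic. The sketch below checks that relationship. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the last codon of the CDS is a stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3440924
END_BP = 3442243


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1320, matching "Gene Length"
print(protein_length(length_bp))  # 439, matching "Protein Length"
```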
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATAG CACTCAGTTT GCTGCTCGTT CTGTTTTTCC TGCTCATGAA TGCGTTCTTC GTCGCCGCCG AGTTCTCGCT CGTGCGCGTA CGCAAGTCGC AAGTCGAAAT CCTCGTGGAC GAGGGTCGCA AGGGCGCCAA GTACACCAAG CTCGTCGCCG ACAACGTCAA CGCCTACCTG TCAGCCTGCC AGCTGGGCAT CACCCTTGCC TCGCTCGCCC TCGGCTGGTT GGGCGAGCCT GCGGTGTCAG CCCTGTTCGA ACCGCTGTTC AAGGCACTCA ACGTGCCCGA GGCGGCCACG CACGGCATCT CCATCGTCAT CGGTTTCGTC ATCATCACCG CGTTGCACAT CGTGGTGGGC GAGCTCATCC CGAAGTCGCT GGCCATCTTC TCCACCGAGC GCTACGCCCT GTTCACGGCC ACGCCGCTCG TGTGGTTCTA CCGCATCACG TACCCCGTCA TGTGGCTGTT CAACAGCATT ACGAACGGCG TCATGAAGAT GCTGGGCCAC GACGTGGCCA ACGAACACGA GGTGTACACC GACGAGGAGA TCAAGCTGCT CATCGACGAG AGCACCGAAA GCGGACTCAT CGACCCCGAG CAGAACGAAT ACGTGGACAA CATCTTCGAC CTGGGCGACA AAGACGCCGA GGCCATCATG ACGCCGCGCA CCGATGTGGT GTGCATCGAC CTCGACGATC CGCTGGAGGA GAGTCTCCAG ACCGTGTTGC AGTACAAGTA CACGCGCTAC CCGGTGTGCC GCGGCAGCAA GGACCGCATC GTCGGCTTCG TGCACGTGAA GGACCTCTAC ACGATGCCCA AGGACGCGAC GGTCGACGAC CTGCGCGTTC GCATGATCCA GGCCGTACCC GAAGGCGTGC CCATCGCGAA GTTGCTGCAA ACGCTGCAGG AGAAGCGCAC GAAAATCGCC GTGGTCATCG ACGAGCATGG CGGCACGGCC GGCATCGTTA CGATGAGCGA CATCATGGAG CAGATCGTCG GCCGCATCGA CGACGAGTAC GCGCATGGCG GCTCGGACGA GATCGTGCAG TTGGACGATG GCAGCTACCT CATCGACGGC TCGCTTCCCA TCGACGAGGT GGGCGAGCTC ATCGGTTTCG AGCCTCTCGA GTCCGAGGAA TGCGAGACGG CGGGCGGCCT GCTGCTCACC GTGTTCGACC GTATCCCCGA CGAGGGCGAT TCCGTGACCA TCGAGGACGG CGACGACAGG GCCACGTTCA CCGTAGTCGA CATGGACCGC CACCGCATCG ACAAGATTCG GGTGGTGCTC GAGCACGCTC CGGAAAGCGA CGAAAGCTAA
|
Protein sequence | MPIALSLLLV LFFLLMNAFF VAAEFSLVRV RKSQVEILVD EGRKGAKYTK LVADNVNAYL SACQLGITLA SLALGWLGEP AVSALFEPLF KALNVPEAAT HGISIVIGFV IITALHIVVG ELIPKSLAIF STERYALFTA TPLVWFYRIT YPVMWLFNSI TNGVMKMLGH DVANEHEVYT DEEIKLLIDE STESGLIDPE QNEYVDNIFD LGDKDAEAIM TPRTDVVCID LDDPLEESLQ TVLQYKYTRY PVCRGSKDRI VGFVHVKDLY TMPKDATVDD LRVRMIQAVP EGVPIAKLLQ TLQEKRTKIA VVIDEHGGTA GIVTMSDIME QIVGRIDDEY AHGGSDEIVQ LDDGSYLIDG SLPIDEVGEL IGFEPLESEE CETAGGLLLT VFDRIPDEGD SVTIEDGDDR ATFTVVDMDR HRIDKIRVVL EHAPESDES