Gene Information |
Locus tag | Elen_0247 |
Symbol | |
ID | 8414531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 336788 |
End bp | 338128 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645023225 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003180628 |
Protein GI | 257790022 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
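The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record for Elen_0247.
length_bp = gene_length(336788, 338128)
print(length_bp)                  # 1341
print(protein_length(length_bp))  # 446
```

Under these assumptions, 338128 - 336788 + 1 gives 1341 bp, and 1341 / 3 = 447 codons, one of which is the stop codon, leaving 446 aa.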
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTT GGATTAGCAT CGTCGTCACG TTCGTGCTGG TGCTGGTGAA CGGCTATTTC TCGATGTCGG AGATGGCGTT GGTGAACGCG CGCCACGTAC TGCTGCAGCA CGATGCCGAC GAGGGCGATA AAAGCGCCCA ACGCGCGCTG GGTCTGGCCG CCGATTCGGG GCAGTTCCTG GCCACCATCC AGGTGGCCAT CACGCTCGTC GGGTTCTTCG CCTCCGCGGC TGCCGCCACG AACCTCTCCG ATCCGCTGGC GCAGTGGCTG TCCGGCTTCA ACATCGGGTG GCTTTCCGTT ATCGCGCCCG GTTTGGCCCC CGTGGTCATC ACGCTCATCG TGTCATACCT CAGCATCGTG GTGGGCGAGC TGGTGCCGAA GCGCATCGCG CTGGCCGATG CCGAGCGCGT CAGCAAGATG GTGGCCGGAC CGCTCATGGT GTTCCAGAAA ATCGCTTCGC CTTTGGTGGC GTTGACCTCG GCGTCCGCGA ACGGGCTGTC GCGCCTGTTC GGCATCAAGA ACGCCGACGA GCGCCAGAAC GTGTCCGAAG AAGAGATCAA GTACATGGTC ACGGACAACG ACGAGCTGCT CGAGGACGAG AAGCGCATGA TCCACGACAT CCTCGATTTG GGCGACATGA CCGTGCACGA GATCATGACG CCGCGCGTGG ACGTGATGTT CGCGGAAGAC ACCGACACGG TGCGCCAGAC GGTGGAGCGC ATGCGCGGCA CGGGCTACTC GCGTCTGCCG GTGTATCACG AGGACATCGA CCGCATCGTG GGCATCGTCC ACTTCAAGGA CCTCGTGGCG CCGCTCATGG ACGGCAAGGA GCACGAGCCG GTGGCCGAGT ACGCCTACGA GGCCATGTTC GTGCCCGAGA CGAAGGATCT GTTCCCGCTG CTCGCCGAGA TGCAAACGAA TCGTCAACAG ATGGCTATCG TCGTTGACGA GTACGGTGGC ACCGATGGTT TAATTACCGT TGAGGACATC GTAGAGGAGG TCGTCGGCGA GATCGTGGAC GAGACGGATC GAGAGAATCC GTTCATCGAG CAGGAAAGCG AGAACGTCTG GGTGGTCGAC GGGCGATTCC CCGTCGAAGA TGCCGCAGAG CTTGGATGGC CGGTGGAGGA TTCGGCCGAC TACGAGACCA TCGCGGGCTG GCTCATGAGC ATGCTCGACT CGGTGCCCCA GGTGGGCGAG GAACTTGCGT TCGACGGATA CCGCTTCAAG ATTCAGGCTA TGCGCCGCCG TCGCATTTCG ACGGTGCGCG TGGAACGACT GGACGATCCC TCCCCATCAT GCGTGGACGC TGTCGAGGCG ATCGACCGGG AGGAAGCGTG A
|
Protein sequence | MDIWISIVVT FVLVLVNGYF SMSEMALVNA RHVLLQHDAD EGDKSAQRAL GLAADSGQFL ATIQVAITLV GFFASAAAAT NLSDPLAQWL SGFNIGWLSV IAPGLAPVVI TLIVSYLSIV VGELVPKRIA LADAERVSKM VAGPLMVFQK IASPLVALTS ASANGLSRLF GIKNADERQN VSEEEIKYMV TDNDELLEDE KRMIHDILDL GDMTVHEIMT PRVDVMFAED TDTVRQTVER MRGTGYSRLP VYHEDIDRIV GIVHFKDLVA PLMDGKEHEP VAEYAYEAMF VPETKDLFPL LAEMQTNRQQ MAIVVDEYGG TDGLITVEDI VEEVVGEIVD ETDRENPFIE QESENVWVVD GRFPVEDAAE LGWPVEDSAD YETIAGWLMS MLDSVPQVGE ELAFDGYRFK IQAMRRRRIS TVRVERLDDP SPSCVDAVEA IDREEA
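The sequence fields are printed in space-separated blocks of ten characters. A minimal sketch of how they might be processed follows: it strips the block spacing, computes the GC fraction (for comparison with the reported GC content), and translates the gene sequence for comparison with the listed protein sequence. The helper names are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons; table 11 differs in which codons may initiate translation, and that needs no special handling here because this gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: standard amino acid assignments, as used by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
print(translate("ATGGATATTT GGATTAGCAT"))  # MDIWIS
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be checked against the protein sequence and the GC content given in the record.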