Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0056 |
Symbol | |
ID | 8414336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 69940 |
End bp | 71709 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 645023032 |
Product | hypothetical protein |
Protein accession | YP_003180439 |
Protein GI | 257789833 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCATGAGA GCATTCTTTC CGAGATCGAA CACACGCGTC GCGAGGAGCG TCGGAAGGGT CGCGCACCGC ATGCGCGCCG AGCGGGCGTT CGCGGAGATA CGGCAGAGGA AGCGCGCGAC GGTGCAACGG CGGCGGTCGT TCGGCGGTGC CGGTGGAGGC CGGTTATTGC GGGGGCGTGC GCGCTTGCAG TCGTGGCGGC GCTCGTGCCG TTCAGCGGCC AGCTTGCCGG GTTGCCGGGC GATATGGGTG GCCATGGCAG CGCCGTTGCG GGTCAGGCGT TCTCCGTGCG GGCGTACGCG TCGGACACGC GCATGCTGGT CCCTCCTACC GATAACGGGA TGATCGTGTT CGACCGCTCC GGCAGCCTGA CCACGCCAAC GAAGGATTGG TACTTGCAAT ACGGGAAGTA CACCGGCTGC ATGTTCACCG TGGAGGGCGA GGGCATCGTG CGCATCCAGG CCACCACCTC GGCGGGGATG TTCTACCGCA ACAGCTACGA GACCATCAAC GGGCGAGACG ATCCTGAGCG CGTGGCCGAG GTCACTGCAT GGAAACCGGA GAAGGCGGGC TTGGGCGATC ATTACGGAAA GTACGAGAGC GTGCAAGTGG TCGACGGATT TGGCCTGCCC GATCCCGATC GTGACATGAC GGTGCGCCTC ACGAAAATGC TCGGTTCCAC TATCGACCTG CCTATCAGCG CGGACGATGA CGGCGACGCG AAGAGCTTCG GCCTGTGGAC CAACGAGGAT TACGGCGACG CGGTCGAAAC CGCGGAAGAC CCACTTGCGG CGACCGATGC GGTGTTCGAC ACGTTCGAGG GCCAAACCAT AACGGTGACG GCCTACTTCG AGGACGGCAG CTGCGCCACG CAAACCATCG AGCTGCACAC GGCAGACTTC AAGGCGACGA TGAACGATCC GATGGCGTTC TACGGTTACG GTTCCATCGA GGTGTATCCC GAGATCGTCG ACCGCTCGAT GCTGCCGAAC GTGTTGGGGG ACGGGGTGGA TGAGGGCGCT CCCTTCTGTC TGCACTCGTT GTACGGCGTC ATCGTCGACG AGAACGACGG GCCGCATCCG TACTCGCTCG ACAACGCGAA CGAGTGGCTC GATGCGGCGG TGCCCTACAC GTTCGAGCGG CGCGAGACGT TCACCAGCAT GGGAGATGCG GTGTTGGCTG ACCAGTCGAT AAGCGATCCC GACGGACGGG TGACGGTTTC GGTGCCGGCG GATCCGCTGT CAGAAGAGCG CTACGGCGAG ATGTGCGACA TCGAGCTGTC GAACCTTCGC ATCGAGCGGA GCGACCGGCT GCCGTTCGGG CTGGCGGTCG AAGACACCTC CGACTACGCG GGCTTTCTGG GAGACTTCGC CTATATGAAC AAGGTGTCGG ACGAGACGCT GGGCTACCAA ATCGCAGACG ACGGCACGTT GACGCCGGGG TTCTCGTACC GCACGGTCAC GTTCGAGGCG ACGAACCCGT CCGACGAGGA GGTTCCCGTC GATGCGAGAA CATTGGGCAC GTTCGCCGTT CGCGACGCCG ACGGCCGCTG CTCTGCGTTG GCGACGCGCG CCCTATGGAT GACCGGCTTC GAAGCCGGGG TCGATTCGAA TTGGGGAGCG GCACTGGCAC CTCACGAAAC GCGGGATCTT TCGGTTGTGT ACGTCGTGCC CGACGAGTTC GACGAACAGG CCGATCCGCT GTTCGTCTTC TCCTCGTATG CGAACGACGA TGCCGATAGG GTTGCTTTCC GCATCAAGTC GCTGCTGTAG
|
Protein sequence | MHESILSEIE HTRREERRKG RAPHARRAGV RGDTAEEARD GATAAVVRRC RWRPVIAGAC ALAVVAALVP FSGQLAGLPG DMGGHGSAVA GQAFSVRAYA SDTRMLVPPT DNGMIVFDRS GSLTTPTKDW YLQYGKYTGC MFTVEGEGIV RIQATTSAGM FYRNSYETIN GRDDPERVAE VTAWKPEKAG LGDHYGKYES VQVVDGFGLP DPDRDMTVRL TKMLGSTIDL PISADDDGDA KSFGLWTNED YGDAVETAED PLAATDAVFD TFEGQTITVT AYFEDGSCAT QTIELHTADF KATMNDPMAF YGYGSIEVYP EIVDRSMLPN VLGDGVDEGA PFCLHSLYGV IVDENDGPHP YSLDNANEWL DAAVPYTFER RETFTSMGDA VLADQSISDP DGRVTVSVPA DPLSEERYGE MCDIELSNLR IERSDRLPFG LAVEDTSDYA GFLGDFAYMN KVSDETLGYQ IADDGTLTPG FSYRTVTFEA TNPSDEEVPV DARTLGTFAV RDADGRCSAL ATRALWMTGF EAGVDSNWGA ALAPHETRDL SVVYVVPDEF DEQADPLFVF SSYANDDADR VAFRIKSLL
|
| |