Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0126 |
Symbol | |
ID | 8414409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 170456 |
End bp | 172048 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645023105 |
Product | hypothetical protein |
Protein accession | YP_003180509 |
Protein GI | 257789903 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTCA TTCTGAAAAA CATGAAGCGA GAGCAGGTGC TGGCCGCGCT CGCCTACGGG TTCTTCCTGG CGTGCACCTC GGTGACGCTG TGGGGAGGGT ATGTGCGGTT TCTGACCGGG CAGGTGGATC CGGGCTCTTT CATGCTCGAG TATTTCGTGC GAAGCGCGAC CCTGCCTCTC TCGCTTGCGG TTTCCGGCTT GGCCGCCTTC TATTGGCCGA AGGCTCGGCT GTGGCGTTCT CCGATACTGG CGCTGGGGTT TTTTCTTGCG GGTTCGCTGC TGGTTGCCTT GCAGCATGGC GGCGTGGTTT CGTCCGAGTG GTCGCTTGTC GTGGTGGGCG CGTGCTTCGG ATGGGGAAGC GGGATCATGT TCGGCGCCTT GCAGGAAATC GTGGCTGCGC AAAAGGTGTT CACGGCCGGT ATCGTCGTGT TTGCCGCCGC TGGGATTTCG GCGTTGCTGT TTTTCGCTGT TGAGGTTTTG CCCGCCGATG CAGTACCGTG GATGTCGCTG TTCGTGTTCG TCGGCGCGGC GGTGGCGCTA ACCTGTGCTG CATGGAAATG CGCTCCGAAG GTGCATCCCA TGTTCGACAC GGTTCCCGAT CAGCGGCGCG ATCGGTGCCG CGAAGCGGTT TCCGAGTTGT GGCGCCCTTT GCTCTGCGTC GCTTTCTCCG CGTTCATTGT GGGTATCGTG CGCGTCGGTT CGGCCTCGGG TGGCGGGTCG ATAGGGCAAA CGAATGAAAG CAACATGATC GGGCTGCTTG CCGCTTCGGT CGCGCTGCTG GCAACTTGGA GGTTCTTGTA CGAGCGCGTT ACGCTCATGC GTTTGTATCA GATTCTGTTT CCGCTTACGG CGACGGCGTT TCTGCTGTTG CCGCTTCTTG AGGGAACGTT TCGCCAGGTG GTGTACTCCC TCGTGTTCCT GGTGTTTTCG GTCACGTCTT CGCTCATGGT AGTGTCGTGC GCTCGGACGG CGCGCAACCA ATCGCTCACT CCCGTGCTCG TGTACGGCAC CTTCGCCAGC ATCGTGTATG CAAGCTCGCT GGCGGGTTCC GTCGTGGGCT TGTTCGTGGG CGCGGGTCGA GGCGTTGGGC TGGCGGAGCT GTCCGTGGTG GCATTGGTGG CGGTGTACGC GCTGTCGGTA GCGATGGTGG CGCCGCATGG GAGGAAGGCG GGTAGCGTGA AAGCCGCTTC GGGAGAGAAG CTGGCTCCTG TAGTCGGAGC TGTAGGCGAT TCCGTGACTG CAGGCTGCGC GGTTGCTGTC GAGCGTTACG GTCTGTCGCG TCGCGAGGCC GAGTCTCGAC CTCCTTGCCC GTGGCCGCGA TGTGCCGTAT GTGGCCGAAG AGCTGGTCAT ATCGAAGAAC ACCGTGCGGA CCCACACGAA GAGCATTTTC GCAAAAACAG GGGTTCATTC GCGCCAGGAG CTTATCGACT TGGTGGAGTC CATCGAAGCG TGAGCACGTT CCTGCCGAAC GCGAACTTTG CATCGCGAAA GGTCCTGCGC GCCCCATTAG CCATATGTCC CGCAGCGCAG AGGAATGCTT CACGTGAAAC ATTCTTGCAT TGGATTTCCC CTGTCGAAAA TACTAGTTTT TAA
|
Protein sequence | MRFILKNMKR EQVLAALAYG FFLACTSVTL WGGYVRFLTG QVDPGSFMLE YFVRSATLPL SLAVSGLAAF YWPKARLWRS PILALGFFLA GSLLVALQHG GVVSSEWSLV VVGACFGWGS GIMFGALQEI VAAQKVFTAG IVVFAAAGIS ALLFFAVEVL PADAVPWMSL FVFVGAAVAL TCAAWKCAPK VHPMFDTVPD QRRDRCREAV SELWRPLLCV AFSAFIVGIV RVGSASGGGS IGQTNESNMI GLLAASVALL ATWRFLYERV TLMRLYQILF PLTATAFLLL PLLEGTFRQV VYSLVFLVFS VTSSLMVVSC ARTARNQSLT PVLVYGTFAS IVYASSLAGS VVGLFVGAGR GVGLAELSVV ALVAVYALSV AMVAPHGRKA GSVKAASGEK LAPVVGAVGD SVTAGCAVAV ERYGLSRREA ESRPPCPWPR CAVCGRRAGH IEEHRADPHE EHFRKNRGSF APGAYRLGGV HRSVSTFLPN ANFASRKVLR APLAICPAAQ RNASRETFLH WISPVENTSF
|
| |