Gene Information |
Locus tag | Elen_0093 |
Symbol | |
ID | 8414376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 122513 |
End bp | 124516 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645023072 |
Product | membrane protein-like protein |
Protein accession | YP_003180476 |
Protein GI | 257789870 |
COG category | [S] Function unknown |
COG ID | [COG4907] Predicted membrane protein |
TIGRFAM ID | |
|
|
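The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes the Start/End coordinates are 1-based and inclusive and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS that includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and does not encode an amino acid.
    return cds_length_bp // 3 - 1


# Values taken from the record for Elen_0093.
length = gene_length_bp(122513, 124516)
print(length, protein_length_aa(length))
```

With the values listed for Elen_0093, both results agree with the Gene Length and Protein Length fields in the table.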
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAACA TGAGCGGTTC GCAACTCCGC GAATCCTTCG CTTTTTCCCG ACGAAACGCC GTGCTGTTTT GCGCGGCGCT CCTGGCGCTC GCATGCGCGC TTGTGGCGCT GGCTCCCGGG CAGGCTCACG CGAAATCGTA CACCATGCCG AAGGTGGATA TCCAGGCCCA GGTGGAAACC GACGGCGCGT TGCAAGTGAC CGAGCAGCGC ACGTTCGATT TCGACGGCGA TTTCTCCGCG GTGTGGTGGG CCTTCGACGG GCTTCCGCAG AACGCTTCCC TCAAGATCAA CGGCGTGCGT ATGGCGAACG TCGATGCCGA TGGCACGGTG GTGGGCGATT GGACGACGTT GCCGAGCGAG GCGTTCGTGC TCGGCTGGCG CGAATCGGGC GGGCCTGGGA AAGATTCCTA CTCGTTCGAC GCGCCGAAGA ACAGCGTGTA CGTGTTCTTC AACGCCTCCG ACGACCGTCG CATCATCGAG CTCGATTATA CGGTGGTCAA CGGGGCGCAG GCGTATTCCG ACATAGGCGA GGTGTATTGG AAATACGTGG GCTCCCAATG GAAAGAGGCT TCCGACAACG TCACCATGAC GTTGGCGCTG CCCGTACCGC AGGGTACCGA GGTCGTGCCG GGAGAGAACG TGCGCGCATG GGGTCATGGA CCGCTCGACG GCAAGGTGAC CGTGAACGCC GACGGAACCG TGACGTATGC CGTGCCGCAT GTTGCGGCAG GTCAATTCGC CGAAGCGCGC GTGGCGTTTC CGGTGAAGTG GTTAACGAAC CTGTCGCCCG AAAGCGCGGC GCTGCATCAA GGGGAAAACC GTTTGGACAC GGTGCTGAAA GAAGAAAAGG ACTGGAGCGA CCAGGCGAAC CGCACTCGCG TTCTTTCGTT GGCGTTCGTC ATCGGCTGCG GTGTTGTCTG CGTGCTGCTG CTCGCCTGGG CGCTGCGCGC CTACTTCAAG TACGGACGCG AATACCAGCC CCGGTTTACC GACGAGTATT GGCGCGACGT GCCTGACCCC TCCATCCATC CGGCGGCTAT CGGACGCCTG TGGCGCTGGG ATCGCGAGAG CCAAGACGAT TTCACGGCTA CGCTCATGCA TCTTGCGCAC GTCGGCGCCA TCCGCATCGA CGCGGGCAGT TACGAGGAAC CCGGCGCGTT CGGGCGCATG AAGACCGTGG ACGACTATTA CATCACAAGG TTGCCTGCTG CCGACAACGT GACCGATCCC ATCGATCGTC AGGCGTTGGA TTTGCTGTTC GGCACGTTGG CCGGGGGAGC CGACTCGCTG TGGTTCGGCA CTATCAAGAA GTACGGCGAA GATCATCCGC AGGAGTTTGT CGACGCGATG CAGGGCTGGC AGGGCGCGCT TTCGGCGGCG ACGAACCGCG AGGATTTCTT CGAGGCGAAA GGCAAGCGAT ACCAGGGCTA CCTCATCGCG CTTGCCGTGG TGGTCGCTTT GTCCGGTGTC GCCATTTGGA TACTCATGTC GAACTTCATC CCTCTGATAT TCATGATTCC CACCGCCATC GCGTTGGGCG TGATCGGGAA CTACATGCCG CGTCGCAGTG TGAAGGGCAA CGAGCTGACG GCCAAGAGCA AGGCGCTGCG CAACTGGCTG ACCGACTTCT CGTCTCTTGA CGAGCGTCCG CCCACCGACG TGAAGGTATG GGGCGAGTTC ATGGTGTACG CCTACCTGTT CGGCGTGGCC GACCAGGCCA TCAAGCAGCT GCAAACCACC ATGCCGCAGC TGTTCGAGTA CGACGGATCT ATGGGCATGA CGTACATGCC TTGGTGGTTC 
TGGTACACGG GAGGCCATAC GGCTGCCGGC TCAGCGATGC CTTCGGTTAG CGACATGCTG CAAACGTCGA TGACGAACAC GATGTCAACG GCTCAGGCGG CGCTTTCGGG CGCTAGCGGG AACTTCTCCA GCGGCGGTGG ATTCGGCGGC GGCTTCTCAG GCGGCGGCGG AGGCGGCTTC GGCGGCGGAG GCGGCGCCCG GTAA
|
Protein sequence | MQNMSGSQLR ESFAFSRRNA VLFCAALLAL ACALVALAPG QAHAKSYTMP KVDIQAQVET DGALQVTEQR TFDFDGDFSA VWWAFDGLPQ NASLKINGVR MANVDADGTV VGDWTTLPSE AFVLGWRESG GPGKDSYSFD APKNSVYVFF NASDDRRIIE LDYTVVNGAQ AYSDIGEVYW KYVGSQWKEA SDNVTMTLAL PVPQGTEVVP GENVRAWGHG PLDGKVTVNA DGTVTYAVPH VAAGQFAEAR VAFPVKWLTN LSPESAALHQ GENRLDTVLK EEKDWSDQAN RTRVLSLAFV IGCGVVCVLL LAWALRAYFK YGREYQPRFT DEYWRDVPDP SIHPAAIGRL WRWDRESQDD FTATLMHLAH VGAIRIDAGS YEEPGAFGRM KTVDDYYITR LPAADNVTDP IDRQALDLLF GTLAGGADSL WFGTIKKYGE DHPQEFVDAM QGWQGALSAA TNREDFFEAK GKRYQGYLIA LAVVVALSGV AIWILMSNFI PLIFMIPTAI ALGVIGNYMP RRSVKGNELT AKSKALRNWL TDFSSLDERP PTDVKVWGEF MVYAYLFGVA DQAIKQLQTT MPQLFEYDGS MGMTYMPWWF WYTGGHTAAG SAMPSVSDML QTSMTNTMST AQAALSGASG NFSSGGGFGG GFSGGGGGGF GGGGGAR
|
| |