Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1026 |
Symbol | |
ID | 8415316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1243020 |
End bp | 1245071 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645023990 |
Product | hypothetical protein |
Protein accession | YP_003181387 |
Protein GI | 257790781 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0776826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.319246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGAAC AAACGATCAT CCCTGCGGGG GTCATGCGGC GGGCGTGGGC CCTCCTGCTG GCCGCCGCGC TCTGCCTGGG CCTCATGCCC AGCGCAGCCT GGGCCGAAGA AAGCGAGGGT GCGGGGACGT TCTCGGTGGC GCTCACCATC GTGGACACGT CCGACCCCGC GGACGGCGTC CTGTACAACG GCAAGGTCGA CGGCATGACC TCCGACGACA CGGTTGCCGA CCTGCTGGCG AAGGCGGGCT TCACCGCCGC GGCCAGCGCG GAGGAGACCG AGGGGAACGA CAAGGCGTAC TTCGACTCCT GGGGCTCCCC GACGTTCCGC GGCAACAAGT CGGTCCAGCA GCCCGACGGC TCGTGGGCCT ATTGGGCGAC GATGTTCGAC GGCGACAGCG CGAACTACGC CAGCGCCCAG CTGACGTCGA AGCTGCAGGA GAACGGCCGC TACCAGTACA TCTACACGTC CGACGCGACG TTCGCCTACG ACGAGGAGGC TTCCGGATTT CCGGTGCAGC TCACCATCGT GGACACGTCC GACCCCGCGG ACGGCGTCCT GTACAACGGC AAGGTCGACG GCATGACCTC CGACGACACG GTTGCCGACC TGCTGGCGAA GGCGGGCTTC ACCGCCGCGG CCAGCGCGGA GGAGACCGAG GGGAACGACA AGGCGTACTT CGACTCCTGG GGCTCCCCGA CGTTCCGCGG CAACAAGTCG GTCCAGCAGC CCGACGGCTC GTGGGCCTAT TGGGTGACGA TGTTCGACGG CGACAGCGCG AACTACGCCA GCGCCCAGCT GACGTCGAAG CTGCAGGAGA ACGGCCGCTA CCAGTACATC TACACGTCCG ACGCGACGTT CGCCTACGAC GAGGAGATTC CTCAGCTCGC GATCTACACC GTCAACGACC CCTTGGCGGG GGCGATCAAG CCCGATCCCA AGCCCGATCC CAAGCCCGAA CCCGAACCTG ATGAGCCTGC CAACGATGCC ATCGCAGTAG ACAGCGCTGC GTACAACACG CTGTTCGGCA CCATCGCCAG TTCCTATGCG GGTACGTCCG AGGAATGGAA AGCCCTTGAG CTCGCGGCCG CCGGTCGTGT GTCGTCGGTT GACGTGGCGA CGCTCGTGGC GAATGCGAAA GCGGCGAACG GTTCTCCCGA AACCACCAAT TTGCAACGCT TCATCTTGGC GCTCACGGCC GTCGGCAAAA CAGAGGAAGC AGCGGAGCTC GTGCAAACGA TGGCGACGTC CGACATTTCG ACTACCTATG TGAACGGTCA GGCGTTTGCT CTGCTCTCCT ACGAAAGCGG TGCGTACGAC GCGCCCGCGA ACGCTCTTGA AACCGAAGCT GAGCTTGTGG CCAAGCTTCT GAGCGCGCAG CAGGCTTCCG GCGGCTGGAC CTGGAAAGGC GCTGCCGAAG GGGACGATCC TGATACGACG GCCATGGTGA TAACCGCTCT TGCCTCTCGC GTTTCGGACG CCTCCGTCAA AGCGGCGGTC GACAAGGGTC TCGAGGCGCT GCGCGCAATG CAGCACGAGG ACGGCGGTTT CCGCGCATCC GGGGACGCGG CCGATGGTCC CATCAACGTC AGCTCTACGT CCTGCGTCGT GGTCGCCCTG TGCGCCTTGG GCGCGGATCC GGCGGCATCC ATGGTCACCG AAAGCGGCGC GACGCCGTTG AGCGCGCTGC TCTCGCAGGC CACTTCCGAC TTGTCCGGAT TCGTTTACAA CGGCGCTGCG AACGACCTCG CCACCGAGCA GGGGTTCCGG GCGCTTGTTG CGTACCAGGG CCTCAAGAAC ACCGGGGCGG CGTACAACGT CTACACGCAG GCGAAGCTCG GCCAGGCTGC GCTGCCCGCC GAGAAGCAGG AAGAAAGCGA CGTCAAGCCC GCAGGGGCTC CGGCGGCCGA CAAGAAGGCG CTCGCCAAGA CCGGGGACGG CTCTGCGCCG TTCGCGGCCG GCACTGCCGC GCTCGCGCTC GGCGCGCTCG CGGCGGGCAT CGCCGCCACG CGGCGCATGC GCGCTTCCGA TGAGCTCTCG TTGCGCCGAT AG
|
Protein sequence | MKEQTIIPAG VMRRAWALLL AAALCLGLMP SAAWAEESEG AGTFSVALTI VDTSDPADGV LYNGKVDGMT SDDTVADLLA KAGFTAAASA EETEGNDKAY FDSWGSPTFR GNKSVQQPDG SWAYWATMFD GDSANYASAQ LTSKLQENGR YQYIYTSDAT FAYDEEASGF PVQLTIVDTS DPADGVLYNG KVDGMTSDDT VADLLAKAGF TAAASAEETE GNDKAYFDSW GSPTFRGNKS VQQPDGSWAY WVTMFDGDSA NYASAQLTSK LQENGRYQYI YTSDATFAYD EEIPQLAIYT VNDPLAGAIK PDPKPDPKPE PEPDEPANDA IAVDSAAYNT LFGTIASSYA GTSEEWKALE LAAAGRVSSV DVATLVANAK AANGSPETTN LQRFILALTA VGKTEEAAEL VQTMATSDIS TTYVNGQAFA LLSYESGAYD APANALETEA ELVAKLLSAQ QASGGWTWKG AAEGDDPDTT AMVITALASR VSDASVKAAV DKGLEALRAM QHEDGGFRAS GDAADGPINV SSTSCVVVAL CALGADPAAS MVTESGATPL SALLSQATSD LSGFVYNGAA NDLATEQGFR ALVAYQGLKN TGAAYNVYTQ AKLGQAALPA EKQEESDVKP AGAPAADKKA LAKTGDGSAP FAAGTAALAL GALAAGIAAT RRMRASDELS LRR
|
| |