Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0943 |
Symbol | |
ID | 8415233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1147528 |
End bp | 1149225 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 645023907 |
Product | hypothetical protein |
Protein accession | YP_003181304 |
Protein GI | 257790698 |
COG category | [S] Function unknown |
COG ID | [COG4938] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000190881 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00360431 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCGCTG GTTTTAAGAT TAAGGAAATT GAGCTCCGTT CAGAAGAAGG AACTCGGTTC AGCCCAAAAG CACTAACAAT CATTGTTGGG CCAAATAATG CTGGGAAAAG TAGGTTTCTG AAAGAAGTTC GGTCTGCCCT TTTGGGTAGA TTGAGCGACG AAGACGGCGG CCTAATCCTG GGTCGGAAAA TAATAAGCAC TATTGAATTG CTGCTTCCCG AATCGACGGA CGCTCTATTT GAGTGGTTTG ATCTGGATCG AAAAGTGGTT CGCGATGAAA ATGGAAACTA TGGTGTAAGA GAGTACTGCA ATACTGGAAT TAATATTAAT CAATATGGTC AAATAGTTCG AGAAGAGTGT GCGACAAAAT ATCAGGGACA GTGGAAGGAT GTAATCGAAT CTCATTATGC CTCTCCCAAT GATGCGCGTG CTTTAGATGC GCTTCTCAAT TTCATAGGGC CACTGCTTGT CGGCTATTCT GGCACAGAAG ATAGATTGAT TCTTTCCGCT GGAGAACCGT ACTACGGCGT GGCAGATTCA AATACGAATT TTCTTTCACG AGTTCGATCC CAAGATCAAA TCCTTGATGA CTTGTCGGAA ATTTCTAAAA GATTATTTGG GAAAGATGTC GTACTCGATG ACGTTACAAA AGGTGGAATG ATCCAATTCA AGACTGGTTC GGATTTTTCG AGTTACAGAA CAAGTGCTCG TGGAACCTCA GATTTCGAAT TCCTGTTAGA GCAGGGTGTT TCCTTGAAAG ACGAAGGAGA TGGGTTTCGA AGCTTCGTTT CTGTATATCT TGCTCTTAGG TCGGGAGACA AACCTGTAGT TCTAATTGAC GAGCCAGAGT CTTTTCTTCA TCCGCCTCAA GCTTACGAAC TGGGGAAGGT GATTGGCTCT TCAGCTGAAC AATGCAGCCA AATGATCATA GCGACGCATA GTACGCATCT TCTTAATGGC ATCATGTCAA CATGCGACTG GGATAACTGC GACATTTTAA GACTACAGCG CGACGGAGAC TCTCTTCGAG CGAACTTGCT TGACCGCGAG GGCTTGGATA GGGTGAAAGG GGATCCTCTG CTGAGAAGCA CGCGTCTTTT GGAGGGCGTT TTTACGCGTG TCGTAGTTGT AGTGGAGTCG GAGTCGGATG AGCTTGTGTA TCGGGAAATC CTGAATAAAG TGGGCGTAGC CGACGAGGCG TTCTTTGTTA ACGTGCACAG CAAGGACAGA ATTGCATTCG CGGTGGAATT TTATAAGAAC GTTGGCGTTC CCTGCTGCGC AGTGATGGAT TTTGATATTT TGAATGATAA GAATAAGTTC AAAAGAGTCT TGAAGTGTTT TGAATGTGAC CCTAGCGGAA GGCTTTCCCA GATAGCTCAA GAGACAAGAG ACGCTATAGA ATGCGATGCG GGAAAGCCCG AAGAGACAAA GCTACGGTAC AAACGCGATC CTTTGATGTA TCTAGATAAG ATTGAGAATG AAGTTGAAGA GTTGCTAGAT CGATGCTTGG AGTGCGGTTG TCTTATTGTG AGGACGGGCG AACTCGAAAC CGTTTTTGGA GAAAAGGTAG CCTATCGATC TTCGAAACGG GCTTGGCTCT CCGAAGCCCT GGATTATCTG AACCATTTGG AGCCTGGCGA ATTAACTTCT CTTGCGATCG TTTCCGATCT CATAAAGATG TTGCAGGTTG CGAAATAG
|
Protein sequence | MVAGFKIKEI ELRSEEGTRF SPKALTIIVG PNNAGKSRFL KEVRSALLGR LSDEDGGLIL GRKIISTIEL LLPESTDALF EWFDLDRKVV RDENGNYGVR EYCNTGININ QYGQIVREEC ATKYQGQWKD VIESHYASPN DARALDALLN FIGPLLVGYS GTEDRLILSA GEPYYGVADS NTNFLSRVRS QDQILDDLSE ISKRLFGKDV VLDDVTKGGM IQFKTGSDFS SYRTSARGTS DFEFLLEQGV SLKDEGDGFR SFVSVYLALR SGDKPVVLID EPESFLHPPQ AYELGKVIGS SAEQCSQMII ATHSTHLLNG IMSTCDWDNC DILRLQRDGD SLRANLLDRE GLDRVKGDPL LRSTRLLEGV FTRVVVVVES ESDELVYREI LNKVGVADEA FFVNVHSKDR IAFAVEFYKN VGVPCCAVMD FDILNDKNKF KRVLKCFECD PSGRLSQIAQ ETRDAIECDA GKPEETKLRY KRDPLMYLDK IENEVEELLD RCLECGCLIV RTGELETVFG EKVAYRSSKR AWLSEALDYL NHLEPGELTS LAIVSDLIKM LQVAK
|
| |