Gene Information |
Locus tag | Elen_0745 |
Symbol | |
ID | 8415035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 936623 |
End bp | 939538 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 645023716 |
Product | pentapeptide repeat protein |
Protein accession | YP_003181113 |
Protein GI | 257790507 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
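The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(936623, 939538)
print(length)                     # 2916, matching "Gene Length"
print(protein_length_aa(length))  # 971, matching "Protein Length"
```

The same check applies on either strand, because the length depends only on the span between Start bp and End bp.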
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.888113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGTG AAGCGGTTAC CGTTTCTTCG GTTGTTTCTG CATTCATCTC TGAGTTCGTC GCCAAGACAA CGCAATGGGC TGAACAAGGA CGCAAAAAGG CCAAAACAAA GGAATCGCTT TCAGGTGCGC TCGACAACCT GACCTCAGAT CTGAGGTCAG CGCTTGAGCA AGCAGCCGAT GTAGGCCTCG ATGAAAACGG CTTGTCGTTT TGCGACTGGC TCGGCGAAGA GACCGAGTAT ATCTACGACT CGGCACTGGA GACGAAAGAG GAAAGGGATT CGCGAATCGA AGCGCTCCTC CTTGAAGGTC AAAGAAAATT CGGTTTCAAA GGCGGCGAGG ACAAGGCGGA ATCCATAGCC GAAGCGATAA GCATCGCATT CGATTACTGT CTCAAACTGA GATTTGGGCA TCTAAACGAC GACGACAGAA CCGTCGTCAA CATGACGGTA ACATACGGAG AAGAGACGGT CGACCGGCAA ACGACTGAGA TAATCTACGC CCTAGATCCA ACACTCGCGA CCGATCGTGA GTGGCTGAAG TACCAGGATT TCCTTGCTTC CCAATCCATG ACGGCGATAG GCACCAAATT CACCGTCGAC GAGCTCTACA TACCGCTCAA CGCCATCGAG ACTGCGGAGG CCACGTTTTA TCCTCCACAC CAAGATTCCC ATGGCCGCCT CTTCGATCTC ATCGAGAGTA ATAATCGTTT AGAAATCCAG TTGTTTCCCG AAACCGTTGC AGAGGCATCC GTTGCCCATT GGCCTATCGA GCATAGCCAT ACCTCAACTT TCACCGATAC AGTAAGCTTA GTCGTTTCAG TAGATGAGCG TATCGCGCAG TGGCTGGAAA CGCCCGATTT TAGGGACGAC CACAACCCTG TCCTTGTTCT TTCCGGAGAT CCCGGAAGCG GAAAATCCAC CGTTGCCAGG CGCCTTTCTA AAACACTGGC CAAAACCGAA GAGGTCAACG TTGCTTACAT CAGCCTCAAA GACGTGGCGA CGGATTCGCA AATCGACGAT ATCGAGGCCC TCGTCGTGAA ATACATCCTA TCGTTGCCCG GTCGCGACTT CGTTGTTTCG AAGCTTCCAA CGAGCAAGCC TCTGGTTCTC ATATTCGATG GACTCGATGA ATATGCAGCG CGAGGACCTA AGAGCAAAAA GGCAGCATGG GGTCTCCTGG GATCCATACT TCGATACGGA CAGCGTTGCA GCGAAGCCTC CTTCCCAACC AGAGTCCTCG TCACTTCAAG GACCACACTG CTTCGAGATA TGAAAAAAGA AGTATCTCCA AAAGAGTGCA AGCCGATTCG ACTCGAGCTG CTTCCCTACG TTTGCTCCGA AAGTGAGTTA GGCGACATTT GCGACCCAGA CTTCCTCCTT TGTGAAGATA AAAGGATTAC TTGGTGGACG AACTACGGAG AGAGACTTGG TAGGGATATG AGCTCGACGA TGGAAAAGAT TTTCAAGTGG GACGACGAAG TGAAGTTATC GGCCCAGCCT ATCCTCAACC ATCTGATAGC CGTGTTCGCA AAGAATATTC TCGAGACGGC TGCACCCAAC CGCGCCGAGG TGTACGGAAC TTTGCTTAAG GGCGTGATAA ACCGGAATTA CGATCACGCG CAAGGAAAGG CGAAAATACA TAGCAGAACA CGCGCAGCGG ACGTGGCGGG GTATCTCGAG TTCATGGAGG CCGCTGCTAT TATGGCATGG CACAACGGCG GATCTATCAC GGATTTAAAG CCCTTGAAAA ACGAATGCGG GAGCGAACGC GCTAAAGAAG CCTTTGATTG CTTCAAAAGC 
GAGGCGCGCA TCAAGGATAA TATAATCGCT GGTTACTTTC ACCTAAGAAA CAAAAACGGC GAGGAGAGTT ACGAGTTCGT TCATTACAGT TTCATGGAGT ACCTCGTATC GCGCCGCATC GTCGGCGAGT TGACCAAAAT GCTCGACAGG AAAGTTTGCC CCATCACATC GATGCCAAAG CTTTACGACA TGCTGGGCTG TTCGGAACTC ACGGACAACA CGCTATCGTT CATAAGGGCC GAACTCTCTC TGTTCACCAA AAAGAAAGCC GCCTCGCTTC AAAAATACAT GATGTCGTTA TTCATGGAAT CCCTATTGGA ATACAGCTTC GATGAGACGT TTTTTGATAA GGAAAGAAAC GGAACACTGT TTTCGACAAA GTTATCATCC ATTCGCAACG TCCAGGGGAA CATTCTGGCG CTACACAGCT GCGCTGCCAA AGTGACTGGC GAGAGGATGG GCGTATCTTT CAACCTATTG TTACAATGGT TGGGAGGGAT CGGTGTGTTT GCATGGGAAA GCAACGTTTC TTCTTTCTTC AATGGTCTTG AGCCGATCAT AGAGAGCGGG GAAGATGAAT ACATTCTGAC GCTGCCTTTC GCGCAACTCA GTTCGTCGGA TATGTCCAAT TCGTTCATGG AACACGCAAT ATTCACCGGA GCAATGCTTG ATAACACCAT CGTAGTCAAC GCGGATATGA AGCATGCGAA TTTCAATGAC GCCCGACTCG TTGACGCGAA ATGCTCATAT GCCCATTTCG AACACGCGTC CCTGGAAAAA GCCACTCTTC ATGGAGCGCA CTTCGATCAC GCCCACCTCG AGAACGCCCA CTTGCTAGGA GCAGAGTTGG AGGGGGCGAA ATTCCAGCAC GCCCACCTCG AAGATGCGGA TTTGCGGTTC GCAAAGCTCA TAGAGGCAGA ATTTGGATGG GCGAAAATGA GGGGCTCAAA TATGGGAGGG GCCATCCTCC GCGAGGCTGA TCTTCGGTAT GCCGAACTCA AGGGATGCAA TCTCGAAAAC GCGATCCTCG ACAAGGCGAA GGTGCTGAAG AAGGACATCA AAACACTCCA CGAATGCGGT GCGGACGTTT CAAAAGTTAT AGCGTACGAC GAATGA
|
Protein sequence | MFGEAVTVSS VVSAFISEFV AKTTQWAEQG RKKAKTKESL SGALDNLTSD LRSALEQAAD VGLDENGLSF CDWLGEETEY IYDSALETKE ERDSRIEALL LEGQRKFGFK GGEDKAESIA EAISIAFDYC LKLRFGHLND DDRTVVNMTV TYGEETVDRQ TTEIIYALDP TLATDREWLK YQDFLASQSM TAIGTKFTVD ELYIPLNAIE TAEATFYPPH QDSHGRLFDL IESNNRLEIQ LFPETVAEAS VAHWPIEHSH TSTFTDTVSL VVSVDERIAQ WLETPDFRDD HNPVLVLSGD PGSGKSTVAR RLSKTLAKTE EVNVAYISLK DVATDSQIDD IEALVVKYIL SLPGRDFVVS KLPTSKPLVL IFDGLDEYAA RGPKSKKAAW GLLGSILRYG QRCSEASFPT RVLVTSRTTL LRDMKKEVSP KECKPIRLEL LPYVCSESEL GDICDPDFLL CEDKRITWWT NYGERLGRDM SSTMEKIFKW DDEVKLSAQP ILNHLIAVFA KNILETAAPN RAEVYGTLLK GVINRNYDHA QGKAKIHSRT RAADVAGYLE FMEAAAIMAW HNGGSITDLK PLKNECGSER AKEAFDCFKS EARIKDNIIA GYFHLRNKNG EESYEFVHYS FMEYLVSRRI VGELTKMLDR KVCPITSMPK LYDMLGCSEL TDNTLSFIRA ELSLFTKKKA ASLQKYMMSL FMESLLEYSF DETFFDKERN GTLFSTKLSS IRNVQGNILA LHSCAAKVTG ERMGVSFNLL LQWLGGIGVF AWESNVSSFF NGLEPIIESG EDEYILTLPF AQLSSSDMSN SFMEHAIFTG AMLDNTIVVN ADMKHANFND ARLVDAKCSY AHFEHASLEK ATLHGAHFDH AHLENAHLLG AELEGAKFQH AHLEDADLRF AKLIEAEFGW AKMRGSNMGG AILREADLRY AELKGCNLEN AILDKAKVLK KDIKTLHECG ADVSKVIAYD E