Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1056 |
Symbol | |
ID | 8415346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1278392 |
End bp | 1280227 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645024019 |
Product | myosin-cross-reactive antigen |
Protein accession | YP_003181416 |
Protein GI | 257790810 |
COG category | [S] Function unknown |
COG ID | [COG4716] Myosin-crossreactive antigen |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.488016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.00102136 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTACTATT CCAGCGGAAA CTACGAAGCA TTCGCCCGCC CGGCAAAGCC GGAGGGCATC GAGGACAAAT CGGCCTACAT CGTCGGCACC GGGCTGGCGG GCTTGTCGGC CGCGTGCTAC CTGATCAGGG ATGCTCAGAT GGACGGCTCG CGCGTGCACC TGTTCGAACG CGACGCCGAA CCGGGCGGCG CCTGCGACGG GTGGGAGTAT CCCCAGCTCG GCTTCACTAT GCGCGGCGGT CGCGAGATGG ACAACCACTT CGAAGTGATG TGGGACCTGT TCCGATCCAT CCCCTCCATC GAGGACGAGA ACATGAGCAT CCTCGATTAC TACTATCAAT TGAACAAGCG CGATCCCAAT TATTCCCTGT GCCGCGCAAC GGTCAACCGG GGTGAAGACG CCGGCCTCGA CAATACATTC AACCTGTCCG ACAAGGCGTG CATGGAGATC ATGAACCTGT TCTTCACCCC CGAAGAGCAA CTCGACGACA AGCCCATAAC CGACTATTTT TCCGACGATG TGCTCGATTC CAACTTCTGG CTGTACTGGC GCACCATGTT CGCCTTCGAG AGCTGGCACA GCGCGCTCGA GATGAAGCGC TACATCCAGC GTTTCGTCCA CCACGTCGGC GGGCTCCCCG ACTTCAGCGC ACTGCGCTTC ACCCGGTACA ACCAGTTCGA ATCCCTCATC CTGCCCATGG TGAACTACCT GAAAGGCCAG GGCGTGCAGA TCCATCTGAA CACCGAGGTC GTCGACATCA CGTTCTCCAG CACCCCGGAG CGCAAAGTAG CCACCGAGGT GCAGACGATA TGCGAAGGGG CCGACAAGAC GTTTCACCTC ACCGACGACG ACCTGCTGTT CATCACCAAC GGCAGCTGCG TCGCCAATTC GTCCTTCGGA TCGCAAGACG AGCCGGCCCA GTTCAACGCC GTGCTGGAAA AAGGAACCGG ATTCGACCTG TGGCGACGTA TCGCACGGCA GGATTCCGCA TTCGGGAATC CGGAAAAATT CATCGGCGAT CCGGAGAAGT CCAACTGGAT GAGCGCCACC GTCACGACGC TCGATGAGAA GATCGTTCCC TATATCGAGA GCATTTGCCG TCGCGATCCG TTCTCGGGCG GCGTCGTAAC CGGCGGCATC GTTACCGTAA AAGATTCGAA CTGGCTTTTA AGTTGGACGT TCAACAGGCA ACCCCAGTTC AGAGCCCAGC CCGGCGATCA GCTGTGCGGC TGGATATACG GCCTGTTCAC CGATGTGCCC GGCAACTACG TGAAGAAAAC CCTGCGCGAA TGCACCGGCA AGGAAATATG CATGGAGTGG CTGTACCACC TGGGCGTTCC CGAATCGCAG ATCGAGGAGC TCGCCGAGAA CAGCGCCAAC ACGGTTCCGT GCATGATGCC CTACATCACG GCATTCTTCA TGCCGCGCGC CGCAGGCGAC CGACCCGACG TCGTGCCCGA AGGCGCGGTG AACTTCGCGT TCATCGGCCA ATTCGCGGAA ACGCCGCGCG ACACCATTTT CACCACCGAG TATTCCATGC GCACCGGCAT GGAAGCCGTC TACACGCTGT GCAACGTCGA CCGCGGCGTG CCCGAAGTAT GGGGCAGCGC GTTCGACATC CGCGATTTGC TGAACGCCAC CACGTTGATC AGGGACGGCA AGCCCATCAC CGACATGGAG ATGAATCCCC TCGAAAAGCT CGCCCTGCAC GAGGGAATCG AGAAGATCAA GGGGACCGAC CTCTACGGTC TGCTGGTCGA GTTCGGCGTG ATTCCGCCCG ATCGCGGCGA CGCGCCCGCG CCCGCCACCG GAGCCGTATA CCCCGGCATG CATTAG
|
Protein sequence | MYYSSGNYEA FARPAKPEGI EDKSAYIVGT GLAGLSAACY LIRDAQMDGS RVHLFERDAE PGGACDGWEY PQLGFTMRGG REMDNHFEVM WDLFRSIPSI EDENMSILDY YYQLNKRDPN YSLCRATVNR GEDAGLDNTF NLSDKACMEI MNLFFTPEEQ LDDKPITDYF SDDVLDSNFW LYWRTMFAFE SWHSALEMKR YIQRFVHHVG GLPDFSALRF TRYNQFESLI LPMVNYLKGQ GVQIHLNTEV VDITFSSTPE RKVATEVQTI CEGADKTFHL TDDDLLFITN GSCVANSSFG SQDEPAQFNA VLEKGTGFDL WRRIARQDSA FGNPEKFIGD PEKSNWMSAT VTTLDEKIVP YIESICRRDP FSGGVVTGGI VTVKDSNWLL SWTFNRQPQF RAQPGDQLCG WIYGLFTDVP GNYVKKTLRE CTGKEICMEW LYHLGVPESQ IEELAENSAN TVPCMMPYIT AFFMPRAAGD RPDVVPEGAV NFAFIGQFAE TPRDTIFTTE YSMRTGMEAV YTLCNVDRGV PEVWGSAFDI RDLLNATTLI RDGKPITDME MNPLEKLALH EGIEKIKGTD LYGLLVEFGV IPPDRGDAPA PATGAVYPGM H
|
| |