Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_0016 |
Symbol | |
ID | 8414295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 21777 |
End bp | 23657 |
Gene Length | 1881 bp |
Protein Length | 626 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 645022991 |
Product | Ricin B lectin |
Protein accession | YP_003180399 |
Protein GI | 257789793 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.111216 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCAG GATTCAGACC CCGCTGGATG GACTGCGACG AGCCGGGATG GAACCCCGAC AGCCGTAGGG ATTCCACTGC CGAACCCGAA CAAGCCAGCG ACCGCATGGC TCTGGCACTG CCGTCCGAAT GCATCGTCTA TCCCATCCGC TACACGATGC ACGTCAACGC GGAGGGATCC GAAAGAGACA GGAAGAGCGC CAGCTTGCTG GACGGCAAAA CCGAGTCCGA CACCCCGCTG GTGCTGACTA CGATGACGGA AGAAAGCTCC ACGTTTATCC TGGTTTCTTT CGACGACGAC AGCTATCTGA TCATCGACGA GCGGTCGTCG GGAGTTTTGC AGATCGAGCA CGCATCGGAC GACGCATACG CCCGCATCGT GCTAGGCGCA TACGAAGAAG GGGCTACCCA CCAGCTTTGG CGTTTCGAGC CAACCGGCCA CGGATCGTAT TATCTCGTGT CTAAGAGAAA CGGGCTGGTT GTCGATCTGT GCAACTTCCA GCTGGCGGAT GGAAGCCCCC TTATCGCCTA CCCGAGGAAC AACGGATGGA ACCAGCAGTT CCATTTCGAC TGCGCCGGCA CCCTGGATAC GGATCTCTCC GCCTTGCCGG AAGAGGCTGA CGACTCGGGC GAGAGCGAAC TTCTGTGCGC GGACGACGGA AGTTCGAAGC CCCTCAGCTT TTTCGACCTC TTCAAAACCA TCGAACCCAC GTTTTCCCTC GACCAGATGA CCGACACGGA ATCCTGGGGC TCATACCTCT CCGCCCCCGA AGCCGAGTAT CATCCGTTCG CGGGGGATGA AGCCACGTTG TTGCTGATGT CGAAGAAATC CGAACGGATC AACGTCCGAT GCAAGCACAC GAAGCCGCTC ATCCCCGAAT ACATGAAAGA GAGAACCATC AACTACCTCG TTTTGACGAA CTTCCTTCCC GAAGGCGTGG AGCCCGCCCG GTTCGTCAAC GATCGCCCGA TTCCGGCGAA GGGAGAGCAC CCGGCGCCCG ATCCGTATCT CTACACCCTT TCGTCTTGCT TCGACATGAT CTCGATCTCG CAGCTCACCG AAGAATCGCT TCGAAAGCTT CCCGATGGAA CCGCCCTCAA TAAAGTTCCC GCGTTCGAGG GCTACTTCCT GACGTACGAC CAGAAAACCA CCTCTCTCGA CCAAGAGGAG GTCATGCACG GTATCTTCAA CGACCTGGAG ATTTGGGGGC GCGATCACAA AACCAAAACA CCTCTCAACA TCGACCTGGT GATATTCCTC TCCAACACGA ACGACGGCGG GAGCAGCGCC CAGTTCATCG ACATGAACGA AATACGCATC CCGAAGCCTG GCGAACTTCA AGAGGGCTTC GACCACAGGC GCTGCATGGC GATACGCTTA AATCTAGGAA TGCTGAACAA GAAAGACGGT TTCCAGCGGG ACGTCGTGCC CGTTCGCAAC AACCCGTTTC GATTCGATCC GCTGTACGGA TTCGTCACCT CATGGAGAAG GAAGGGGTTC AGGCACTTCT CCCATGCCGA CTACATCACG AAGAGGAAGC TTGTGAGGGC GTGGGCGTAC CTCGTCAAAC TCGAAGAAAT AGTCGGGAAT CAAACGCTCG CCCCCGTGTT CTTCCACGAG TTCAGCCATG TGATCGAGTT CACCCTCAAC AGCGAGGGAA GCTCGCTCGG CGGTGCGATC TTCAACCCGG ACGATTCGGG GATAAAGCTG AGTTCGCTGG GCATCAACCA TCAGCAATTC GCGATGAGAA TCGTGTTGAA CGGAGCATAC GAGAACATGC ATATAAGCCA TTCGAAAATG TTGTCCCATC TTGGCGTCGA CTACGACTAT CTGCACGAGC GGCTTGATCG TTTTTGGCGT TCATGGGGAA AGACCTTCTA A
|
Protein sequence | MSSGFRPRWM DCDEPGWNPD SRRDSTAEPE QASDRMALAL PSECIVYPIR YTMHVNAEGS ERDRKSASLL DGKTESDTPL VLTTMTEESS TFILVSFDDD SYLIIDERSS GVLQIEHASD DAYARIVLGA YEEGATHQLW RFEPTGHGSY YLVSKRNGLV VDLCNFQLAD GSPLIAYPRN NGWNQQFHFD CAGTLDTDLS ALPEEADDSG ESELLCADDG SSKPLSFFDL FKTIEPTFSL DQMTDTESWG SYLSAPEAEY HPFAGDEATL LLMSKKSERI NVRCKHTKPL IPEYMKERTI NYLVLTNFLP EGVEPARFVN DRPIPAKGEH PAPDPYLYTL SSCFDMISIS QLTEESLRKL PDGTALNKVP AFEGYFLTYD QKTTSLDQEE VMHGIFNDLE IWGRDHKTKT PLNIDLVIFL SNTNDGGSSA QFIDMNEIRI PKPGELQEGF DHRRCMAIRL NLGMLNKKDG FQRDVVPVRN NPFRFDPLYG FVTSWRRKGF RHFSHADYIT KRKLVRAWAY LVKLEEIVGN QTLAPVFFHE FSHVIEFTLN SEGSSLGGAI FNPDDSGIKL SSLGINHQQF AMRIVLNGAY ENMHISHSKM LSHLGVDYDY LHERLDRFWR SWGKTF
|
| |