Gene Elen_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_0016 
Symbol 
ID8414295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp21777 
End bp23657 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content56% 
IMG OID645022991 
ProductRicin B lectin 
Protein accessionYP_003180399 
Protein GI257789793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.111216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCAG GATTCAGACC CCGCTGGATG GACTGCGACG AGCCGGGATG GAACCCCGAC 
AGCCGTAGGG ATTCCACTGC CGAACCCGAA CAAGCCAGCG ACCGCATGGC TCTGGCACTG
CCGTCCGAAT GCATCGTCTA TCCCATCCGC TACACGATGC ACGTCAACGC GGAGGGATCC
GAAAGAGACA GGAAGAGCGC CAGCTTGCTG GACGGCAAAA CCGAGTCCGA CACCCCGCTG
GTGCTGACTA CGATGACGGA AGAAAGCTCC ACGTTTATCC TGGTTTCTTT CGACGACGAC
AGCTATCTGA TCATCGACGA GCGGTCGTCG GGAGTTTTGC AGATCGAGCA CGCATCGGAC
GACGCATACG CCCGCATCGT GCTAGGCGCA TACGAAGAAG GGGCTACCCA CCAGCTTTGG
CGTTTCGAGC CAACCGGCCA CGGATCGTAT TATCTCGTGT CTAAGAGAAA CGGGCTGGTT
GTCGATCTGT GCAACTTCCA GCTGGCGGAT GGAAGCCCCC TTATCGCCTA CCCGAGGAAC
AACGGATGGA ACCAGCAGTT CCATTTCGAC TGCGCCGGCA CCCTGGATAC GGATCTCTCC
GCCTTGCCGG AAGAGGCTGA CGACTCGGGC GAGAGCGAAC TTCTGTGCGC GGACGACGGA
AGTTCGAAGC CCCTCAGCTT TTTCGACCTC TTCAAAACCA TCGAACCCAC GTTTTCCCTC
GACCAGATGA CCGACACGGA ATCCTGGGGC TCATACCTCT CCGCCCCCGA AGCCGAGTAT
CATCCGTTCG CGGGGGATGA AGCCACGTTG TTGCTGATGT CGAAGAAATC CGAACGGATC
AACGTCCGAT GCAAGCACAC GAAGCCGCTC ATCCCCGAAT ACATGAAAGA GAGAACCATC
AACTACCTCG TTTTGACGAA CTTCCTTCCC GAAGGCGTGG AGCCCGCCCG GTTCGTCAAC
GATCGCCCGA TTCCGGCGAA GGGAGAGCAC CCGGCGCCCG ATCCGTATCT CTACACCCTT
TCGTCTTGCT TCGACATGAT CTCGATCTCG CAGCTCACCG AAGAATCGCT TCGAAAGCTT
CCCGATGGAA CCGCCCTCAA TAAAGTTCCC GCGTTCGAGG GCTACTTCCT GACGTACGAC
CAGAAAACCA CCTCTCTCGA CCAAGAGGAG GTCATGCACG GTATCTTCAA CGACCTGGAG
ATTTGGGGGC GCGATCACAA AACCAAAACA CCTCTCAACA TCGACCTGGT GATATTCCTC
TCCAACACGA ACGACGGCGG GAGCAGCGCC CAGTTCATCG ACATGAACGA AATACGCATC
CCGAAGCCTG GCGAACTTCA AGAGGGCTTC GACCACAGGC GCTGCATGGC GATACGCTTA
AATCTAGGAA TGCTGAACAA GAAAGACGGT TTCCAGCGGG ACGTCGTGCC CGTTCGCAAC
AACCCGTTTC GATTCGATCC GCTGTACGGA TTCGTCACCT CATGGAGAAG GAAGGGGTTC
AGGCACTTCT CCCATGCCGA CTACATCACG AAGAGGAAGC TTGTGAGGGC GTGGGCGTAC
CTCGTCAAAC TCGAAGAAAT AGTCGGGAAT CAAACGCTCG CCCCCGTGTT CTTCCACGAG
TTCAGCCATG TGATCGAGTT CACCCTCAAC AGCGAGGGAA GCTCGCTCGG CGGTGCGATC
TTCAACCCGG ACGATTCGGG GATAAAGCTG AGTTCGCTGG GCATCAACCA TCAGCAATTC
GCGATGAGAA TCGTGTTGAA CGGAGCATAC GAGAACATGC ATATAAGCCA TTCGAAAATG
TTGTCCCATC TTGGCGTCGA CTACGACTAT CTGCACGAGC GGCTTGATCG TTTTTGGCGT
TCATGGGGAA AGACCTTCTA A
 
Protein sequence
MSSGFRPRWM DCDEPGWNPD SRRDSTAEPE QASDRMALAL PSECIVYPIR YTMHVNAEGS 
ERDRKSASLL DGKTESDTPL VLTTMTEESS TFILVSFDDD SYLIIDERSS GVLQIEHASD
DAYARIVLGA YEEGATHQLW RFEPTGHGSY YLVSKRNGLV VDLCNFQLAD GSPLIAYPRN
NGWNQQFHFD CAGTLDTDLS ALPEEADDSG ESELLCADDG SSKPLSFFDL FKTIEPTFSL
DQMTDTESWG SYLSAPEAEY HPFAGDEATL LLMSKKSERI NVRCKHTKPL IPEYMKERTI
NYLVLTNFLP EGVEPARFVN DRPIPAKGEH PAPDPYLYTL SSCFDMISIS QLTEESLRKL
PDGTALNKVP AFEGYFLTYD QKTTSLDQEE VMHGIFNDLE IWGRDHKTKT PLNIDLVIFL
SNTNDGGSSA QFIDMNEIRI PKPGELQEGF DHRRCMAIRL NLGMLNKKDG FQRDVVPVRN
NPFRFDPLYG FVTSWRRKGF RHFSHADYIT KRKLVRAWAY LVKLEEIVGN QTLAPVFFHE
FSHVIEFTLN SEGSSLGGAI FNPDDSGIKL SSLGINHQQF AMRIVLNGAY ENMHISHSKM
LSHLGVDYDY LHERLDRFWR SWGKTF