Gene Elen_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1993 
Symbol 
ID8416304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2335997 
End bp2338102 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content67% 
IMG OID645024970 
ProductVacB and RNase II family 3'-5' exoribonuclease 
Protein accessionYP_003182346 
Protein GI257791740 
COG category[K] Transcription 
COG ID[COG0557] Exoribonuclease R 
TIGRFAM ID[TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
[TIGR02063] ribonuclease R 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.124011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCGGA CGCGTTCCCA TACGCGCCGG CATCCCCGTA GCAACCCGCG GGGCGTGCTG 
AGCGTGCGCG GAGGCGGCTT CGGCTTCGTG CAGACGGCGG AGGGCGAGTT CTTCGTTCCG
GAATCCAAGA TGGCGGGTGC GTTCGACGGC GATCTCGTGG AAGTGGCGCC GCTGCCCGTG
CAGAGCGGCC GCAAGAAGCA GCCACATGAA CGCAAAGCAG ATCTTCGCGC AGGAGAGAAG
CCGGCGGCGC GCGTGCTGCG GGTCGTCGAC CGTGCGCACG ACACGCTCGT GGGGCGCTAC
GAGGTGGCCG AACCTTTCGG CGTGGTCGTG CCCGAAGATG CGAACATCCC CTATGACATA
TTCACCATGC GTGCCGACCG GCCCGACATC GAAGACGGAT CGCTCGTGCG CGTGCGCATC
ACGACGTTTC CTTCGCGCAA CACGGCGGCC ACCGGCGTTA TCGAGGAGGT GCTCGGGCTC
GCCGACGACG AGCACGCGGC CGTAGACGTG GTGATCGCCC GTCACAAGCT GGAGACGGTG
TTCTCCGAAG GGGCGCTTTC GGAGGCGCGA AGCGCGGTGC TCGACGAGGA CGGCGCGTTG
GCGTCGGGGT ACCGCGACTT GAGGGAGCGT TTCACGTTCA CCATCGATCC GGCCGATGCG
AGGGACTTCG ACGATGCGGT CAGCCTGGAG CCCGTTTCGA CCTGCGGGGT TGGAACCGGC
ATCCGCGTGG TGGACGAGCG CGGGCTCGGG GTGGCGCGTT GGCGGTTGGG CGTGCATATC
GCCGACGTGG CGCACTACGT GCCGTGGAAT TCCTCGCTCG ACCTCGATGC GCGCAGGCGG
GCGACCAGCG TGTACCTCGT CGACCGTGTG ATTCCCATGC TGCCCGACGA GTTGTCGGGC
GATCTGTGCT CGCTCAAGCC CGACGAAGTG CGTCGTACGA TGACGGCCGA CCTCTACCTC
GACGATCGGG CGCGGCTCGT GGCCTACGAT CTGTATCCGG CGCTCATCCG CTCGCATGCG
CGCCTGTCGT ACGAGGAAGC GCAGGCGCTG CTGGACCGTT GTCATCCTGA GCGGAGCGCC
GAAGGCGCGG AGTCGAAGGA TCTCGTGCGG CGCGACGGCG CTTGCGCGCC GGGCGACGGG
CTTTCCGACC GCCTCGCCGC GCTGTCGCGC CTGGCGAAGC AGCGGTTCGC CTCCCGCGAG
AGGGCGGGCG GCCTCGACTT CGACTCGGTC GAGGCGCGCG TGGTTCTCGA TGAGGAGGGG
AGTCCGACGG GTATCGACTT GCGCGTCAAG ACCGACGCGA CCTCGCTCAT CGAAGAGGCC
ATGATCCTTG CGAACGAGAC CGTCGCCAAG CACCTGCGCG ATGCGAAGTT CCCCAGCCTG
TACCGCGTCC ACGAGCAGCC TTCGGCCGAC AGCTTGGCGG CGCTCGTGCC CGTGTTCCAG
GACTTCGCGT GGTTCCGCGA CATCGATCAA GCCGACTTCA TCGCGGGCGA TGCGCACGTC
GTCCAGCGCG TGCTGGAGGC GAGCGCGGAG CGTCCCGAGG GCGAGCTCGT GTCGACGCTC
CTGCTGCGGT CGATGAAGCG GGCGGTGTAC CGTCCCGACT GCGCTGCGCA CTACGGCCTG
GCCAGCGGGG CCTACACGCA TTTCACGTCT CCCATCAGGC GCTATCCCGA CCTTGTGGTG
CATCGTATGC TCAAGGCCCT CATCGGCGGA CGCCCGGAGA AGTTCGACCA GGAAAAATCG
GCGCTGCCAT GGATCGCCGA GCACTCCTCC GACATGGAGC GCATAGCCGA GAAGGCTGCG
CGCGAGTCGC AGGAGGTCAA GCTCATCGAG TATCTCGAGC GGTTCGTGGG GCAGACGTTC
TCGGCGACGG TGTCGGGGGT GGCGACGTAC GGCGCGTACG TGCGCCTCGA CAACACGGCC
GAAGGTTTGA TCCCGTGCAA GAACCTTGGT TCGGAGTACT TCGCGCTCGA TCCTGTGCAG
CATCGTCTGA CCGGGCAGGA CACGGGCGCG TCGTATCGCC TGGCGCAGCG GCTCGCCGTC
GTGCTCGTCT CTGCCGATCC GCGTGCTCGG CGCTTGGACT TTCGTCCTGC CCGCGACGAG
CGATAG
 
Protein sequence
MGRTRSHTRR HPRSNPRGVL SVRGGGFGFV QTAEGEFFVP ESKMAGAFDG DLVEVAPLPV 
QSGRKKQPHE RKADLRAGEK PAARVLRVVD RAHDTLVGRY EVAEPFGVVV PEDANIPYDI
FTMRADRPDI EDGSLVRVRI TTFPSRNTAA TGVIEEVLGL ADDEHAAVDV VIARHKLETV
FSEGALSEAR SAVLDEDGAL ASGYRDLRER FTFTIDPADA RDFDDAVSLE PVSTCGVGTG
IRVVDERGLG VARWRLGVHI ADVAHYVPWN SSLDLDARRR ATSVYLVDRV IPMLPDELSG
DLCSLKPDEV RRTMTADLYL DDRARLVAYD LYPALIRSHA RLSYEEAQAL LDRCHPERSA
EGAESKDLVR RDGACAPGDG LSDRLAALSR LAKQRFASRE RAGGLDFDSV EARVVLDEEG
SPTGIDLRVK TDATSLIEEA MILANETVAK HLRDAKFPSL YRVHEQPSAD SLAALVPVFQ
DFAWFRDIDQ ADFIAGDAHV VQRVLEASAE RPEGELVSTL LLRSMKRAVY RPDCAAHYGL
ASGAYTHFTS PIRRYPDLVV HRMLKALIGG RPEKFDQEKS ALPWIAEHSS DMERIAEKAA
RESQEVKLIE YLERFVGQTF SATVSGVATY GAYVRLDNTA EGLIPCKNLG SEYFALDPVQ
HRLTGQDTGA SYRLAQRLAV VLVSADPRAR RLDFRPARDE R