Gene Elen_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1559 
Symbol 
ID8415857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1854105 
End bp1855139 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content69% 
IMG OID645024527 
Productpseudouridine synthase, RluA family 
Protein accessionYP_003181916 
Protein GI257791310 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.627219 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0055148 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCCGCA TGTTGAGCTA CGTCGCAGCA CCCGACGACG CGGGTCAGCG CCTCGATGCG 
CTTTTGGCCG CGCGCGGTCT GTATCCCAGC CGTAGCGCGG CTGCGCGCGC GGTGGACGAC
GGCCTCGTGT TCGTGAACGG CGCGGAGGTT GCGAAGAAGC ATCCCGTGGC GCCGGGCGAC
ACGATCGTGT ACCAGGTTGA GGAGCCGGTA GAGCCCGGCC CTTTGCGCGG CCAACCCATC
GATCTGGACA TACGCTTCGA GGACGAAGAC CTCATCGTGC TGTCGAAGCA GGTGGGGCTC
GTGTGCCATC CGTCGGTCGA CCATGACGAC GGCACGCTGG TGAACGCCCT CATCTACCAC
TGCGGCGCCG AGAACCTGTG CAACGTGCAG GGCGAGGACG ACCGTCTGGG CATCGTGCAC
CGCCTCGACC GCGACACGAG CGGCCTCATG CTGGCGGCGA AGAACGACGA GACGGGCTAT
GCCCTCATGT CGGACATCCG CGATCGCGCG GTCGACCGAC GTTACCTGGC GCTCGTGCAC
GGCGTGATCG CCCACGACAC CGGTATGATC GACGCTCCCA TCGCGCGCGC CGAGAAGGAG
CGCACGCGCA TGGCCGTCCG CGACACGCCG TCGGCTCGCG AGGCCATCAC GACGTTTCGG
GTGCTCGAGC GCTTCGAGCA CGGGGCGCGC GACGACGGCT ACACGCTCAT CGACTGCAAG
CTGTTCACAG GGCGCACCCA TCAGATACGC GTGCATCTGG AGTACGCGAA GCACCCTCTT
GTGGGCGACC CGGCGTACAC GTCGGGCGCG CCGAGCGCGC CTGCGGCCGA CCTCGGCCTC
GACCGCCAGT TCCTGCACTC GTTCCAGCTG TCGTTCCAGC ATCCCGTCAC GGGGGAGGGC
CTGCGCTTCG CGGACAACCT GCCCGTCGAC CTGCAGGAAG CGCTCGACGA CCTCGCCTCC
CGCAGCACGG GCCGCACGAC GGCGGGGGAG GAAGTGCGAG CGTTGCTGGA AGACGCCCCG
AGGCCGCGGC TGTAG
 
Protein sequence
MSRMLSYVAA PDDAGQRLDA LLAARGLYPS RSAAARAVDD GLVFVNGAEV AKKHPVAPGD 
TIVYQVEEPV EPGPLRGQPI DLDIRFEDED LIVLSKQVGL VCHPSVDHDD GTLVNALIYH
CGAENLCNVQ GEDDRLGIVH RLDRDTSGLM LAAKNDETGY ALMSDIRDRA VDRRYLALVH
GVIAHDTGMI DAPIARAEKE RTRMAVRDTP SAREAITTFR VLERFEHGAR DDGYTLIDCK
LFTGRTHQIR VHLEYAKHPL VGDPAYTSGA PSAPAADLGL DRQFLHSFQL SFQHPVTGEG
LRFADNLPVD LQEALDDLAS RSTGRTTAGE EVRALLEDAP RPRL