Gene Elen_1243 details


Gene Information

Locus tag: Elen_1243
Symbol:
ID: 8415535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1489747
End bp: 1492653
Gene Length: 2907 bp
Protein Length: 968 aa
Translation table: 11
GC content: 72%
IMG OID: 645024207
Product: ATP-dependent nuclease subunit B-like protein
Protein accession: YP_003181602
Protein GI: 257790996
COG category: [L] Replication, recombination and repair
COG ID: [COG3857] ATP-dependent nuclease, subunit B
TIGRFAM ID:
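
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function name `check_gene_record` is hypothetical and not part of any database API.

```python
def check_gene_record(start_bp: int, end_bp: int,
                      gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates and lengths agree with each other.

    Assumptions (not stated in the record itself):
    - start/end are 1-based and inclusive;
    - the CDS includes exactly one terminal stop codon.
    """
    # Span covered by the inclusive coordinate pair.
    span = end_bp - start_bp + 1
    if span != gene_length_bp:
        return False
    # A CDS should be a whole number of codons.
    if gene_length_bp % 3 != 0:
        return False
    # All codons except the stop codon encode one amino acid each.
    return gene_length_bp // 3 - 1 == protein_length_aa


# Values copied from the record above.
consistent = check_gene_record(1489747, 1492653, 2907, 968)
print(consistent)
```

With the values listed here, the span is 1492653 - 1489747 + 1 = 2907 bp, which is 969 codons, that is 968 amino acids plus one stop codon.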


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0167838
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTTC AACATCACAT ACTCGACCAT GCGGGCGACG TGCTGGACGC CGCCCGCGCA 
TATCTGGAAT CGTGGCTGGC GGCGGGTCTC GCGCCCGTGC TGCTGGTTCC CAGTCCCGCC
GCTGCCGACC GCGTGCGCCG CGCGCTCGCG GAGGGGCCGT GCGCGCTGGG AGCGCGCGTG
GAGACGCCTA TGATGTGGGT GCGCGACAGG TGGGATCTGT TCGGCGACGG GCGCCGCATC
GTGTCGTCGT TCGATCGCGC ATTGCTCGTG CGCCGCGCCT GCTTGGAGAC CGAACGGACG
GCGCTTGAAG CCACGCCGGG CGTCGTCGGC CTGCTCGCGC GCTTGGCGCG CGAGGCGTTG
CCGCATCTGA CGGCGCCGGT CGCCGCAAAC GCCGGTCTGA GCGCGGCCGA GCGCGATGCG
CTCATGGTGC TCGAGCGTTA CGCGGAGCTT CTGCAGGAGG CGGGGTGCTG CGAATCGTCG
CAGGCGGCGG CTCTGCTTCC CGAGGCTATG GAGGCGCCCG CGCCCCTTGT GCTGGCCGGG
TTCGACGACC TTGACTGCGC TGAGGAGCGG ATGACGGCCG CGCTCGCGCA GCGGACCGAC
GTCGTTCGCC TCGATGACGG CTGCCGCGCC CCGTCGGGTG CCGACGGGCG CGCCTCCGAG
CTGCAAGGGC TGTTGGAGCG TCTGTTCAAG CCCGCGATGC CCGAGCCTCT GCGCCCGACG
GGGCGGGTGC GCTTCTTGCT TCCGGCGGGA CGCTACGCCG CGCCGACCCT CGTGGCGAGC
ACGGTGGCGC AGGCCGTGGC CGACGAGCGC CTCCGCGCGG TCGAGGAGGG GCGCGATCCG
CTGCCCGTTG CGGTGACTGC CCGCGATCCT CGCGCGTTGT TCGACGACGT TGCGGGCGCG
CTGCTCGAAG CGGGCGCCGC CTCGGCCGTG AGTGCGAGCC GGGCGTTCGC CGACACCGCG
TTCGGACGCG CGTTTCTCGC TCTGCACGCG TTCGCCTGCG GCGCATGGAG CATCGCGCAG
GCGTCCGACT TCGCGCTCGG ACCGTTCTCG GGCATCGGGA TCCGAACGGC CTGCGAGCTG
GATGCCGCGT GGCGCGGGGA CCGCACGGTC GAGCGCGCGC GCATCGCGGC CGATCTCGCC
CGCGAAAGCG AGGCGGCGGC CGACGCGCTG ACGGCGCTGG AGGCGGGCGA CGCGGACGGC
GCGCTCGCGG GGTTCGAGGC GCGCCTGCGC GCACGTACGG ACCTCGATCC GGCGTTTCGC
GCGGAGCAAC TGGCGGCCGT GTCGTGCGCG CGCAGGTTCG CGGCGGCATG CAGTCGCGCG
AACGCGGTGT TCGACGGCGC ATTGCCGTTG CTGGAGCGCA TGCCGGTTTC CTCAAGCGCT
CGGTCGGCGC GCGACGAAGG CGCGCGCCCC GACGCGCTGT TCATGTCGCT CGACGAGGCG
GCCGAGCTTC CCGCGTGCTC TTGCGCCGCG CTCGTGCTGT GCGATCTCGA CGCGGGCTCC
TACCCGGTGC GGCTCGTCGA GGACGGCGGC ACGCTGCTTC TGGAGAAGCT GGGCCTCGGC
CGCCCCGCAG ACGCGCTCGC CGCCTCCCGC CGCCGATTCT TCCGCGCGCT TTCGAGCGCC
TGCGAGACGG TGGTGTGCGA GCGCGTGCTG AACACCGAGG ATGCGAACGA GGCGTATCCG
TCGGTTATGT TCGAGGAGCT GCTCGACTGC TACCGGGAAG CGGATGCGGG CGAGGACGAC
CGGGCGACGG GGCTTCCCAA GCCGCTCGTC GCCTTCGCCA TGCTGGCAGG GGAGGACGCG
CTCCACGACA ACCTCGCGCT CGTCTGCGAG GGTGTGCAGC CGCGCGTAGG CAAAGGAGGG
GCGACGCTGT CCTGGGAGCT GTCGGCCGCG GGCGCGGTGT CGCCCGAGCA GCGCCCGCGC
ATCGTGCTGC CGCGCTCGCC GCGCGAGGGG CTGCGCGGCG CCGACGCCGC GTTGGCGCTG
TCGCCGTCTG CGCTGGAAAG CTACCTTGAA TGTCCCTACA AGTGGTTCGC TCTGCGCCGT
CTTCGCCTGT CGGAGCCGGA CGCCGGGTTC GGGCCTCTCG AGATGGGCAG CTTCTCCCAC
AACGTGCTGC GTAGCTTCTA CGAGCATTTC CGCGAGGCGG GGCATGCGAA GGTGGACTCC
GCGACGCTTC CCGAAGCGCG CGCGCTTCTA GGCGAGACGT TCGACCGCCA TCTGGAATCG
CAACGCAACC TCAAGCGCTC GGTCAACCCG TTGATCCCGC GCACCGCCTT CGAACGCGCC
GAGACGGCCG ATCTGAGGAA GAGGCTCGTC CGGTTCCTCG ACCGCGAGGC GCTGCTGCTG
CCGGGCTTCG AGCCCGTGCG CTTCGAGTTC GATTTCGGCT CGAGCGAGCC GTTCCCGTAC
GCAGGATGCC TGCTGCGCGG CAGTGTGGAC CGCATCGACG TGAACGGGGC CGGCCAGGCG
GTGGTTATCG ACTACAAGGG GTCGCTCAAT GGCGACTACG CGCTGGATTC GGCATCGCCC
GCCGCGCAGG CGGGCGGCGC GGTGCTTCCC CATAAGGTGC AGACGCTCAT GTACGCTCAG
GTGGCGCGCA AGGTGCTGGG TCTCGACGTG GTGGGTGCGC TGTACGTGTC GTACGGCCGC
GATCGGCGCG TCTCGGGCGC CTTCGACCGC ACGGTCGTGG GCGAGCGAGA CGTGCCCGGC
ATCGACGTCG AGCGCTGCGG CGTGCCGGGG CCGGCAGGCG AGGCGCTGGG CGTCTCGTCG
TTCGGCGAGC TGGTGGACGC GGTGGAGGAT CGCATCGCCC GGGCCGTGCG CACGCTGGCC
GACGGCTGCA TCGGTCCCGA TCCGCGCGGG GGAGACCCGT GCGGGTATTG CCCCGTGCTC
GCCTGCGAGA AAAGGATGGG CGCATGA
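
The gene sequence above is laid out in blocks of 10 bases separated by spaces. A minimal sketch of how it could be joined into one string and summarized is shown below; the helper names `clean_sequence` and `gc_fraction` are illustrative, and the sample input is only the first row of the sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Strip spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGCCTTTTC AACATCACAT ACTCGACCAT GCGGGCGACG TGCTGGACGC CGCCCGCGCA"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Applied to the full 2907 bp sequence, the same functions should give a length matching the Gene Length field and a GC fraction comparable to the listed GC content; that comparison has not been run here.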
 
Protein sequence
MPFQHHILDH AGDVLDAARA YLESWLAAGL APVLLVPSPA AADRVRRALA EGPCALGARV 
ETPMMWVRDR WDLFGDGRRI VSSFDRALLV RRACLETERT ALEATPGVVG LLARLAREAL
PHLTAPVAAN AGLSAAERDA LMVLERYAEL LQEAGCCESS QAAALLPEAM EAPAPLVLAG
FDDLDCAEER MTAALAQRTD VVRLDDGCRA PSGADGRASE LQGLLERLFK PAMPEPLRPT
GRVRFLLPAG RYAAPTLVAS TVAQAVADER LRAVEEGRDP LPVAVTARDP RALFDDVAGA
LLEAGAASAV SASRAFADTA FGRAFLALHA FACGAWSIAQ ASDFALGPFS GIGIRTACEL
DAAWRGDRTV ERARIAADLA RESEAAADAL TALEAGDADG ALAGFEARLR ARTDLDPAFR
AEQLAAVSCA RRFAAACSRA NAVFDGALPL LERMPVSSSA RSARDEGARP DALFMSLDEA
AELPACSCAA LVLCDLDAGS YPVRLVEDGG TLLLEKLGLG RPADALAASR RRFFRALSSA
CETVVCERVL NTEDANEAYP SVMFEELLDC YREADAGEDD RATGLPKPLV AFAMLAGEDA
LHDNLALVCE GVQPRVGKGG ATLSWELSAA GAVSPEQRPR IVLPRSPREG LRGADAALAL
SPSALESYLE CPYKWFALRR LRLSEPDAGF GPLEMGSFSH NVLRSFYEHF REAGHAKVDS
ATLPEARALL GETFDRHLES QRNLKRSVNP LIPRTAFERA ETADLRKRLV RFLDREALLL
PGFEPVRFEF DFGSSEPFPY AGCLLRGSVD RIDVNGAGQA VVIDYKGSLN GDYALDSASP
AAQAGGAVLP HKVQTLMYAQ VARKVLGLDV VGALYVSYGR DRRVSGAFDR TVVGERDVPG
IDVERCGVPG PAGEALGVSS FGELVDAVED RIARAVRTLA DGCIGPDPRG GDPCGYCPVL
ACEKRMGA
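
The record lists translation table 11 (the bacterial, archaeal and plant plastid code), whose amino acid assignments are the same as the standard code. The sketch below shows one way the protein sequence could be reproduced from the gene sequence; `translate` is a hypothetical helper written for this illustration, it stops at the first stop codon, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    # Walk the sequence codon by codon, ignoring a trailing partial codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGCCTTTTC AACATCACAT ACTCGACCAT"))  # MPFQHHILDH
print(translate("GGCGCATGA"))                          # GA
```

The two fragments translate to the first ten and last two residues of the protein sequence shown above.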