Gene Elen_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1784 
Symbol 
ID8416088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp2090146 
End bp2092359 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content67% 
IMG OID645024755 
Productexcinuclease ABC, C subunit 
Protein accessionYP_003182138 
Protein GI257791532 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATG CGGGCGGCGT CGAGCGCCTG ACGGGCGACG CCTCGCTCGA TGCGGGGGAG 
CTTCGCGACG ACGAGCGCCT CGAACGGGAA GGTCGCCGCC GGCCATCGCT CGAACGTGCC
GCGCGCGAGC ACGACGCCCA CGCCGCGCGC GCCGAAACCG GCCAGCGCGC CAAGCTCGAG
CGCATCAAGG AAGAGCTGAA CAGCGTGCCC ACGCTGCCGG GAGTGTACCT TTGGAAGGAC
AAATCAGGCC AGGTCATCTA CGTGGGCAAG GCCAAGCAGC TGCGCGCCCG CATGCGCCAG
TACGTGAACT TCCAGGACGA GCGCGCGAAG ATCCCTTTGC TGGTCGACCA GATCGACAGC
TTCGACTACA TCGTGGTGGA GAACGAGCAC GAGTCGCTCG TGCTGGAGAA GAACCTCATC
AACCAGCACG CGCCGTTCTT CAACGCCGAC TTCAAGGACG ACAAGTCCTA CCCGTTCATC
GCGCTCACGA AGGGCGATGT GTTCCCCGCC ATCAAGTACA CGCGCGAGAA GCACCGCGCG
GACACCAAGT ACTTCGGCCC CTACACCGAC AGCCGCGCCG CCCGCGACAT GGTGGACATC
GCCCGGCGCG TCGTGCCCCT GTGCGCCACG TCGTGCGCCG ACTGGCGCCA GCTCAAGCGC
CGTTTGGAGA AGGATCCGCT CGCGCTCATG TCGCACGACG CCCGCCCGTG CTTCGACGCG
CACGTGGGCC TGGGCCCCGG CGCCTGCTGC GGCGGCATCA CGCCCGAGGA CTACCGCGTC
CACGTCAAGC GCATCGAGCG CTTCCTGTCG GGCCAGCACC GCGAGTTCGT CGACGAGCTG
CAGGCCGAGA TGCAGGAGGC GGCGGCCGAG CTGGACTTCG AGCGCGCCGC GCGCATCAAA
GCGCGCATCG ACACCATCAA CAGCCTCACC GACAAGCAGC ATGCGGTGTC AACGCGCAAC
CTCGACGCCG ACGTCGTGGG CCTGTTCCGC GAGGAGACGG TGGCCGGGGT GCACGTGTTC
ATGGTGCGCG AAGGGCGCAT CATCAACTCC AACGAGTTCG TGCTCGACCG CGGCAAGGAC
GTGCCCGACG ACGACCTCCT GCACATGTTC CTGCTGCGCT ACTACGATGC CACCACGTCC
ATCCCGCACG AGGTCATCCT GCGCGACGAG CCCGAGGACA AGGCTGCCAT GGAGGCATGG
CTCACCGAAA AGCTGGCCAG CCCCTACGGG GCGAAGGTGC GCATCACCGC GCCGCAGAAG
GGCGAGAAGG CCGAGCTCGT GGGCATGGCC GAGACGAACG CGAAGCACAC GCTCATGCGC
TACAAGGTGC GCACGAACTA CGACGACAAG CGCATCAACA ACGCGCTTCT GCAGCTGGAG
AGCGCACTGG CCCTGGACGA GCCGCCCATG CGCATCGAGT GCTTCGACAT CTCCACCATC
CACGGCTCCT ACACGGTGGC CTCGATGGTG GTGTTCACGA ACGGCAAGCC CGACAAGAAC
CAGTACCGCC GCTTCAAGAT CAAGACGCCG CTCGACGAGG CGAACGACTT CCTGTCCATG
CAGGAGGTCA TGAGCCGCCG CTACGCGCCC GAGCGCATGG CCGACGAGCG CTTCGGCAGC
AAGCCCGACC TCATCATTCT CGACGGCGGC AAGCCGCAGC TGTCGGCGGC GCTCGAGATG
TTCGAGCGGA TGGGCATCGA CGACATCGCT ATGTGCGGTC TGGCCAAGCG CGACGAGGAG
CTGTTCGTGC CCTGGCAGGA CACGGGCCCG GTGGTGCTGC CCAGCGGCTC GGCGTCGCTG
TACCTCGTGA AGCAGGTCCG CGACGAGGCG CACCGCTTCG CCATCACGTT CCACCGCGAG
CTGCGCGGCA AGGGCATGAC GGCCTCCATC CTCGACGACG TGACGGGCAT GGGGCCGGTG
CGCAAGAAGG CGCTGCTGAA GGCGTTCAAG TCGTTCAAGA ACCTGAAGAG CGCCACGCTC
GAAGAGATCA AGGAGGCGCG CGTCGTCCCC GTCGAGGTGG CCGAGGAGCT GTACCGGGTG
CTGCAGCAGT ATAATAGGGA GCGGAAAGAC GAGCGCGTGG TGGGCGGCGA GGCCGGAGAG
GCCGTCTGCC CGCCCGCAGG CGAGGGCCCG GCGGCGGTCG AATCCGCCGC GGTGGACGCG
GCGGTCGAGG CTGCCGTGCA GGGCGAACGC ACGCGCACGG ATACGGAAGG ATAG
 
Protein sequence
MPDAGGVERL TGDASLDAGE LRDDERLERE GRRRPSLERA AREHDAHAAR AETGQRAKLE 
RIKEELNSVP TLPGVYLWKD KSGQVIYVGK AKQLRARMRQ YVNFQDERAK IPLLVDQIDS
FDYIVVENEH ESLVLEKNLI NQHAPFFNAD FKDDKSYPFI ALTKGDVFPA IKYTREKHRA
DTKYFGPYTD SRAARDMVDI ARRVVPLCAT SCADWRQLKR RLEKDPLALM SHDARPCFDA
HVGLGPGACC GGITPEDYRV HVKRIERFLS GQHREFVDEL QAEMQEAAAE LDFERAARIK
ARIDTINSLT DKQHAVSTRN LDADVVGLFR EETVAGVHVF MVREGRIINS NEFVLDRGKD
VPDDDLLHMF LLRYYDATTS IPHEVILRDE PEDKAAMEAW LTEKLASPYG AKVRITAPQK
GEKAELVGMA ETNAKHTLMR YKVRTNYDDK RINNALLQLE SALALDEPPM RIECFDISTI
HGSYTVASMV VFTNGKPDKN QYRRFKIKTP LDEANDFLSM QEVMSRRYAP ERMADERFGS
KPDLIILDGG KPQLSAALEM FERMGIDDIA MCGLAKRDEE LFVPWQDTGP VVLPSGSASL
YLVKQVRDEA HRFAITFHRE LRGKGMTASI LDDVTGMGPV RKKALLKAFK SFKNLKSATL
EEIKEARVVP VEVAEELYRV LQQYNRERKD ERVVGGEAGE AVCPPAGEGP AAVESAAVDA
AVEAAVQGER TRTDTEG