Gene Elen_1422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1422 
Symbol 
ID8415720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1692893 
End bp1695784 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content68% 
IMG OID645024391 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003181780 
Protein GI257791174 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0151035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACAGA ACGCCATCGT CATCAAGGGT GCCCGCGAGC ACAACCTCAA GGACATCGAC 
CTCTCCATCC CGCGCGACGA GCTGGTGGTC ATCACGGGCC TGTCGGGCAG CGGCAAGTCG
TCGCTGGCGT TCGACACGAT GTACGCCGAG GGCCAGCGCC GCTACGTGGA GAGCCTGTCC
AGCTACGCGC GCATGTTCCT GGGCCAGATG TCGAAGCCCG ACCTCGACAG CATCGACGGC
CTGTCGCCGG CGGTGTCCAT CGACCAGAAG ACCACGTCGA AGAACCCGCG CTCGACGGTG
GGCACCACCA CCGAGATCTA CGACTACCTG CGCCTCCTGT TCGCGCGCGT GGGCGTGCCG
CACTGCCCCG AGTGCGGCCG CGTCATCAAG AAGCAGACCA CCGACCAGGT GACCGACGAG
ATCCTCGCCC TCGCGCCCGA TGCGAAGGCC ATCATCATGG CTCCCGTGGT GGCGGGCCGC
AAGGGCGAGT TCACGAAGCT GTTCGCCGAC CTGCAGAAGG AGGGCTTCAG CCGCGTGCGC
ATCGATGGCG AGATCGTGAA GCTGGACGGC GAGCCGCGCA CGCTCAACAA GAAGATCAAG
CACTTCATCG ACGTGGTGGT GGACCGCGTG CAGCTTAAAG CGAGCGCGAC GAGCCGCATC
GCCGAGGCGG TGGAGCTGGC CACGAAGTTG GCCGACGGGC GCGTGCTCGT GCAGGTGCTG
GGCGACGACG GCAAGCCGCT CGGCGAGGGC GGCGGGCGCT CGTCCGGCGC GACGGGCGGC
CTGGGCGCGG GGGAGCACAT CTTCTCGCTC GCGCTGGCAT GCCCCGAGCA CGGACACTCC
ATGGACGAGC TGCAGCCGCG CGACTTCTCG TTCAACGCAC CCTATGGCGC CTGCCCCGAC
TGCCTCGGCA TCGGCAGCCG CGAGGAGGTG GACGCGTCGC TGGTGGCGCC CGACCCGTCG
CTGTCGCTGA ACGAGGGCGC CATCGCGCCG TTCAAGACCG GCAACTACTA CCCGCAGGTG
CTGCGCGCCG TGGCCGCGCA CCTCGGCACC GACGCCGACA CCCCGTGGGA GGATATGCCT
AAGAAGGCGC AGGACGGGCT TCTGCACGGC CTGGGCAAGG ACAAGGTGCG CGTCGACTAC
GTCACGGTGG ACGGTCGCGA GACGTACTGG TACATCGAGT GGGAGGGAGC GCTGGCCGCA
GTGCAGCGCC GCTACCAGGA GGCTCAGTCC GACGCTCAGC GCGAGAAGCT GGCCAGCTAC
TTCGCCATCG TTCCGTGCCC GACCTGCGGC GGCAAGCGCC TGAAGCCCGA GATCCTCGCC
GTCACGGTGA ACGAACGCTC CATCCACGAC ATCACCGAGA TGAGCGCGGC GGACTCGCTC
GAATTCTTCG ACGGCCTCGC GTTCCACGGG TCGGAGGAGC ATATCGCCGG GCCCATCGTG
AAGGAGATCA AGGCGCGCCT CAAGTTCCTC GTGGACGTGG GTCTGGACTA CCTCACGCTG
GAGCGCGCCA CGGCGACGCT CTCCGGCGGC GAGGCGCAGC GCATCCGCCT GGCCACGCAG
ATCGGCGCGG GCCTCATGGG CGTGCTCTAC ATCCTGGACG AGCCTTCCAT CGGCCTGCAC
CAGCGCGACA ACGAGCGGCT CATCGCCACG CTCGAGCGCC TGCGCGACTT AGGGAACACC
GTCATCGTGG TGGAGCACGA CGAGGACACC ATCCGCAGCG CCGACTTCGT GGTGGACATG
GGCCCGGGCG CGGGCGAGCA CGGCGGCGAG ATCGTGGCCA TCGGCACGCC GGACGAGATC
ATGAAGGCCG AAGGCTCGCT CACGGCCGAC TACCTGTCGG GCCGGCGCCG CATCGAGGTG
CCCGAGAAGC GCCGCAAGCC GCGCCGCGGG TCGCTCAAGC TGACGGGCGC CACCGAGAAC
AACCTGCACA ACGTCACGCT CGAGGTTCCC TTCGGCACGC TCACCGTGGT CACGGGCGTG
TCCGGCTCGG GCAAGAGCTC GCTGGTCACC GACACGCTGG CGCCGGCGCT CGCGAACCGC
GTGAACCATG CGCACCGCCG CACGGGCGCC TACAAGAAGA TCACCGGGCT GGACAAGATC
GACAAGGTCA TCAACATCGA CCAGAGCCCC ATCGGGCGCA CGCCGCGTTC CAACCCGGCC
ACCTACATAG GCCTTTGGGA CGACATCCGC GCGCTGTTCG CCTCCACGCA GGAGTCGAAG
GCCCGCGGCT ACTCGCCGGG CCGCTTCTCG TTCAACGTGA ACGGCGGACG CTGCGAGGCG
TGCAAGGGCG ACGGCCAGAT CAAGATCGAG ATGCACTTCC TGCCCGACAT CTACGTGCCG
TGCGAGGTGT GCGGCGGCGA CCGCTACAAC CGCGAGACGC TGCAGGTGAC TTACCGCGGC
AAGAACATCG CCGAGGTGCT GGACATGACC GTGGAGGACG CGCTGGCGTT CTTCGAGAAC
ATACCCGGCA TCAAGCGCAA GCTGCAGACG CTGTTCGACG TGGGCCTGGG CTACATCCGC
CTGGGGCAGC CGGCCACCAC GCTGTCCGGC GGCGAGGCGC AGCGCGTGAA GCTTGCCAGC
GAGCTGCAGC GCCGCCAGAC CGGCAAGACG TTCTACATCC TGGACGAGCC CACCACCGGC
CTGCACTTCG AGGACGTGCG CCAGCTGCTC ATCGTGTTGC AGCGCCTGGT GGACGCGGGC
AACACCGTGC TGGTCATCGA GCACAACCTC GACGTCATCA AGTGCGCCGA CCGCATCGTC
GACCTCGGCC CCGAGGGCGG CGAGCGCGGC GGCACCGTGG TGGCCCAGGG CACGCCCGAG
GAGGTCGCCC AGGTGGAGGG CAGCTACACC GGCGCCTTCG TGAAGAAGAT GCTGGAGGAC
GGCCGGCTGT AG
 
Protein sequence
MGQNAIVIKG AREHNLKDID LSIPRDELVV ITGLSGSGKS SLAFDTMYAE GQRRYVESLS 
SYARMFLGQM SKPDLDSIDG LSPAVSIDQK TTSKNPRSTV GTTTEIYDYL RLLFARVGVP
HCPECGRVIK KQTTDQVTDE ILALAPDAKA IIMAPVVAGR KGEFTKLFAD LQKEGFSRVR
IDGEIVKLDG EPRTLNKKIK HFIDVVVDRV QLKASATSRI AEAVELATKL ADGRVLVQVL
GDDGKPLGEG GGRSSGATGG LGAGEHIFSL ALACPEHGHS MDELQPRDFS FNAPYGACPD
CLGIGSREEV DASLVAPDPS LSLNEGAIAP FKTGNYYPQV LRAVAAHLGT DADTPWEDMP
KKAQDGLLHG LGKDKVRVDY VTVDGRETYW YIEWEGALAA VQRRYQEAQS DAQREKLASY
FAIVPCPTCG GKRLKPEILA VTVNERSIHD ITEMSAADSL EFFDGLAFHG SEEHIAGPIV
KEIKARLKFL VDVGLDYLTL ERATATLSGG EAQRIRLATQ IGAGLMGVLY ILDEPSIGLH
QRDNERLIAT LERLRDLGNT VIVVEHDEDT IRSADFVVDM GPGAGEHGGE IVAIGTPDEI
MKAEGSLTAD YLSGRRRIEV PEKRRKPRRG SLKLTGATEN NLHNVTLEVP FGTLTVVTGV
SGSGKSSLVT DTLAPALANR VNHAHRRTGA YKKITGLDKI DKVINIDQSP IGRTPRSNPA
TYIGLWDDIR ALFASTQESK ARGYSPGRFS FNVNGGRCEA CKGDGQIKIE MHFLPDIYVP
CEVCGGDRYN RETLQVTYRG KNIAEVLDMT VEDALAFFEN IPGIKRKLQT LFDVGLGYIR
LGQPATTLSG GEAQRVKLAS ELQRRQTGKT FYILDEPTTG LHFEDVRQLL IVLQRLVDAG
NTVLVIEHNL DVIKCADRIV DLGPEGGERG GTVVAQGTPE EVAQVEGSYT GAFVKKMLED
GRL