Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1422 |
Symbol | |
ID | 8415720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1692893 |
End bp | 1695784 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645024391 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003181780 |
Protein GI | 257791174 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0151035 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAGA ACGCCATCGT CATCAAGGGT GCCCGCGAGC ACAACCTCAA GGACATCGAC CTCTCCATCC CGCGCGACGA GCTGGTGGTC ATCACGGGCC TGTCGGGCAG CGGCAAGTCG TCGCTGGCGT TCGACACGAT GTACGCCGAG GGCCAGCGCC GCTACGTGGA GAGCCTGTCC AGCTACGCGC GCATGTTCCT GGGCCAGATG TCGAAGCCCG ACCTCGACAG CATCGACGGC CTGTCGCCGG CGGTGTCCAT CGACCAGAAG ACCACGTCGA AGAACCCGCG CTCGACGGTG GGCACCACCA CCGAGATCTA CGACTACCTG CGCCTCCTGT TCGCGCGCGT GGGCGTGCCG CACTGCCCCG AGTGCGGCCG CGTCATCAAG AAGCAGACCA CCGACCAGGT GACCGACGAG ATCCTCGCCC TCGCGCCCGA TGCGAAGGCC ATCATCATGG CTCCCGTGGT GGCGGGCCGC AAGGGCGAGT TCACGAAGCT GTTCGCCGAC CTGCAGAAGG AGGGCTTCAG CCGCGTGCGC ATCGATGGCG AGATCGTGAA GCTGGACGGC GAGCCGCGCA CGCTCAACAA GAAGATCAAG CACTTCATCG ACGTGGTGGT GGACCGCGTG CAGCTTAAAG CGAGCGCGAC GAGCCGCATC GCCGAGGCGG TGGAGCTGGC CACGAAGTTG GCCGACGGGC GCGTGCTCGT GCAGGTGCTG GGCGACGACG GCAAGCCGCT CGGCGAGGGC GGCGGGCGCT CGTCCGGCGC GACGGGCGGC CTGGGCGCGG GGGAGCACAT CTTCTCGCTC GCGCTGGCAT GCCCCGAGCA CGGACACTCC ATGGACGAGC TGCAGCCGCG CGACTTCTCG TTCAACGCAC CCTATGGCGC CTGCCCCGAC TGCCTCGGCA TCGGCAGCCG CGAGGAGGTG GACGCGTCGC TGGTGGCGCC CGACCCGTCG CTGTCGCTGA ACGAGGGCGC CATCGCGCCG TTCAAGACCG GCAACTACTA CCCGCAGGTG CTGCGCGCCG TGGCCGCGCA CCTCGGCACC GACGCCGACA CCCCGTGGGA GGATATGCCT AAGAAGGCGC AGGACGGGCT TCTGCACGGC CTGGGCAAGG ACAAGGTGCG CGTCGACTAC GTCACGGTGG ACGGTCGCGA GACGTACTGG TACATCGAGT GGGAGGGAGC GCTGGCCGCA GTGCAGCGCC GCTACCAGGA GGCTCAGTCC GACGCTCAGC GCGAGAAGCT GGCCAGCTAC TTCGCCATCG TTCCGTGCCC GACCTGCGGC GGCAAGCGCC TGAAGCCCGA GATCCTCGCC GTCACGGTGA ACGAACGCTC CATCCACGAC ATCACCGAGA TGAGCGCGGC GGACTCGCTC GAATTCTTCG ACGGCCTCGC GTTCCACGGG TCGGAGGAGC ATATCGCCGG GCCCATCGTG AAGGAGATCA AGGCGCGCCT CAAGTTCCTC GTGGACGTGG GTCTGGACTA CCTCACGCTG GAGCGCGCCA CGGCGACGCT CTCCGGCGGC GAGGCGCAGC GCATCCGCCT GGCCACGCAG ATCGGCGCGG GCCTCATGGG CGTGCTCTAC ATCCTGGACG AGCCTTCCAT CGGCCTGCAC CAGCGCGACA ACGAGCGGCT CATCGCCACG CTCGAGCGCC TGCGCGACTT AGGGAACACC GTCATCGTGG TGGAGCACGA CGAGGACACC ATCCGCAGCG CCGACTTCGT GGTGGACATG GGCCCGGGCG CGGGCGAGCA CGGCGGCGAG ATCGTGGCCA TCGGCACGCC GGACGAGATC ATGAAGGCCG AAGGCTCGCT CACGGCCGAC TACCTGTCGG GCCGGCGCCG CATCGAGGTG CCCGAGAAGC GCCGCAAGCC GCGCCGCGGG TCGCTCAAGC TGACGGGCGC CACCGAGAAC AACCTGCACA ACGTCACGCT CGAGGTTCCC TTCGGCACGC TCACCGTGGT CACGGGCGTG TCCGGCTCGG GCAAGAGCTC GCTGGTCACC GACACGCTGG CGCCGGCGCT CGCGAACCGC GTGAACCATG CGCACCGCCG CACGGGCGCC TACAAGAAGA TCACCGGGCT GGACAAGATC GACAAGGTCA TCAACATCGA CCAGAGCCCC ATCGGGCGCA CGCCGCGTTC CAACCCGGCC ACCTACATAG GCCTTTGGGA CGACATCCGC GCGCTGTTCG CCTCCACGCA GGAGTCGAAG GCCCGCGGCT ACTCGCCGGG CCGCTTCTCG TTCAACGTGA ACGGCGGACG CTGCGAGGCG TGCAAGGGCG ACGGCCAGAT CAAGATCGAG ATGCACTTCC TGCCCGACAT CTACGTGCCG TGCGAGGTGT GCGGCGGCGA CCGCTACAAC CGCGAGACGC TGCAGGTGAC TTACCGCGGC AAGAACATCG CCGAGGTGCT GGACATGACC GTGGAGGACG CGCTGGCGTT CTTCGAGAAC ATACCCGGCA TCAAGCGCAA GCTGCAGACG CTGTTCGACG TGGGCCTGGG CTACATCCGC CTGGGGCAGC CGGCCACCAC GCTGTCCGGC GGCGAGGCGC AGCGCGTGAA GCTTGCCAGC GAGCTGCAGC GCCGCCAGAC CGGCAAGACG TTCTACATCC TGGACGAGCC CACCACCGGC CTGCACTTCG AGGACGTGCG CCAGCTGCTC ATCGTGTTGC AGCGCCTGGT GGACGCGGGC AACACCGTGC TGGTCATCGA GCACAACCTC GACGTCATCA AGTGCGCCGA CCGCATCGTC GACCTCGGCC CCGAGGGCGG CGAGCGCGGC GGCACCGTGG TGGCCCAGGG CACGCCCGAG GAGGTCGCCC AGGTGGAGGG CAGCTACACC GGCGCCTTCG TGAAGAAGAT GCTGGAGGAC GGCCGGCTGT AG
|
Protein sequence | MGQNAIVIKG AREHNLKDID LSIPRDELVV ITGLSGSGKS SLAFDTMYAE GQRRYVESLS SYARMFLGQM SKPDLDSIDG LSPAVSIDQK TTSKNPRSTV GTTTEIYDYL RLLFARVGVP HCPECGRVIK KQTTDQVTDE ILALAPDAKA IIMAPVVAGR KGEFTKLFAD LQKEGFSRVR IDGEIVKLDG EPRTLNKKIK HFIDVVVDRV QLKASATSRI AEAVELATKL ADGRVLVQVL GDDGKPLGEG GGRSSGATGG LGAGEHIFSL ALACPEHGHS MDELQPRDFS FNAPYGACPD CLGIGSREEV DASLVAPDPS LSLNEGAIAP FKTGNYYPQV LRAVAAHLGT DADTPWEDMP KKAQDGLLHG LGKDKVRVDY VTVDGRETYW YIEWEGALAA VQRRYQEAQS DAQREKLASY FAIVPCPTCG GKRLKPEILA VTVNERSIHD ITEMSAADSL EFFDGLAFHG SEEHIAGPIV KEIKARLKFL VDVGLDYLTL ERATATLSGG EAQRIRLATQ IGAGLMGVLY ILDEPSIGLH QRDNERLIAT LERLRDLGNT VIVVEHDEDT IRSADFVVDM GPGAGEHGGE IVAIGTPDEI MKAEGSLTAD YLSGRRRIEV PEKRRKPRRG SLKLTGATEN NLHNVTLEVP FGTLTVVTGV SGSGKSSLVT DTLAPALANR VNHAHRRTGA YKKITGLDKI DKVINIDQSP IGRTPRSNPA TYIGLWDDIR ALFASTQESK ARGYSPGRFS FNVNGGRCEA CKGDGQIKIE MHFLPDIYVP CEVCGGDRYN RETLQVTYRG KNIAEVLDMT VEDALAFFEN IPGIKRKLQT LFDVGLGYIR LGQPATTLSG GEAQRVKLAS ELQRRQTGKT FYILDEPTTG LHFEDVRQLL IVLQRLVDAG NTVLVIEHNL DVIKCADRIV DLGPEGGERG GTVVAQGTPE EVAQVEGSYT GAFVKKMLED GRL
|
| |