Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1732 |
Symbol | |
ID | 8416031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2038995 |
End bp | 2040437 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645024698 |
Product | exodeoxyribonuclease VII, large subunit |
Protein accession | YP_003182086 |
Protein GI | 257791480 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1570] Exonuclease VII, large subunit |
TIGRFAM ID | [TIGR00237] exodeoxyribonuclease VII, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.709367 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.00631476 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGTACG GGGAGATGGC GCGCCAAGAC GGCCTCGGCG CCGCGCTCGA GCGGGCTCGC GCGGCGACGC GCTCCGAGGG CGGGGGGTCG CCGGATGCGC TGTCGGTGTC GGGCGCGATG GCGTTGGCGA AGGGAGCGCT CGAGGGCGTG ACCGTGAGGC TCGTGGGCGA AGTGTCCGAG GTCTCGAACA AGCCCGGCTA CAAAGCCGTG TACTTCACCG TGAAGGACCA GCGCGCTGCG TTGCCGTGCA TGATGTGGAA CAACCGTTTC CAAGCGTCCG GCGTGCGGTT GGCCGTGGGC CAGCTCGTGG AACTGACCGG GCGTTTCACG CTGTACGCAC CGAAGGGCCG CATGAATTTC GACGTGTTTT CCATCGCGCT GGCCGGAGAG GGTAACCTGC GCATGCAGGT GGCGAACCTC GCGCGCAAGC TGGAAGCCGA GGGCCTTATG GCGCCGGCTC GCAAACGGCC GGTGCCTGCG TATCCGGCGC TCGTCGGACT GGTCACCTCG CCGCGGGGCG CGGCCGTTCA CGACGTGCTG CGCACACTGC GCCGCCGGTT CCCGCTGGCT CGCGTGCTCG TCGCGGGCGT GGCGGTCGAG GGCTCGAACG CGCCCGCCGG CATCGTTGAG GGCATGCGCG CGGTCGTGCG CGCCGGGGCC GAGGTGGTGC TCGTGGTGCG CGGCGGCGGG TCGTTCGAGG ATCTCATGCC CTTCAACGAC GAGGGGCTGG CGCGCATGAT CGCGAAGTGC CCCGTGCCGG TGGTCACCGG CATCGGCCAC GAGCCCGACA CGTCCATCGC CGACATGGTG GCCGACCTGC GCGCCTCTAC GCCCACGGCC GCCGCCGAAG CCGTAAGCCC CGCCCGCGAG AGTCTCGGCC GCTTGTTCGA GGCGCGTTCG TCGTCGCTGC GCGCCAGCAT GTCCCGGGCG CTCGACCGCG CCGGGGCCGA GGTGCGGCGC TGCGCTACGC GTCCGCTGTT CTGCGACGCG CAGCTTCTGT ACGCCACCGA GGCTCAGATG CTCGACCTGG CCTCCGATCG TCTGTTCCGG GCGCTGCCCG CGAACCTGGC GCGCGACGAG GCTTCGGTCG CGCGCCAGCG CGAGCGCTTG GCCTGCGCGT TGCCAGCCAG CCTCGATCGC AGCCGCACGC GTCTCGAGCA CGAGCGCGAG CGCCTGGCGT CGTGCGGCGG CGCTCTCGTG CCGCGCTTCG GGCAACAGGC GGCGCTTGCC GCCGCGCGTC TGCACGACCT GTCGCCGCTT GCCGTGCTGG GGCGCGGCTA CGCCATCGCG CGCACGGAAG ACGGCGCCGT GGTCAAGAGC GTGGAGGCCG CGCCGCCTGG CACGGCCGTG GACGTCGCCG TCGCGGACGG CGTGCTCGCC TGCCGGGTGG AGCAAGCCAG GCGCGTGGAC ACCGAAATCA TCGATTGGGA GGATGCATCA TGA
|
Protein sequence | MGYGEMARQD GLGAALERAR AATRSEGGGS PDALSVSGAM ALAKGALEGV TVRLVGEVSE VSNKPGYKAV YFTVKDQRAA LPCMMWNNRF QASGVRLAVG QLVELTGRFT LYAPKGRMNF DVFSIALAGE GNLRMQVANL ARKLEAEGLM APARKRPVPA YPALVGLVTS PRGAAVHDVL RTLRRRFPLA RVLVAGVAVE GSNAPAGIVE GMRAVVRAGA EVVLVVRGGG SFEDLMPFND EGLARMIAKC PVPVVTGIGH EPDTSIADMV ADLRASTPTA AAEAVSPARE SLGRLFEARS SSLRASMSRA LDRAGAEVRR CATRPLFCDA QLLYATEAQM LDLASDRLFR ALPANLARDE ASVARQRERL ACALPASLDR SRTRLEHERE RLASCGGALV PRFGQQAALA AARLHDLSPL AVLGRGYAIA RTEDGAVVKS VEAAPPGTAV DVAVADGVLA CRVEQARRVD TEIIDWEDAS
|
| |