Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1656 |
Symbol | |
ID | 8415955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 1957236 |
End bp | 1959581 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645024625 |
Product | DNA internalization-related competence protein ComEC/Rec2 |
Protein accession | YP_003182013 |
Protein GI | 257791407 |
COG category | [R] General function prediction only |
COG ID | [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.754434 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00232522 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGGGCG CTGCGCGGGG CAAACCGGTC GCGCTTCCTC CGCGCCCGGC GCTGCCGCCT GTGCTGGCCT GCGCGCTTTC GCTGTGGGCA TCGTGCGCGG CGGTGCTGGC CGCATCCGGA TCCTGGGATG CCGGCGCATG CCTCGCCGTC GGCGCGGCCG GCATCGTCGC GAGCGTCGCA TGCGCTTTCG CGCTATGGCG GCTGCCGGTG CCGATCGCGT GGGCCGCGCT GCTGGGAGCC GCGCTCGGCA TCGCGCTCGC AGGCGGGTGC GCGGCCGCGC AGCACGAGGC GCAGCTGCAA GGCGACGGCC TTTCGGGACG CTGGCGATTC GAAGTCGCAG CCGACGGTTC GCAGGGCCCT TATGGCGCAA CGTGCTTCGC GCGCGCGAAC CTTCCCGACA TCGGAGCCGT TACGGTGCGG CTTAGGTTCG AGGAAGGCGA GGACCCTCCG CGTTACGGCG ACGTGCTGGA GGCCGACGCG ACGCTCTCCG CGCCGGGCGG GTCGTCGGCG GCATACTGCT GGCGGCAGGG CGCGGTGCTC GAAGGCACGG CGCGCCGAGT CGTGAGCTGC GAGCGTGCCG ACGCCCTCGG TATCTTGACA GGGCTTCGCA ATCGCGCGAT CGACCTGATC GCCGACGAGG GGACCGACGA CGGGGCCGCG GTGCTCGCCG CGCTCGTGTG CGGATGGCGC GGCGCCCTCG AGGTAGGTGA CGCGTACGCC GCCTACCAGA CCAGCGGGCT TGCGCATCTC GTGGCGGTGT CGGGGGCGCA CCTGTCCATC GTCGCCGGCT GCGCAGCCGC GCTCTTGCGT GCGCTGCGCG TTCCCCGGCG CGCCGGAGCC GTCCTGCAGG CGTCGTTTCT GCTGGGCTTC CTCGTGCTGG CGGCCGCGCC CTCGTCGGCC GTGCGCGCGG CCGTCATGGC GTTCGCCGGC ATGTTCGCGT TCACGGCGCG GCGGCGGCCG GCGGCGCTCA GCGCGCTGGC CGTATGCATG ATCGGCTGCA TCGCGCTCGA TCCGCGCACG GCGCTGGCGG TCTCGTTCGC GCTGTCGGCG CTCTCGACGC TCGGCATCGT ACTGTTCGCA GGGCTTTTCC AGGCGTGGAT CGCGCAGTGC TTCCAGCGCG CTCCGCGCCT GGCGCGCGAG GCGCTCTCGC TTACCGCCGC TTCGAGCCTT GCGGCCGCGC CGCTCGCCGC ATCGCTGTTC TCGCAGGTGC CGGTTGTGGC GCCGCTGGCC AACGTAGCGG CCGCGCCGCT GTTTCCCGTA GTCTGCGCCG GCGGGTTCGC GGCGGTGCTG GCATCCTTGG CCGTTCCTGC CGTCGCGCCG GCGTTGATCG GATTCGCGTC GATGGGCACA GGCGCCCTCA CTGCCGTCGT GCGCGCGCTC GCGAGCGTCC CCTACGCGAG CCTGCCTGCG AGCGTGCCCC TCGCCGGGGC GCTTCTGGCG TCGGCGGCAT GCGCGGCGGC GCTGTGGCTG GCGTGGCCGC GTCCGAGCCG TCGTCGCGCG TTCGGTCTGG TCGCGGCGGC GGCGTGCGTC ATGATCGGGA CGGTCGTCGT CGCCCCGCGC CTTTCGGGCG ACGAGATCGT CATGCTGGAC GTGGGGCAGG GCGACGCGTT CCTGGTGCGG AGCCGGGGGA CGGCCGTCCT GATCGACACC GGCAACCAGG ACAGGATGCT GCGCGAGGCG CTCGCACGGC ATGGAGCGTA CCGCCTCGAC GCGGTGGTGA TCACCCACGG CGACGACGAC CATAAAGGAT CGCTCGCCTC GCTTGCGGGC GTCGTGGACG TGCGACGCGT GCTCGTCGCC CAGGACGCGC TCTCGTGCGG CTGCGAAGCG TGCGTCTCGC TCGTCGCCGA CGCGCGCAAG CTCGCGGGCG ACGGGGGAGT CGTCGGGCTT CGGCAGGGAG ATGCGCTTCA GGTGGGCTCC TTCGACTTGC AGACAGTGTG GCCCGAGCGG TTCTCCGACG AAGGGGGCAA CGCCGACAGC CTGTGCCTTG TCGCCGATGC GGACATCGAC GGCGACGGCG CATCCGAGTG GCGTGCGCTT TTCACGGGAG ACGCCGAGCG CGACCAGCTG CGCGCGCTCA TCGACGAAGG GCTTGTGGAC TCCGTCGACC TCTACAAGGT GGGCCATCAC GGATCCAAGA ACGCCCTCGA CGACGAGGAG GCCGCCGTCC TTTCTCCTCG CATCGCGTTG GTCAGCGCAG GGGCGCGCAA CCGCTACGGC CATCCGGCGC AGGACACGCT CGATCGGCTC GAGGCCGCGG GCGCACGCGT CTTCCGCACC GACGAGCAGG GAGATGTTTC GTGCAAATTG ACCGCCGACC GGATCGAGGT GGATACCCTG CGTTAG
|
Protein sequence | MRGAARGKPV ALPPRPALPP VLACALSLWA SCAAVLAASG SWDAGACLAV GAAGIVASVA CAFALWRLPV PIAWAALLGA ALGIALAGGC AAAQHEAQLQ GDGLSGRWRF EVAADGSQGP YGATCFARAN LPDIGAVTVR LRFEEGEDPP RYGDVLEADA TLSAPGGSSA AYCWRQGAVL EGTARRVVSC ERADALGILT GLRNRAIDLI ADEGTDDGAA VLAALVCGWR GALEVGDAYA AYQTSGLAHL VAVSGAHLSI VAGCAAALLR ALRVPRRAGA VLQASFLLGF LVLAAAPSSA VRAAVMAFAG MFAFTARRRP AALSALAVCM IGCIALDPRT ALAVSFALSA LSTLGIVLFA GLFQAWIAQC FQRAPRLARE ALSLTAASSL AAAPLAASLF SQVPVVAPLA NVAAAPLFPV VCAGGFAAVL ASLAVPAVAP ALIGFASMGT GALTAVVRAL ASVPYASLPA SVPLAGALLA SAACAAALWL AWPRPSRRRA FGLVAAAACV MIGTVVVAPR LSGDEIVMLD VGQGDAFLVR SRGTAVLIDT GNQDRMLREA LARHGAYRLD AVVITHGDDD HKGSLASLAG VVDVRRVLVA QDALSCGCEA CVSLVADARK LAGDGGVVGL RQGDALQVGS FDLQTVWPER FSDEGGNADS LCLVADADID GDGASEWRAL FTGDAERDQL RALIDEGLVD SVDLYKVGHH GSKNALDDEE AAVLSPRIAL VSAGARNRYG HPAQDTLDRL EAAGARVFRT DEQGDVSCKL TADRIEVDTL R
|
| |