Gene Elen_1656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_1656 
Symbol 
ID8415955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp1957236 
End bp1959581 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content72% 
IMG OID645024625 
ProductDNA internalization-related competence protein ComEC/Rec2 
Protein accessionYP_003182013 
Protein GI257791407 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.754434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00232522 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGGGCG CTGCGCGGGG CAAACCGGTC GCGCTTCCTC CGCGCCCGGC GCTGCCGCCT 
GTGCTGGCCT GCGCGCTTTC GCTGTGGGCA TCGTGCGCGG CGGTGCTGGC CGCATCCGGA
TCCTGGGATG CCGGCGCATG CCTCGCCGTC GGCGCGGCCG GCATCGTCGC GAGCGTCGCA
TGCGCTTTCG CGCTATGGCG GCTGCCGGTG CCGATCGCGT GGGCCGCGCT GCTGGGAGCC
GCGCTCGGCA TCGCGCTCGC AGGCGGGTGC GCGGCCGCGC AGCACGAGGC GCAGCTGCAA
GGCGACGGCC TTTCGGGACG CTGGCGATTC GAAGTCGCAG CCGACGGTTC GCAGGGCCCT
TATGGCGCAA CGTGCTTCGC GCGCGCGAAC CTTCCCGACA TCGGAGCCGT TACGGTGCGG
CTTAGGTTCG AGGAAGGCGA GGACCCTCCG CGTTACGGCG ACGTGCTGGA GGCCGACGCG
ACGCTCTCCG CGCCGGGCGG GTCGTCGGCG GCATACTGCT GGCGGCAGGG CGCGGTGCTC
GAAGGCACGG CGCGCCGAGT CGTGAGCTGC GAGCGTGCCG ACGCCCTCGG TATCTTGACA
GGGCTTCGCA ATCGCGCGAT CGACCTGATC GCCGACGAGG GGACCGACGA CGGGGCCGCG
GTGCTCGCCG CGCTCGTGTG CGGATGGCGC GGCGCCCTCG AGGTAGGTGA CGCGTACGCC
GCCTACCAGA CCAGCGGGCT TGCGCATCTC GTGGCGGTGT CGGGGGCGCA CCTGTCCATC
GTCGCCGGCT GCGCAGCCGC GCTCTTGCGT GCGCTGCGCG TTCCCCGGCG CGCCGGAGCC
GTCCTGCAGG CGTCGTTTCT GCTGGGCTTC CTCGTGCTGG CGGCCGCGCC CTCGTCGGCC
GTGCGCGCGG CCGTCATGGC GTTCGCCGGC ATGTTCGCGT TCACGGCGCG GCGGCGGCCG
GCGGCGCTCA GCGCGCTGGC CGTATGCATG ATCGGCTGCA TCGCGCTCGA TCCGCGCACG
GCGCTGGCGG TCTCGTTCGC GCTGTCGGCG CTCTCGACGC TCGGCATCGT ACTGTTCGCA
GGGCTTTTCC AGGCGTGGAT CGCGCAGTGC TTCCAGCGCG CTCCGCGCCT GGCGCGCGAG
GCGCTCTCGC TTACCGCCGC TTCGAGCCTT GCGGCCGCGC CGCTCGCCGC ATCGCTGTTC
TCGCAGGTGC CGGTTGTGGC GCCGCTGGCC AACGTAGCGG CCGCGCCGCT GTTTCCCGTA
GTCTGCGCCG GCGGGTTCGC GGCGGTGCTG GCATCCTTGG CCGTTCCTGC CGTCGCGCCG
GCGTTGATCG GATTCGCGTC GATGGGCACA GGCGCCCTCA CTGCCGTCGT GCGCGCGCTC
GCGAGCGTCC CCTACGCGAG CCTGCCTGCG AGCGTGCCCC TCGCCGGGGC GCTTCTGGCG
TCGGCGGCAT GCGCGGCGGC GCTGTGGCTG GCGTGGCCGC GTCCGAGCCG TCGTCGCGCG
TTCGGTCTGG TCGCGGCGGC GGCGTGCGTC ATGATCGGGA CGGTCGTCGT CGCCCCGCGC
CTTTCGGGCG ACGAGATCGT CATGCTGGAC GTGGGGCAGG GCGACGCGTT CCTGGTGCGG
AGCCGGGGGA CGGCCGTCCT GATCGACACC GGCAACCAGG ACAGGATGCT GCGCGAGGCG
CTCGCACGGC ATGGAGCGTA CCGCCTCGAC GCGGTGGTGA TCACCCACGG CGACGACGAC
CATAAAGGAT CGCTCGCCTC GCTTGCGGGC GTCGTGGACG TGCGACGCGT GCTCGTCGCC
CAGGACGCGC TCTCGTGCGG CTGCGAAGCG TGCGTCTCGC TCGTCGCCGA CGCGCGCAAG
CTCGCGGGCG ACGGGGGAGT CGTCGGGCTT CGGCAGGGAG ATGCGCTTCA GGTGGGCTCC
TTCGACTTGC AGACAGTGTG GCCCGAGCGG TTCTCCGACG AAGGGGGCAA CGCCGACAGC
CTGTGCCTTG TCGCCGATGC GGACATCGAC GGCGACGGCG CATCCGAGTG GCGTGCGCTT
TTCACGGGAG ACGCCGAGCG CGACCAGCTG CGCGCGCTCA TCGACGAAGG GCTTGTGGAC
TCCGTCGACC TCTACAAGGT GGGCCATCAC GGATCCAAGA ACGCCCTCGA CGACGAGGAG
GCCGCCGTCC TTTCTCCTCG CATCGCGTTG GTCAGCGCAG GGGCGCGCAA CCGCTACGGC
CATCCGGCGC AGGACACGCT CGATCGGCTC GAGGCCGCGG GCGCACGCGT CTTCCGCACC
GACGAGCAGG GAGATGTTTC GTGCAAATTG ACCGCCGACC GGATCGAGGT GGATACCCTG
CGTTAG
 
Protein sequence
MRGAARGKPV ALPPRPALPP VLACALSLWA SCAAVLAASG SWDAGACLAV GAAGIVASVA 
CAFALWRLPV PIAWAALLGA ALGIALAGGC AAAQHEAQLQ GDGLSGRWRF EVAADGSQGP
YGATCFARAN LPDIGAVTVR LRFEEGEDPP RYGDVLEADA TLSAPGGSSA AYCWRQGAVL
EGTARRVVSC ERADALGILT GLRNRAIDLI ADEGTDDGAA VLAALVCGWR GALEVGDAYA
AYQTSGLAHL VAVSGAHLSI VAGCAAALLR ALRVPRRAGA VLQASFLLGF LVLAAAPSSA
VRAAVMAFAG MFAFTARRRP AALSALAVCM IGCIALDPRT ALAVSFALSA LSTLGIVLFA
GLFQAWIAQC FQRAPRLARE ALSLTAASSL AAAPLAASLF SQVPVVAPLA NVAAAPLFPV
VCAGGFAAVL ASLAVPAVAP ALIGFASMGT GALTAVVRAL ASVPYASLPA SVPLAGALLA
SAACAAALWL AWPRPSRRRA FGLVAAAACV MIGTVVVAPR LSGDEIVMLD VGQGDAFLVR
SRGTAVLIDT GNQDRMLREA LARHGAYRLD AVVITHGDDD HKGSLASLAG VVDVRRVLVA
QDALSCGCEA CVSLVADARK LAGDGGVVGL RQGDALQVGS FDLQTVWPER FSDEGGNADS
LCLVADADID GDGASEWRAL FTGDAERDQL RALIDEGLVD SVDLYKVGHH GSKNALDDEE
AAVLSPRIAL VSAGARNRYG HPAQDTLDRL EAAGARVFRT DEQGDVSCKL TADRIEVDTL
R