Gene Elen_3099 details


Gene Information

Locus tag: Elen_3099
Symbol:
ID: 8417435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3605029
End bp: 3606570
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 58%
IMG OID: 645026079
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003183430
Protein GI: 257792824
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
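
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3605029
end_bp = 3606570

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1542
print(gene_length_bp % 3)  # 0, i.e. a whole number of codons
print(protein_length_aa)   # 513
```

Under these assumptions the results match the listed Gene Length (1542 bp) and Protein Length (513 aa).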


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.000841335
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATCGAT GTGACGGCAT ACCGATTGAC AGGGTTGCTT TTTCGAAACA GCTTCAACAG 
CTTGCCGAAA AGTGGGGGGA CGAGACGGCT CTCGTCGATG TCTCCCGCGA CGGATCGGAA
CGGGCGTTGT CTCGTCGAGA GCTGTTCGAG GCGATCGAGA AGGTCGCCTG CCATCTCAAT
GATCGAAACG TTCAACAGGG AGATTACGTC GTCATAGCCC TTCCCAACGG TTACGAGAAC
GTCGTTGCGA CGCTTGCCGC CTGGCAGCTG GGAGCGTGTT GCGTATTCAT GTCCCCCAAG
GGAACCGATG AAGAACGCAA CCATATACAC GCGCTGTTCG AGCGCAAGCT GCTGATATCG
ACATGGGATT GCGACGACGA GGATTGCATA TCGCAGCAAG ACGTGCGGAA CTGGATCCGA
GGAGAGTTCC CCCATGGAGC CGAGATCCTG CCGTTCTTCG CCTGCGAGCC CTCGCGGGCT
ATCCCGACGG GAGGGTCGTC GGGAAAACCC AAGCTCGTCG TGCAAAAAGT CAGGCCGGCC
TACTGCGAGA TGGATCTGTC GGCTTGGACG GCGATGACGG GGCAGACCCC TTGGTCGAAA
CAGCTTATCC CCGGTTCCCT GTTTCACAAC CTGTACAGCA ATGCCACGTA CATCGGACTG
TTCTTCGGGC AGACCGTGTA CCTCATGGAA CGCTTCGACG AAGGACAGGC GCTCGAGCTC
ATCGAAAAGC ATGGGATTCA ATTCATAGGC TTGGCTCCCA CGATGATGGA CAGGATGATG
CGCCATCCTT CCTTCCAAAC CAGAAACCTG GAAAGCCTGG AGGCCGTGTT CCACAGCGGC
GGCCCCTGCC CCGACAAAGT AAAGCTTGCC TGGATCGAGC GCGTCGGCGC GAGAAAAGTG
TACGAGATGT ACGGCGAGAC CGAGATGATC GCAAGCACGT TCATACGAGG CGAAGAATGG
CTGCAGCACC GCGGCAGCGT GGGACGCCCC TTCGGTTGCG AGCTGCAGAT ACGCGACGAG
GAGGGTCGGG CGCTTCCCTG CGGCGAAGTA GGCGAGATAT TCGGCAAGCC GGCTATGGGG
CTTTCCGCAA AGTACGTGGG GCCGCAATCC ATCAAATCCG AGGAAGACGG GTTCTTCAGC
GTCGGCGACC TCGGTTGGCT CGACGAGGAA GGCTTCCTGT ACATCTCCGA CCGTCGTTCG
GACATGATCG TGACCGGCGG GAAGAACGTG TACACCGCCG AGGTGGAGAA CGCCATTTTC
GACTATCCCG GTATCTCCGA CGCTGTGGTT ATAGGCATTC CCGACCAGCA ATGGGGTCTT
AGGGTGCATG CGATTCTCGA GATCGAAGGC GCGGAAAGCG AGTTTTCGAC GGATGGTCTC
AAAGAATTCC TCCACACCAA GCTGAGCGGA TACAAATGCC CGAAGACCTA CGAAATCGTA
CAGGACATGC CTCGCAACGA AATGGGGAAA ATCCGGAGAC GCGAACTGAT AGAGCAACGT
GTGTCCAGCT TATCCGAACG CGAACCCGGA ACGAAAGCGT GA
 
Protein sequence
MDRCDGIPID RVAFSKQLQQ LAEKWGDETA LVDVSRDGSE RALSRRELFE AIEKVACHLN 
DRNVQQGDYV VIALPNGYEN VVATLAAWQL GACCVFMSPK GTDEERNHIH ALFERKLLIS
TWDCDDEDCI SQQDVRNWIR GEFPHGAEIL PFFACEPSRA IPTGGSSGKP KLVVQKVRPA
YCEMDLSAWT AMTGQTPWSK QLIPGSLFHN LYSNATYIGL FFGQTVYLME RFDEGQALEL
IEKHGIQFIG LAPTMMDRMM RHPSFQTRNL ESLEAVFHSG GPCPDKVKLA WIERVGARKV
YEMYGETEMI ASTFIRGEEW LQHRGSVGRP FGCELQIRDE EGRALPCGEV GEIFGKPAMG
LSAKYVGPQS IKSEEDGFFS VGDLGWLDEE GFLYISDRRS DMIVTGGKNV YTAEVENAIF
DYPGISDAVV IGIPDQQWGL RVHAILEIEG AESEFSTDGL KEFLHTKLSG YKCPKTYEIV
QDMPRNEMGK IRRRELIEQR VSSLSEREPG TKA
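
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how that could be done with the Python standard library alone. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical and not part of any database API. The codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, which this sketch does not model).

```python
from itertools import product

BASES = "TCAG"
# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence; uppercase it."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record (60 bases).
first_line = clean_sequence(
    "ATGGATCGAT GTGACGGCAT ACCGATTGAC AGGGTTGCTT TTTCGAAACA GCTTCAACAG"
)
print(len(first_line))        # 60
print(translate(first_line))  # MDRCDGIPIDRVAFSKQLQQ
```

The translated prefix agrees with the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` would allow the same comparison against the listed GC content and protein sequence.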