Gene Elen_2971 details


Gene Information

Locus tag: Elen_2971
Symbol:
ID: 8417303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 3446694
End bp: 3449561
Gene Length: 2868 bp
Protein Length: 955 aa
Translation table: 11
GC content: 69%
IMG OID: 645025948
Product: AMP-dependent synthetase and ligase
Protein accession: YP_003183303
Protein GI: 257792697
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID: [TIGR01928] o-succinylbenzoic acid (OSB) synthetase
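
The start, end, gene length and protein length fields above agree with one another, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates (the record does not state its convention) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3446694, 3449561)
print(length, protein_length(length))  # 2868 955
```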


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGTG CGTTCGAGAG CACGGTGCGA CAGTTCCCGC AGCGCACGTG CTTCACGTAC 
GTTGACGAAG CGGGCAACGA GACGGCGTAC TCGTATCGCG AGACGCGCAT GCTGTCCGCT
GCGCTCGCGC GGCTCTTGCT GGACCGCGGC GTGCGTCCGG GCGATTGCGT CGCCGTGGAT
CTGCCGAACG ATCCGGCGTA CGTGCTGGTC ATGCTGGCTG CCGCGTATGG CGGCTTCGCG
TTGGTCGCGC TCAACAACCG CCTGACCGAT GCCGAGAAGT ACAGTCGGTT GCTGGACATC
GACCGCAAGC CCGGTGTGTC GCTGGCTTCG TTGCGCGTCG ATGCGACCAA CGTGTCGCGC
CTGTTGGAAG GCGCGAAGGC CATGTTGACG GGCGAGATGG CCGCACCGTC CGCCGTCCGT
GCGACGGGGC GCGGCGTGCG CCCTAGCTTC GCCGCCCGCG TGAACGCGGC GTCCGCCGAG
CCCCGCACTA CGAAGGCGCT CGGCCGCGCC AGCTCGGGCC GCGCGTCGCT GCGCCGTCGC
AGCGAAGTGG CGCAGCAGGA TGCTGCCGAG AGTGTCATCC ACTTCGCCGA ACACGCCGCG
CACGTGTTCG ACTTCGACGC CCGCGCCGTG GTCATGTTCA CGTCGGGCAC CACGGGGCGT
GCGAAGGCCG TATCGCTGAC GTGGGGAAAC ATCTGTTTCG CATCCGAAGC GTCCAACCGG
ACGCTCAATC GTCGCGGCGA GGGCATGTGG CAGGCGGTGC TGCCGCTCTA CCATATCGGC
GGTTTCCAGG TCGTCGTGCG CAGCGTGCTG AACGGCAGCC CGTTCGTGCT GTACCGCCGC
TTCGACGCCG TCCGTCTGCT GTCCGACGCG GCGAAGAAGG GCGCGACGCA CGTGTCGGTG
GTCGATAAGA TGCTGCAGGA CATGGTGGCC GCGGACAAGC CGGAAACGCT CGCGCGCTAC
TCCTGCATCC TGCTGGGCGG CGGCGCGCTC AACCCGCAGA CGCTCGATCG CGCGTGCCGT
GCGGGCGCGC GCGTGTACGC AAGCTACGGC ATGACCGAGA CGTCCAGCCA AATCGCCCAC
GCCCAGGTGA CGGCTTCGTT CACCGGCGGC TTGCGCCTGC TGCCCGGCTA CGAGGCGCAT
ATCGTGGATC CCGGCGACGA CGGGTTCGGC CGCCTGGCCG TGAAGGGCCC GGGCCTGTTC
GGCGGCTACC TGAACGCGCG CGCCGCGTAC ACGGTGGACG GCTTCTTCCT GACGGGGGAC
ACGGCGGCGC TGTACGGCGG CAAGCTGTTC GTGAAGGAGC GCACCGAGGA CATGTTCGTG
TCGGGCGGCG AGAACGTGTA CCCGGCCGAG ATCAAGGAGA AGCTGCTGCG CGTGGCCGGG
GTATCCGACG CCCACGTGTT CGGCGCGCCC GACGCCCGCT GGGGAAGACG GCCTGTTGCG
TTCGTCGAGC GCGAGAAAGC GCCCGCGCCG CGTCCGAGGG CGAGCCGCTA CGCGCAGCGG
ACGCAAGCGC AGACGCAGGC GGATCAGCTC GCATCCCTGA CGAACCGTCA GCTGGCGTCG
TACGTTCGCA CCAGCTTGGC GCCGCGCCTG TCGAAGCTGT ACCTGCCGAA GCACGTGTGC
GTGCTCGACG AGTTCCCGCG CACGGGCATC GGGAAAATCG ACCGCGTCGC GCTCGAGCGG
CGCTACGACC AGCGCATCGA GGTGGCGCGC GTCACGCTGC ACCGCATCCG GCTGCCGTTC
AAAACGCCGT TCAAAACAGC CAAGGCCACT CTGACGCATC GCGAGTCGAT CATCGTCGAA
GTGACGGATC ATGCCGGTCG CACGGGGCTG GGCGAGTGCG TGGCGTTTCC GACGGATTGG
TATCTGCCGG AAACGCTCGA CCAGGACGCG CGCATCCTCC ACGACGTTCT GGCTCCGATC
GTGCTGCGCG AGGCGTTCCT GCATCCCAGC GAGGCGAGCG CCGCCTTCGC CGCCGTTCCC
GAGGCGAAGG CGTTTCCGTT GGCATGCGGC GCGCTCGAGC CTGCGCTGTG GGACCTGTAC
GGCAAGATCG TCGGCAAGCC GCTATGGCAG CTGATCGGCG GGGCTGCGAA CGCCGCGACC
GCGGGCGCTT CGGCATCCGT GCCGGCCGGC GCGGCGATCG GGCTGGGCTC CGCGGTCGAG
ACGGCCGCGG CCGCGCGCCG CTGCGCCGAG GCAGGCTACA AGCGCGTGAA GCTGAAGGTG
AAACCGGGAA CCGCGCTCGC ATGCGCGCAG GCCGTGCGCG CCGCGTTGCC CGACATGATG
ATCACGCTCG ACGCCAACCA GAGCTTCTCG GAGCGTGAAG CCGAGGAGCT GCGCGGCTTG
GACGGCCTGA ACGTCGCCTG GATCGAGGAG CCGCTGGATC CGCGGCGCCT GCCCCCCGTG
GGGCCGACCG ACCTGTTCAG CCGCCTCGCG CGTCTGCAGC GCACGCTGCG CACCCCCATC
TGCCTCGACG AGTCCATCGT GCGCCCCGAG GACCTGTCCC GTGCGTTGGC GCAGCCCGAG
CTGCGGTGCT ACGCCCTCAA GCTGGGAAAA TGCGGCGGCG TGCAGCCTGC GCTCGATTTC
GTGCAAATGG CGCGCAAGCG GGGCATCGAG GTGTGGATGG GCGGCATGTA CGACACGGGC
GTGTCCAGGC GCCTGCATGC CGCGTTCGAG ACGCTGAGCG CGGTGGGTGC GCCCGGCGAC
ATCGGAGCCA CGGCTCGTTA CTTCTCCTGC GACATCACCG ATCCGCCCTA CACGGCCGAG
CGCGGCATGG TGACGCTCAA CCGCGAAGGC CATGCCAGCG GCCTGGGTTG CGACCTCAAC
CGCTCGGCGT TGGAAAACGT CCTCGTGGAG CGCCTCGTCA TTGCGTGA
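
The GC content listed under Gene Information can in principle be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown above; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example string is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only (illustrative input).
first_line = "ATGATCGGTG CGTTCGAGAG CACGGTGCGA CAGTTCCCGC AGCGCACGTG CTTCACGTAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` gives the figure to compare against the reported GC content.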
 
Protein sequence
MIGAFESTVR QFPQRTCFTY VDEAGNETAY SYRETRMLSA ALARLLLDRG VRPGDCVAVD 
LPNDPAYVLV MLAAAYGGFA LVALNNRLTD AEKYSRLLDI DRKPGVSLAS LRVDATNVSR
LLEGAKAMLT GEMAAPSAVR ATGRGVRPSF AARVNAASAE PRTTKALGRA SSGRASLRRR
SEVAQQDAAE SVIHFAEHAA HVFDFDARAV VMFTSGTTGR AKAVSLTWGN ICFASEASNR
TLNRRGEGMW QAVLPLYHIG GFQVVVRSVL NGSPFVLYRR FDAVRLLSDA AKKGATHVSV
VDKMLQDMVA ADKPETLARY SCILLGGGAL NPQTLDRACR AGARVYASYG MTETSSQIAH
AQVTASFTGG LRLLPGYEAH IVDPGDDGFG RLAVKGPGLF GGYLNARAAY TVDGFFLTGD
TAALYGGKLF VKERTEDMFV SGGENVYPAE IKEKLLRVAG VSDAHVFGAP DARWGRRPVA
FVEREKAPAP RPRASRYAQR TQAQTQADQL ASLTNRQLAS YVRTSLAPRL SKLYLPKHVC
VLDEFPRTGI GKIDRVALER RYDQRIEVAR VTLHRIRLPF KTPFKTAKAT LTHRESIIVE
VTDHAGRTGL GECVAFPTDW YLPETLDQDA RILHDVLAPI VLREAFLHPS EASAAFAAVP
EAKAFPLACG ALEPALWDLY GKIVGKPLWQ LIGGAANAAT AGASASVPAG AAIGLGSAVE
TAAAARRCAE AGYKRVKLKV KPGTALACAQ AVRAALPDMM ITLDANQSFS EREAEELRGL
DGLNVAWIEE PLDPRRLPPV GPTDLFSRLA RLQRTLRTPI CLDESIVRPE DLSRALAQPE
LRCYALKLGK CGGVQPALDF VQMARKRGIE VWMGGMYDTG VSRRLHAAFE TLSAVGAPGD
IGATARYFSC DITDPPYTAE RGMVTLNREG HASGLGCDLN RSALENVLVE RLVIA
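
The protein sequence above should correspond to a translation of the gene sequence under translation table 11. The sketch below is one way to check that, using only the standard library. Table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may serve as start codons; the sketch does not handle alternative start codons, which is fine here because this gene begins with ATG. The name `translate` is illustrative.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order (NCBI layout).
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code; only the set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGATCGGTG CGTTCGAGAG CACGGTGCGA"))  # MIGAFESTVR
```

Translating the full gene sequence and comparing the result with the protein sequence, after removing its block spacing, checks the two listings against each other.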