Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2971 |
Symbol | |
ID | 8417303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3446694 |
End bp | 3449561 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645025948 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_003183303 |
Protein GI | 257792697 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | [TIGR01928] o-succinylbenzoic acid (OSB) synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGTG CGTTCGAGAG CACGGTGCGA CAGTTCCCGC AGCGCACGTG CTTCACGTAC GTTGACGAAG CGGGCAACGA GACGGCGTAC TCGTATCGCG AGACGCGCAT GCTGTCCGCT GCGCTCGCGC GGCTCTTGCT GGACCGCGGC GTGCGTCCGG GCGATTGCGT CGCCGTGGAT CTGCCGAACG ATCCGGCGTA CGTGCTGGTC ATGCTGGCTG CCGCGTATGG CGGCTTCGCG TTGGTCGCGC TCAACAACCG CCTGACCGAT GCCGAGAAGT ACAGTCGGTT GCTGGACATC GACCGCAAGC CCGGTGTGTC GCTGGCTTCG TTGCGCGTCG ATGCGACCAA CGTGTCGCGC CTGTTGGAAG GCGCGAAGGC CATGTTGACG GGCGAGATGG CCGCACCGTC CGCCGTCCGT GCGACGGGGC GCGGCGTGCG CCCTAGCTTC GCCGCCCGCG TGAACGCGGC GTCCGCCGAG CCCCGCACTA CGAAGGCGCT CGGCCGCGCC AGCTCGGGCC GCGCGTCGCT GCGCCGTCGC AGCGAAGTGG CGCAGCAGGA TGCTGCCGAG AGTGTCATCC ACTTCGCCGA ACACGCCGCG CACGTGTTCG ACTTCGACGC CCGCGCCGTG GTCATGTTCA CGTCGGGCAC CACGGGGCGT GCGAAGGCCG TATCGCTGAC GTGGGGAAAC ATCTGTTTCG CATCCGAAGC GTCCAACCGG ACGCTCAATC GTCGCGGCGA GGGCATGTGG CAGGCGGTGC TGCCGCTCTA CCATATCGGC GGTTTCCAGG TCGTCGTGCG CAGCGTGCTG AACGGCAGCC CGTTCGTGCT GTACCGCCGC TTCGACGCCG TCCGTCTGCT GTCCGACGCG GCGAAGAAGG GCGCGACGCA CGTGTCGGTG GTCGATAAGA TGCTGCAGGA CATGGTGGCC GCGGACAAGC CGGAAACGCT CGCGCGCTAC TCCTGCATCC TGCTGGGCGG CGGCGCGCTC AACCCGCAGA CGCTCGATCG CGCGTGCCGT GCGGGCGCGC GCGTGTACGC AAGCTACGGC ATGACCGAGA CGTCCAGCCA AATCGCCCAC GCCCAGGTGA CGGCTTCGTT CACCGGCGGC TTGCGCCTGC TGCCCGGCTA CGAGGCGCAT ATCGTGGATC CCGGCGACGA CGGGTTCGGC CGCCTGGCCG TGAAGGGCCC GGGCCTGTTC GGCGGCTACC TGAACGCGCG CGCCGCGTAC ACGGTGGACG GCTTCTTCCT GACGGGGGAC ACGGCGGCGC TGTACGGCGG CAAGCTGTTC GTGAAGGAGC GCACCGAGGA CATGTTCGTG TCGGGCGGCG AGAACGTGTA CCCGGCCGAG ATCAAGGAGA AGCTGCTGCG CGTGGCCGGG GTATCCGACG CCCACGTGTT CGGCGCGCCC GACGCCCGCT GGGGAAGACG GCCTGTTGCG TTCGTCGAGC GCGAGAAAGC GCCCGCGCCG CGTCCGAGGG CGAGCCGCTA CGCGCAGCGG ACGCAAGCGC AGACGCAGGC GGATCAGCTC GCATCCCTGA CGAACCGTCA GCTGGCGTCG TACGTTCGCA CCAGCTTGGC GCCGCGCCTG TCGAAGCTGT ACCTGCCGAA GCACGTGTGC GTGCTCGACG AGTTCCCGCG CACGGGCATC GGGAAAATCG ACCGCGTCGC GCTCGAGCGG CGCTACGACC AGCGCATCGA GGTGGCGCGC GTCACGCTGC ACCGCATCCG GCTGCCGTTC AAAACGCCGT TCAAAACAGC CAAGGCCACT CTGACGCATC GCGAGTCGAT CATCGTCGAA GTGACGGATC ATGCCGGTCG CACGGGGCTG GGCGAGTGCG TGGCGTTTCC GACGGATTGG TATCTGCCGG AAACGCTCGA CCAGGACGCG CGCATCCTCC ACGACGTTCT GGCTCCGATC GTGCTGCGCG AGGCGTTCCT GCATCCCAGC GAGGCGAGCG CCGCCTTCGC CGCCGTTCCC GAGGCGAAGG CGTTTCCGTT GGCATGCGGC GCGCTCGAGC CTGCGCTGTG GGACCTGTAC GGCAAGATCG TCGGCAAGCC GCTATGGCAG CTGATCGGCG GGGCTGCGAA CGCCGCGACC GCGGGCGCTT CGGCATCCGT GCCGGCCGGC GCGGCGATCG GGCTGGGCTC CGCGGTCGAG ACGGCCGCGG CCGCGCGCCG CTGCGCCGAG GCAGGCTACA AGCGCGTGAA GCTGAAGGTG AAACCGGGAA CCGCGCTCGC ATGCGCGCAG GCCGTGCGCG CCGCGTTGCC CGACATGATG ATCACGCTCG ACGCCAACCA GAGCTTCTCG GAGCGTGAAG CCGAGGAGCT GCGCGGCTTG GACGGCCTGA ACGTCGCCTG GATCGAGGAG CCGCTGGATC CGCGGCGCCT GCCCCCCGTG GGGCCGACCG ACCTGTTCAG CCGCCTCGCG CGTCTGCAGC GCACGCTGCG CACCCCCATC TGCCTCGACG AGTCCATCGT GCGCCCCGAG GACCTGTCCC GTGCGTTGGC GCAGCCCGAG CTGCGGTGCT ACGCCCTCAA GCTGGGAAAA TGCGGCGGCG TGCAGCCTGC GCTCGATTTC GTGCAAATGG CGCGCAAGCG GGGCATCGAG GTGTGGATGG GCGGCATGTA CGACACGGGC GTGTCCAGGC GCCTGCATGC CGCGTTCGAG ACGCTGAGCG CGGTGGGTGC GCCCGGCGAC ATCGGAGCCA CGGCTCGTTA CTTCTCCTGC GACATCACCG ATCCGCCCTA CACGGCCGAG CGCGGCATGG TGACGCTCAA CCGCGAAGGC CATGCCAGCG GCCTGGGTTG CGACCTCAAC CGCTCGGCGT TGGAAAACGT CCTCGTGGAG CGCCTCGTCA TTGCGTGA
|
Protein sequence | MIGAFESTVR QFPQRTCFTY VDEAGNETAY SYRETRMLSA ALARLLLDRG VRPGDCVAVD LPNDPAYVLV MLAAAYGGFA LVALNNRLTD AEKYSRLLDI DRKPGVSLAS LRVDATNVSR LLEGAKAMLT GEMAAPSAVR ATGRGVRPSF AARVNAASAE PRTTKALGRA SSGRASLRRR SEVAQQDAAE SVIHFAEHAA HVFDFDARAV VMFTSGTTGR AKAVSLTWGN ICFASEASNR TLNRRGEGMW QAVLPLYHIG GFQVVVRSVL NGSPFVLYRR FDAVRLLSDA AKKGATHVSV VDKMLQDMVA ADKPETLARY SCILLGGGAL NPQTLDRACR AGARVYASYG MTETSSQIAH AQVTASFTGG LRLLPGYEAH IVDPGDDGFG RLAVKGPGLF GGYLNARAAY TVDGFFLTGD TAALYGGKLF VKERTEDMFV SGGENVYPAE IKEKLLRVAG VSDAHVFGAP DARWGRRPVA FVEREKAPAP RPRASRYAQR TQAQTQADQL ASLTNRQLAS YVRTSLAPRL SKLYLPKHVC VLDEFPRTGI GKIDRVALER RYDQRIEVAR VTLHRIRLPF KTPFKTAKAT LTHRESIIVE VTDHAGRTGL GECVAFPTDW YLPETLDQDA RILHDVLAPI VLREAFLHPS EASAAFAAVP EAKAFPLACG ALEPALWDLY GKIVGKPLWQ LIGGAANAAT AGASASVPAG AAIGLGSAVE TAAAARRCAE AGYKRVKLKV KPGTALACAQ AVRAALPDMM ITLDANQSFS EREAEELRGL DGLNVAWIEE PLDPRRLPPV GPTDLFSRLA RLQRTLRTPI CLDESIVRPE DLSRALAQPE LRCYALKLGK CGGVQPALDF VQMARKRGIE VWMGGMYDTG VSRRLHAAFE TLSAVGAPGD IGATARYFSC DITDPPYTAE RGMVTLNREG HASGLGCDLN RSALENVLVE RLVIA
|
| |