Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1557 |
Symbol | |
ID | 8415855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1850556 |
End bp | 1853402 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645024525 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_003181914 |
Protein GI | 257791308 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0128152 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGAACA CGTACAAAGA AACGATGAAC CTGCCGAAGA CCGACTTCGC GATGCGGGCG AACCTGCCCG AGAGCGAGCC GAAGCGTCTG GCCAAGTGGG AAGAAGAGCA TATCTACGAG CAGGTTCTCG AGAAGAACAA GGACGGCAAG CCCTTCATCC TGCACGACGG CCCTCCGTAC GCCAACGGCC CCATCCATAT CGGCCATGCC TTCAACAAGA TCCTCAAGGA CTTCGTGAAC AAGTCCCATG CGCAGCGCGG GTTCTTCACG CCCTACGTGC CCGGCTGGGA TTGCCACGGC CAGCCCATCG AGCATATGGT GGAGAAGACC CTCGGCCCCG ACAAAATGGC CAAGATCGAC CAGCCCACGC TGCGCCGCCT CTGCCGCGAA TGGGCTGAGA AATACGTCGA CGTGCAGCGC GAGGGCTTCA AGCGCCTCGG CGTGAACGCC GATTGGGAGC ATCCGTACCT CACGTTCACG CCGAACTACG AGGCGGGCAA CGTCGAGGTG TTCAAGCGGA TGTACCTCGA CGGCTCGGTG TACCGCGGCC GCAAACCCAT CCACTGGTGC AAGCGCTGCC ACACGGCGCT TGCCGAGGCC GAGATCGAGT ACTCCGACGA GACGTCGCCG TCCATCTTCG TGAAGTTCAA GATGGATATC ATGCCCGGCA TGTTCGAGAC TGCAGGCGCT GCGGGCGACG CCTACGTGCT CATCTGGACC ACCACGCCCT GGACGCTTCC GGCGAACACG GCCGTGTCGC TGGCGCCCGA CGCCGACTAC GTGATGGTCC AGGCGGACGG CTCCAACATG ATCATGGCGC GCGAGCTGGT GGAACAGGTG GCCGAGATCG CGGGCTGGGA ATCCTACGAC CTCGTGCGCG GCGAGGACGG CGAGCCCGTC GCGCTCAAGG GCCGCGAATT CACCGGCCTC ACCTACACGT GCCCCGTCCG CCAGGACCTC AAGGGCACCA TCATCTACGG CGACCACGTC ACGCTGGACT CCGGCACGGG CGCGGTGCAC ACGGCTCCCG GCCATGGTCA GGACGACTAC CTCGTGGCGC TGGAGTTCGA CGTGCCGCTG CTCATGCCGG TGGACGACAA CGGCGTTCTC ACCGACGAGG CGGGCCCTTT CGCGGGCCTC GACGTTGACG AGGCGAACCC GGTCATCATC GAATGGCTGC GCGAACGCGG CACGCTGGTG GCCCAAAAGG AGATCCTGCA CAGCTACCCG CACTGCTGGC GCTGCCACGA GCCGGTCATC TTCCGCGCCA CCGACCAGTG GTTCGTGTCC ATGGACAAGA ACAGCCTGCG CGAGAACGCG CTCAAGGCCA TCGAGAACGA CGTCGAGTGG ATTCCCGCGT GGGCTTCGAA CCGTATCGGA TCCATGGTGG CCGACCGTCC CGACTGGTGC ATCTCGCGCC AGCGTTCGTG GGGCGTGCCC ATCCCCGTGT TCAAATGCGC GAAGTGCGGC TCCACCGTGG CGAACGAGCA GACGTTCGAC GCGGTGATCG ACCTGTTCTA CCGCGAGGGC GCCGACGCGT GGTTCACGCG CGAGCCGTCC GAGTACCTGC CGCGCGGCGT GAAATGCGAG ACGTGCGGCT GCACCGAGCT GACCCCCGAG AAGGACATCC TCGACGTGTG GTGGGAGAGC GGCGTGTCGC ACACCAGCGT GTTGAAGCAT CGCGAGGCCG AGGGCCTGCG CTTCCCGGCC GACATGTACC TGGAAGGCTC CGACCAGCAC CGCGGCTGGT TCCAGTCGTC GCTGCTCACT AGCATGGGCG CGTACGGCGT GCCGCCGTAC AAGGCCGTCA TGCACTGCGG CTTCACCGTG GACGGCGAAG GCCGCAAGAT GTCGAAATCG CTGGGCAACG GCGTGGATCC GGCCGAGGTC ATGGCGAAGA GCGGCGCCGA CGTGCTGCGC CTGTGGGTGG CCAGCGTCGA TTACTCGCAG GACGTGAGCA TCTCCGACGA GATCCTCCAG CGCACCTCCG AGGCGTACCG CCGCATCCGC AACACGTTCC GCTTCCTGCT GGGCAGCCTG GATGACTTCG ACGATCAGAA GGACGCCGTG TCCGATTGGA ACGCGCTCGA GCCCCTCGAC CAGTGGGCCA TGGTGCGCAC GAAGCACCTG CTGGACGACG TGAGCGCCGC CTACGACGCG TACAAGTTCC ACTACGTGTA CCGCGCCGTG TATGACTACA TTGTGAACGA CCTGTCGGCC GTGTACATGG ACGCGACGAA GGACCGCCTG TACTCCGAGG CGCCTGACTC GCCGCGCCGC CGCGCCGTGC AGACGGTGCT GATGAACATC CTCGAGGTGC TCGTGCGCGT GCTGGCGCCG GTGCTCTCGT TCACCACCGA CGAGGTGTGG GAGCACTACC CCCAGGCCAT GCGCGAGCGC GCAGGCCGCC CGACGAACGT GCAGCTGGCC GGTTGGCCCA AGGCGTCCGA CTTCGCGCCC GCCATCCCCG CGGACGGGGA GCGCGTGTCC GAGGACTTCG GCGTGATCAT GGGCGTGCGC GAAGTCGTGA CCAAGGCGCT CGAGGACGCG CGCGGTCAGA AGGTCGTGAA CAAGAGCCAG GAGGCGGCCG TCGTCGTGAC CGCGCCCCGC GCCGTGCTCG ACGCGGTGGA GCGCTATGAC GCGGCGGTGT TCGAGGAGCT GTTCATCGTG GCGTCCGTGT CGTTCGCCGA AGGCGAGGAG CTGGCGGCGA CGGTTTCGAA GACCGAGGCC GAGAAATGCC CGCGCTGCTG GAACCACCGT GCGCTCGGCG GCAACGCGAA CCACGGATCC GTCTGCGAGC GCTGCGGCGA TGCGCTCGAC GCCATCGGCT TCGCGGAAGG GGAATAG
|
Protein sequence | MANTYKETMN LPKTDFAMRA NLPESEPKRL AKWEEEHIYE QVLEKNKDGK PFILHDGPPY ANGPIHIGHA FNKILKDFVN KSHAQRGFFT PYVPGWDCHG QPIEHMVEKT LGPDKMAKID QPTLRRLCRE WAEKYVDVQR EGFKRLGVNA DWEHPYLTFT PNYEAGNVEV FKRMYLDGSV YRGRKPIHWC KRCHTALAEA EIEYSDETSP SIFVKFKMDI MPGMFETAGA AGDAYVLIWT TTPWTLPANT AVSLAPDADY VMVQADGSNM IMARELVEQV AEIAGWESYD LVRGEDGEPV ALKGREFTGL TYTCPVRQDL KGTIIYGDHV TLDSGTGAVH TAPGHGQDDY LVALEFDVPL LMPVDDNGVL TDEAGPFAGL DVDEANPVII EWLRERGTLV AQKEILHSYP HCWRCHEPVI FRATDQWFVS MDKNSLRENA LKAIENDVEW IPAWASNRIG SMVADRPDWC ISRQRSWGVP IPVFKCAKCG STVANEQTFD AVIDLFYREG ADAWFTREPS EYLPRGVKCE TCGCTELTPE KDILDVWWES GVSHTSVLKH REAEGLRFPA DMYLEGSDQH RGWFQSSLLT SMGAYGVPPY KAVMHCGFTV DGEGRKMSKS LGNGVDPAEV MAKSGADVLR LWVASVDYSQ DVSISDEILQ RTSEAYRRIR NTFRFLLGSL DDFDDQKDAV SDWNALEPLD QWAMVRTKHL LDDVSAAYDA YKFHYVYRAV YDYIVNDLSA VYMDATKDRL YSEAPDSPRR RAVQTVLMNI LEVLVRVLAP VLSFTTDEVW EHYPQAMRER AGRPTNVQLA GWPKASDFAP AIPADGERVS EDFGVIMGVR EVVTKALEDA RGQKVVNKSQ EAAVVVTAPR AVLDAVERYD AAVFEELFIV ASVSFAEGEE LAATVSKTEA EKCPRCWNHR ALGGNANHGS VCERCGDALD AIGFAEGE
|
| |