Gene Information |
Locus tag | Elen_2233 |
Symbol | |
ID | 8416556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2622808 |
End bp | 2624313 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645025219 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_003182583 |
Protein GI | 257791977 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
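The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2622808
end_bp = 2624313
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches Protein Length:",
      expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed Gene Length and Protein Length.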
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.056881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.890291 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCAGAC TCTACAATAC GAAGACCCGC ACGAAGGTCG ACTTCGAGAC GCTCGAGCGC GGCAAGGTGG GCATGTACGT GTGCGGACCT ACCGTGTACA ACTACATTCA CATCGGCAAC GCCCGCACGT TCATCAGCTT CGACGTGATC CGCCGTTACC TTATGTGGCG CGGCTTCGAC GTGACGTTCG TGCAGAACGT CACCGACGTG GACGACAAGA TCATCGGCAA GTCGCTTGAG GAAGGCCGCA GCGCTGCCGA GGTGGCCGCC GAGTACACCG AGGCGTTTAT CGAGGACATG CGCGCGGCCG GCGTGCTGGA TCCCGATATC CGTCCGAAGG CCACCGAGGA GATCCCCGCA ATGATAGAGC TCATCCAAGA GCTCATCGAC GGAGGCCACG CTTACGATGC CGACGGCGAC GTGTACTTCA ACGTGCGTTC GTTCCCGGCT TACGGCGAGC TGTCGGGCCG CAACGTAGAC GAGATGGAGA GCGGCCACCG CGAGCTGCGC GCCGACGGCA AGGGCGTGGA GGATCGCAAG CGCGACCCGC TGGACTTCGC GCTGTGGAAG GCCGCCAAGC CCGGCGAGCC GGCGTGGGAG TCGCCCTGGG GCATGGGCCG TCCCGGCTGG CACATCGAGT GCTCGGCCAT GTCGCGCAAG TACCTGGGGC TTCCCTTCGA CATCCACGGC GGCGGCGCCG ACCTCGTGTT CCCGCACCAC GAGAACGAGC GCGCGCAGAG CGAGGCCGCG TGCGGCTGCA CGTTCGCGAA CTACTGGATG CACGGCGGCA TGCTGCAGAT CAACTCCGAG AAGATGAGCA AGTCGCTGGG CAACTTCAAG CTGCTGCGCG ACGTGCTGAA GGTCACCGAT CCGAAGGTGC TGCGCTTCCT TATGCTGCAG ACGCACTACC GCAGCCCGCT CGACTTCTCC GACGAGCGCC TGGCCGAGGC GGGCGCGGCG CTGTGCCGTA TCGAGAACGC GGTGAAGAAC CTCGATTGGC AGCTGCAGAA CGCCCAGGAC ATCCCCTCGC CGCTGGACAC GCGCGAGCTG ATGAAGCGTA CGAAGGAGGC GAAGCTGGCG TTCATCCTGG CCATGGACGA CGACTTCAAC ACCTGCAAGG CGCTGGGCGA GGTGTTCGAC TTCGTGGCCG CGGTGAACGC GCAGACGGCC GACAGGACTA TCTCGCTGTC CGACGTGCCG CCCGTGCGCG ATGCCCGCGG CGTCATCGTG GAGCTCATGG GCGTGTTCGG CATCGACGTG GAGGCCGCAT CCGCCTCGTG CGCCGCAGGA GGGTACCCGC CCGAGGTCGT GGGGTTGGCG GCCGATATCG CGGGCTACGA AGGCGCGGAT GCCGCCGAGG CCGTGGACGC CCTGCTGGCG GCCCGTGCCG ACGCCCGCGC CGCGAAGGAT TGGAGCCGTG CCGACGCCGT CCGCGACGGC CTGTGCGGTT TGGGCTTCGT GATCGAGGAC ACCCCGCAGG GTGCGCGCGT GACCTACGAG GGGTAA
Protein sequence | MIRLYNTKTR TKVDFETLER GKVGMYVCGP TVYNYIHIGN ARTFISFDVI RRYLMWRGFD VTFVQNVTDV DDKIIGKSLE EGRSAAEVAA EYTEAFIEDM RAAGVLDPDI RPKATEEIPA MIELIQELID GGHAYDADGD VYFNVRSFPA YGELSGRNVD EMESGHRELR ADGKGVEDRK RDPLDFALWK AAKPGEPAWE SPWGMGRPGW HIECSAMSRK YLGLPFDIHG GGADLVFPHH ENERAQSEAA CGCTFANYWM HGGMLQINSE KMSKSLGNFK LLRDVLKVTD PKVLRFLMLQ THYRSPLDFS DERLAEAGAA LCRIENAVKN LDWQLQNAQD IPSPLDTREL MKRTKEAKLA FILAMDDDFN TCKALGEVFD FVAAVNAQTA DRTISLSDVP PVRDARGVIV ELMGVFGIDV EAASASCAAG GYPPEVVGLA ADIAGYEGAD AAEAVDALLA ARADARAAKD WSRADAVRDG LCGLGFVIED TPQGARVTYE G