Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1907 |
Symbol | |
ID | 8416211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 2237638 |
End bp | 2239551 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645024877 |
Product | threonyl-tRNA synthetase |
Protein accession | YP_003182260 |
Protein GI | 257791654 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0441] Threonyl-tRNA synthetase |
TIGRFAM ID | [TIGR00418] threonyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0000167344 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACATCG TACTTCCCGA CGGCTCGAAC AAGGGGCTCG CAGACGGCTC CACCGTCGCC GACGTGGCCG CGTCCATCGG CGCCGGCCTG GCGAAGGCCG CGCTCGCGGG CATCGTGAAC GACCAGCCGG TCGATCTCTC GGCTCCCGTG GCCGAAGGCG ACTCGGTTGC CATCGTCACG GCCAAGAGCG ACGAGGGGCT GGAGCTTCTG CGCCACTCCA CCGCGCACGT CATGGCCGCC GCGCTCGTCG ACCTCTACGG CGACGTGCAG TTCGGTGTGG GGCCGGCCAT CGAGGACGGT TTCTACTACG ACGTGAAGCT CGACCGCGCG CTGTCGCCGG ACGACTTTGC CGCCATCGAG GCCCGCATGG CCGAGATCGT GAAGGCCGAC GAAGCCTTCG AGCGCCGCGT GGTCACGCGC GCCGAGGCCG AGGAGATCTT CGCGAGCCAG CCGTTCAAGC TGGAGCTCAT CGCCGAGCTG CCGGAGGACG CCGTCATCAC CACGTACACC ATCGGCAAGT TCACCGACCT GTGCCGCGGC CCGCACCTGC CCTCTACCGG CAAGCTGGGC GCGTTCAAGT TGACCAAGCT GGCCGGCGCG TACTGGCGCG GCGATGCCGA GCGCGAGATG CTCACCCGCA TCTACGGCAC CGCCTTCTTC AAGCAGAAGG AGCTGGACGA GCACCTGCGC AACCTGGAGG AGGCCGAGAA GCGCGACCAT CGCAAGCTGG GCCGTGAACT GGGGATCTAC ACCATGGACC CGCTGGCCGG CGTGGGTCTG CCCATGTACC TGCCCAAGGG CGCGCGCGTG ATCCGCACCA TGCAGGAGTG GCTGCGCCGC GACCTGTACG AGCGCGGCTA CGAGGAGGTC ATCACGCCGC ACGTGTACAA CGCCGACGTG TGGAAGACCT CGGGCCACTA CGGCTTCTAC AAAGAGAACA TGTACTTCTT CAACATCAAC GAGGGCACCG ACGAGGATCC GCGCCTCACC GAGTACGCCG TGAAGCCCAT GAACTGCCCG GGCCACGTGA TGCTGTACAA GAGCGAGCTG CATTCCTACC GCGACCTGCC GCTGCGCTAC TTCGAGTTCG GCACCGTGTA CCGCCACGAG ATGAGCGGCG TCGTGCACGG CCTGCTGCGC GCCCGCGGCT TCACTCAGGA CGACGCGCAC GTGTTCTGCA CGCGCGACCA GGTAGTGGAC GAGGTGGTGG CCATCCTCGA CCTCGTGGAC CACATCATGT CCACGTTCGG GTTCCAGTAC GAGGCCGAGA TCTCCACGCG CCCGGAGAAG AGCATCGGAA CCGACGACAT GTGGGAGCAC GCCACCAACG CGCTCAAGGA AGCCTGCGCG CGCCACGAGC TGGCCTACGA CATCAACGAG GGCGACGGCG CGTTCTACGG CCCGAAGATC GACATCAAGG TGAAGGACGC CATCGGGCGC ACCTGGCAGT GCTCCACCGT GCAGGTGGAC TTCAACATGC CCGAGCGCTT CGAACTGACC TACCGCACCG AGGACAACAC CGAGGAGCGC CCCTGGATGC TCCATCGCGC CATCTTCGGC TCCATCGAGC GCTTCCTCGG CATCCTCATC GAGCACTACG CCGGGGCGCT GCCGCTGTGG CTGGCCCCCG TGCAGGTGGC CGTGCTGCCG CTGGCCGACC GTCACAACGA GGCCGCGGCC GAGCTGGCGA AGCAGCTGAA GGCCGCGGGC GGCCGCATCG AGGTGTACGA CCAGAACGAG CCCATGCGCG TGAAGATCGC GAAGGCGCAA AGCCAGAAGA TCCCGTACAT GGTGGTGCTG GGCGACAAGG AGATCGAGAA CGGCACGGTC AGCGTGCGCG AGCGCCACGA GGGCGACCTC GGCGCGTGGC CGGTCGAGCA GCTGGTGGAG AAGCTGCGCG AGGCCGCGCT GTAG
|
Protein sequence | MNIVLPDGSN KGLADGSTVA DVAASIGAGL AKAALAGIVN DQPVDLSAPV AEGDSVAIVT AKSDEGLELL RHSTAHVMAA ALVDLYGDVQ FGVGPAIEDG FYYDVKLDRA LSPDDFAAIE ARMAEIVKAD EAFERRVVTR AEAEEIFASQ PFKLELIAEL PEDAVITTYT IGKFTDLCRG PHLPSTGKLG AFKLTKLAGA YWRGDAEREM LTRIYGTAFF KQKELDEHLR NLEEAEKRDH RKLGRELGIY TMDPLAGVGL PMYLPKGARV IRTMQEWLRR DLYERGYEEV ITPHVYNADV WKTSGHYGFY KENMYFFNIN EGTDEDPRLT EYAVKPMNCP GHVMLYKSEL HSYRDLPLRY FEFGTVYRHE MSGVVHGLLR ARGFTQDDAH VFCTRDQVVD EVVAILDLVD HIMSTFGFQY EAEISTRPEK SIGTDDMWEH ATNALKEACA RHELAYDINE GDGAFYGPKI DIKVKDAIGR TWQCSTVQVD FNMPERFELT YRTEDNTEER PWMLHRAIFG SIERFLGILI EHYAGALPLW LAPVQVAVLP LADRHNEAAA ELAKQLKAAG GRIEVYDQNE PMRVKIAKAQ SQKIPYMVVL GDKEIENGTV SVRERHEGDL GAWPVEQLVE KLREAAL
|
| |