Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0829 |
Symbol | leuS |
ID | 5669245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 966995 |
End bp | 970045 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239758 |
Product | leucyl-tRNA synthetase |
Protein accession | YP_001505193 |
Protein GI | 158312685 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0577341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA CGACCGAGCG CGGCCCCGCC GCGGTGCCCG CGGGAACCGC GGCCGGTGAC GGCGACCTGC CCCACCGGTA CGACTCCCGG CTGGCCGCCG AGATCGAGCG GCACTGGCAG CAGCGGTGGC TGCGGGAGGG CACCTTCGAG TCACCGAACC CGACGGGCCC GCTGTCCGAG GGCTTCGAGG CGGTCCGCGG CCGTGACCCG TTCTACGTCC TCGACATGTT CCCGTACCCG AGCGGAACCG GCCTGCATGT GGGCCACCCG CTGGGCTACA TCGGCTCGGA CGTCTTCGCC CGCTTCCTGC GAATGACCGG GCGTCATGTC CTGCACACCT TCGGCTACGA CGCCTTCGGG TTGCCGGCCG AGCAGTACGC CATCAACACC GGCCAGCACC CCCGGGTGAC GACCGAGGCG AACATCGCGA ACATGCGCCG CCAGCTCTCC CGGCTGGGAC TGGGCCACGA CACCCGCCGC GAGATCGCCA CCACCGACAC CGCGTACTAC CGGTGGACAC AGTGGATCTT CCTGAAGATC TTCGACAGCT GGTACGACGA GGCCGCCGGG CGGGCCCGTC CGATCAGCGA GCTGGTCGAG GAGCTCGACG CCGGGCACCG CGCCGCGACC GGGCCCGGCA CGGCCGAGGC GAACCCGCGG CAGGCCGCCT GGTCGGAGCT GACCGCGACG GAGCGCCGCC GGGTCGTCGA CGCGCACCGG CTGACCTACA TCTCCGAGGA GCTGGTCAAC TGGTGCCCGG GGCTGGGCAC GGTGCTGGCC AACGAGGAGG TCACGCCCGA GGGTCGCAGC GACATCGGGA ACTATCCGGT CTACCGCCGT CCGCTGCGCC AGTGGATGAT GCGGATCACC GCCTACGCCG ACCGGTTGAT GTCGGACCTG GACCTGGTCG ACTGGCCAGA TTCCATCAAG CACATGCAGC GGAACTGGAT CGGCCCGAGC GACGGCGCGA CCGTCCGCTT CTCCACGGTC ACCGGTGCGG GCGACACGGC CGGCGCGGGC GGGGTGGATG CCCCGGTCGG GCCGGCGCCG ATCGACGTCG AGGTGTACAC CACCCGGCCC GACACCCTGC CGGGGGCGAC CTTCCTCGTC CTGGCGCCGG AGCACCCGCT GGTCGACGCC CTGACCGCCA CCTCGTGGCC GGCCGACACC CCGGCGGGCT GGCGCTTCGC GCAGGAACGC CCTGCCGGTG TCACCGACGG GGAGTGGACG CCCCAGGCGG CCGTCGACGC CTACCGGGCG TTCGCCGCCC GGCGCAGCGA CCGCCAGCGC GGCGGCACGG AGATCGACCG CACCGGCGTG TTCACCGGGA CGTACGTCCG CAACCCGGTC GGCGGCGGCG TGATCCCGGT CTTCCTGGCG GACTACGTCC TGCTCGGCTA CGGCACCGGC GCGATCATGG CGGTGCCCGC GCACGACGAG CGTGACTTCT CCTTCGCCCA GGAGTTCGGC CTGCCCATTC CCGCGGTCCT GGAGCCCGAC GAGGCGTGGC TGGCCGAGCG CGACCTGGCC GCCGGGGCTC CGGCGTCGTC CTGGCCGGAG GCGTTCAGCG GCGAGGGCTC GTATCTGGCC GGCGCCACGG ACCGGCCCGT GCTGGCCGGC CTGTCCAAGG CCGACGCGAT CAAGACGACG ATCAGCTGGC TGGAGGACGC CGGCCGCGGC CGGGCGACCC GCTCCTACCG GCTGCGGGAC TGGCTGTTCT CCCGCCAGCG CTACTGGGGT GAGCCGTTCC CGATCGTCTT CGACGACGAC GGCATGCCCC GCGCCGTACC CGAGGAGCAG CTGCCGGTCG AGCTCCCGGA GATGACCGAC TTCCGGCCGA AGGCGATGGC CGACGACGAC GAGAGCGAGC CCGTCCCCCC GCTGGCCCGG GCCACCGAGT GGACCACGGT CACCCTCGAC CTGGGCGACG GCCCGCGCGG CTACCGCCGC GAGCTGAACA CGATGCCGCA GTGGGCCGGC TCCTGCTGGT ACTACCTGCG CTACCTGGAC CCGACGAACT CCGAGCGCTT CGTCGACCCG GCCGTCGAGC GCTACTGGAT GCACTCCGAG CGCGGCCCGG CCGGCGACGG AGGCGTCGAC CTGTACGTGG GCGGCGTCGA GCACGCCGTG CTGCACCTGC TCTACGCCCG GTTCTGGCAC AAGGTGCTGT ACGACCTGGG CCTGGTCTCG ACCAGGGAGC CGTTCAAGCG GCTCTACAAC CAGGGCTACA TCCAGGCGGA CGCGTTCACC GACGAGCGCG GCATGTACGT CCCGGCGACC GAGGTCGTCC AGGGTGCCGA CGGGTCGTTC AGCCACGAGG GCGCCCCGGT CAACCGGCGC TCCGGGAAGA TGGGCAAGAG CCTCAAGAAC AGCGTCAGCC CCGACGAGAT GTACGACAGC TACGGGGCCG ACACGCTGCG CGTGTACGAG ATGGCGATGG GCCCGCTGGA CGCCCACCGC CCGTGGCGCA CCGACGACAT CGTCGGCTCC TACCGGTTCC TGCAGCGGCT GTGGCGCAAC ATCATCGACG AGGGCACCGG GGAGCCACGG GTCCGTGCCG CCGCGCTCGA CGACGAGACC GCGCAGGCCC TGCACCGGAC TATCCTGGCC GTCCGCGCCG ACTACGCCGA GCTGCGCTTC AACACCGCGG TCGCCCGGCT CATCGAGCTG ACGAACCTGG CCAGCAAGCG CTTCGGCGCC GGGCTCGACG GCCCGCCGCG GGAGCTGGCC GAGGCGTTGG TGCTGATGGC CGCGCCGCTG GCGCCGCACA TCGCCGAGGA GCTGTGGACG CGGCTGGGGC ACACCGGCTC GGTCTGCGCC GTGCCCTTCC CCGAGGGGGA CGAGTCGCTG GCGGCGGCCG CGACGGTGCG GCTGCCGGTG CAGGTCAACG GCAAGGTGCG CTTCACGATC GACGTCCCGG CCGACGCGGA CGAGGCGGCC GTGCGCGCGG TCCTGGAGGC ACATGCGGAC TACACCCGGC ACACCTCCGG GCGCACCATC AAGCGCCTCA TCGTGGTCCC CGGCCGGATC GTGAACATCG CCCTGGGCTG A
|
Protein sequence | MSETTERGPA AVPAGTAAGD GDLPHRYDSR LAAEIERHWQ QRWLREGTFE SPNPTGPLSE GFEAVRGRDP FYVLDMFPYP SGTGLHVGHP LGYIGSDVFA RFLRMTGRHV LHTFGYDAFG LPAEQYAINT GQHPRVTTEA NIANMRRQLS RLGLGHDTRR EIATTDTAYY RWTQWIFLKI FDSWYDEAAG RARPISELVE ELDAGHRAAT GPGTAEANPR QAAWSELTAT ERRRVVDAHR LTYISEELVN WCPGLGTVLA NEEVTPEGRS DIGNYPVYRR PLRQWMMRIT AYADRLMSDL DLVDWPDSIK HMQRNWIGPS DGATVRFSTV TGAGDTAGAG GVDAPVGPAP IDVEVYTTRP DTLPGATFLV LAPEHPLVDA LTATSWPADT PAGWRFAQER PAGVTDGEWT PQAAVDAYRA FAARRSDRQR GGTEIDRTGV FTGTYVRNPV GGGVIPVFLA DYVLLGYGTG AIMAVPAHDE RDFSFAQEFG LPIPAVLEPD EAWLAERDLA AGAPASSWPE AFSGEGSYLA GATDRPVLAG LSKADAIKTT ISWLEDAGRG RATRSYRLRD WLFSRQRYWG EPFPIVFDDD GMPRAVPEEQ LPVELPEMTD FRPKAMADDD ESEPVPPLAR ATEWTTVTLD LGDGPRGYRR ELNTMPQWAG SCWYYLRYLD PTNSERFVDP AVERYWMHSE RGPAGDGGVD LYVGGVEHAV LHLLYARFWH KVLYDLGLVS TREPFKRLYN QGYIQADAFT DERGMYVPAT EVVQGADGSF SHEGAPVNRR SGKMGKSLKN SVSPDEMYDS YGADTLRVYE MAMGPLDAHR PWRTDDIVGS YRFLQRLWRN IIDEGTGEPR VRAAALDDET AQALHRTILA VRADYAELRF NTAVARLIEL TNLASKRFGA GLDGPPRELA EALVLMAAPL APHIAEELWT RLGHTGSVCA VPFPEGDESL AAAATVRLPV QVNGKVRFTI DVPADADEAA VRAVLEAHAD YTRHTSGRTI KRLIVVPGRI VNIALG
|
| |