Gene Franean1_0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0829 
SymbolleuS 
ID5669245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp966995 
End bp970045 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content73% 
IMG OID641239758 
Productleucyl-tRNA synthetase 
Protein accessionYP_001505193 
Protein GI158312685 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0495] Leucyl-tRNA synthetase 
TIGRFAM ID[TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0577341 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA CGACCGAGCG CGGCCCCGCC GCGGTGCCCG CGGGAACCGC GGCCGGTGAC 
GGCGACCTGC CCCACCGGTA CGACTCCCGG CTGGCCGCCG AGATCGAGCG GCACTGGCAG
CAGCGGTGGC TGCGGGAGGG CACCTTCGAG TCACCGAACC CGACGGGCCC GCTGTCCGAG
GGCTTCGAGG CGGTCCGCGG CCGTGACCCG TTCTACGTCC TCGACATGTT CCCGTACCCG
AGCGGAACCG GCCTGCATGT GGGCCACCCG CTGGGCTACA TCGGCTCGGA CGTCTTCGCC
CGCTTCCTGC GAATGACCGG GCGTCATGTC CTGCACACCT TCGGCTACGA CGCCTTCGGG
TTGCCGGCCG AGCAGTACGC CATCAACACC GGCCAGCACC CCCGGGTGAC GACCGAGGCG
AACATCGCGA ACATGCGCCG CCAGCTCTCC CGGCTGGGAC TGGGCCACGA CACCCGCCGC
GAGATCGCCA CCACCGACAC CGCGTACTAC CGGTGGACAC AGTGGATCTT CCTGAAGATC
TTCGACAGCT GGTACGACGA GGCCGCCGGG CGGGCCCGTC CGATCAGCGA GCTGGTCGAG
GAGCTCGACG CCGGGCACCG CGCCGCGACC GGGCCCGGCA CGGCCGAGGC GAACCCGCGG
CAGGCCGCCT GGTCGGAGCT GACCGCGACG GAGCGCCGCC GGGTCGTCGA CGCGCACCGG
CTGACCTACA TCTCCGAGGA GCTGGTCAAC TGGTGCCCGG GGCTGGGCAC GGTGCTGGCC
AACGAGGAGG TCACGCCCGA GGGTCGCAGC GACATCGGGA ACTATCCGGT CTACCGCCGT
CCGCTGCGCC AGTGGATGAT GCGGATCACC GCCTACGCCG ACCGGTTGAT GTCGGACCTG
GACCTGGTCG ACTGGCCAGA TTCCATCAAG CACATGCAGC GGAACTGGAT CGGCCCGAGC
GACGGCGCGA CCGTCCGCTT CTCCACGGTC ACCGGTGCGG GCGACACGGC CGGCGCGGGC
GGGGTGGATG CCCCGGTCGG GCCGGCGCCG ATCGACGTCG AGGTGTACAC CACCCGGCCC
GACACCCTGC CGGGGGCGAC CTTCCTCGTC CTGGCGCCGG AGCACCCGCT GGTCGACGCC
CTGACCGCCA CCTCGTGGCC GGCCGACACC CCGGCGGGCT GGCGCTTCGC GCAGGAACGC
CCTGCCGGTG TCACCGACGG GGAGTGGACG CCCCAGGCGG CCGTCGACGC CTACCGGGCG
TTCGCCGCCC GGCGCAGCGA CCGCCAGCGC GGCGGCACGG AGATCGACCG CACCGGCGTG
TTCACCGGGA CGTACGTCCG CAACCCGGTC GGCGGCGGCG TGATCCCGGT CTTCCTGGCG
GACTACGTCC TGCTCGGCTA CGGCACCGGC GCGATCATGG CGGTGCCCGC GCACGACGAG
CGTGACTTCT CCTTCGCCCA GGAGTTCGGC CTGCCCATTC CCGCGGTCCT GGAGCCCGAC
GAGGCGTGGC TGGCCGAGCG CGACCTGGCC GCCGGGGCTC CGGCGTCGTC CTGGCCGGAG
GCGTTCAGCG GCGAGGGCTC GTATCTGGCC GGCGCCACGG ACCGGCCCGT GCTGGCCGGC
CTGTCCAAGG CCGACGCGAT CAAGACGACG ATCAGCTGGC TGGAGGACGC CGGCCGCGGC
CGGGCGACCC GCTCCTACCG GCTGCGGGAC TGGCTGTTCT CCCGCCAGCG CTACTGGGGT
GAGCCGTTCC CGATCGTCTT CGACGACGAC GGCATGCCCC GCGCCGTACC CGAGGAGCAG
CTGCCGGTCG AGCTCCCGGA GATGACCGAC TTCCGGCCGA AGGCGATGGC CGACGACGAC
GAGAGCGAGC CCGTCCCCCC GCTGGCCCGG GCCACCGAGT GGACCACGGT CACCCTCGAC
CTGGGCGACG GCCCGCGCGG CTACCGCCGC GAGCTGAACA CGATGCCGCA GTGGGCCGGC
TCCTGCTGGT ACTACCTGCG CTACCTGGAC CCGACGAACT CCGAGCGCTT CGTCGACCCG
GCCGTCGAGC GCTACTGGAT GCACTCCGAG CGCGGCCCGG CCGGCGACGG AGGCGTCGAC
CTGTACGTGG GCGGCGTCGA GCACGCCGTG CTGCACCTGC TCTACGCCCG GTTCTGGCAC
AAGGTGCTGT ACGACCTGGG CCTGGTCTCG ACCAGGGAGC CGTTCAAGCG GCTCTACAAC
CAGGGCTACA TCCAGGCGGA CGCGTTCACC GACGAGCGCG GCATGTACGT CCCGGCGACC
GAGGTCGTCC AGGGTGCCGA CGGGTCGTTC AGCCACGAGG GCGCCCCGGT CAACCGGCGC
TCCGGGAAGA TGGGCAAGAG CCTCAAGAAC AGCGTCAGCC CCGACGAGAT GTACGACAGC
TACGGGGCCG ACACGCTGCG CGTGTACGAG ATGGCGATGG GCCCGCTGGA CGCCCACCGC
CCGTGGCGCA CCGACGACAT CGTCGGCTCC TACCGGTTCC TGCAGCGGCT GTGGCGCAAC
ATCATCGACG AGGGCACCGG GGAGCCACGG GTCCGTGCCG CCGCGCTCGA CGACGAGACC
GCGCAGGCCC TGCACCGGAC TATCCTGGCC GTCCGCGCCG ACTACGCCGA GCTGCGCTTC
AACACCGCGG TCGCCCGGCT CATCGAGCTG ACGAACCTGG CCAGCAAGCG CTTCGGCGCC
GGGCTCGACG GCCCGCCGCG GGAGCTGGCC GAGGCGTTGG TGCTGATGGC CGCGCCGCTG
GCGCCGCACA TCGCCGAGGA GCTGTGGACG CGGCTGGGGC ACACCGGCTC GGTCTGCGCC
GTGCCCTTCC CCGAGGGGGA CGAGTCGCTG GCGGCGGCCG CGACGGTGCG GCTGCCGGTG
CAGGTCAACG GCAAGGTGCG CTTCACGATC GACGTCCCGG CCGACGCGGA CGAGGCGGCC
GTGCGCGCGG TCCTGGAGGC ACATGCGGAC TACACCCGGC ACACCTCCGG GCGCACCATC
AAGCGCCTCA TCGTGGTCCC CGGCCGGATC GTGAACATCG CCCTGGGCTG A
 
Protein sequence
MSETTERGPA AVPAGTAAGD GDLPHRYDSR LAAEIERHWQ QRWLREGTFE SPNPTGPLSE 
GFEAVRGRDP FYVLDMFPYP SGTGLHVGHP LGYIGSDVFA RFLRMTGRHV LHTFGYDAFG
LPAEQYAINT GQHPRVTTEA NIANMRRQLS RLGLGHDTRR EIATTDTAYY RWTQWIFLKI
FDSWYDEAAG RARPISELVE ELDAGHRAAT GPGTAEANPR QAAWSELTAT ERRRVVDAHR
LTYISEELVN WCPGLGTVLA NEEVTPEGRS DIGNYPVYRR PLRQWMMRIT AYADRLMSDL
DLVDWPDSIK HMQRNWIGPS DGATVRFSTV TGAGDTAGAG GVDAPVGPAP IDVEVYTTRP
DTLPGATFLV LAPEHPLVDA LTATSWPADT PAGWRFAQER PAGVTDGEWT PQAAVDAYRA
FAARRSDRQR GGTEIDRTGV FTGTYVRNPV GGGVIPVFLA DYVLLGYGTG AIMAVPAHDE
RDFSFAQEFG LPIPAVLEPD EAWLAERDLA AGAPASSWPE AFSGEGSYLA GATDRPVLAG
LSKADAIKTT ISWLEDAGRG RATRSYRLRD WLFSRQRYWG EPFPIVFDDD GMPRAVPEEQ
LPVELPEMTD FRPKAMADDD ESEPVPPLAR ATEWTTVTLD LGDGPRGYRR ELNTMPQWAG
SCWYYLRYLD PTNSERFVDP AVERYWMHSE RGPAGDGGVD LYVGGVEHAV LHLLYARFWH
KVLYDLGLVS TREPFKRLYN QGYIQADAFT DERGMYVPAT EVVQGADGSF SHEGAPVNRR
SGKMGKSLKN SVSPDEMYDS YGADTLRVYE MAMGPLDAHR PWRTDDIVGS YRFLQRLWRN
IIDEGTGEPR VRAAALDDET AQALHRTILA VRADYAELRF NTAVARLIEL TNLASKRFGA
GLDGPPRELA EALVLMAAPL APHIAEELWT RLGHTGSVCA VPFPEGDESL AAAATVRLPV
QVNGKVRFTI DVPADADEAA VRAVLEAHAD YTRHTSGRTI KRLIVVPGRI VNIALG