Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3901 |
Symbol | leuS |
ID | 3906669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4667018 |
End bp | 4670212 |
Gene Length | 3195 bp |
Protein Length | 1064 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881227 |
Product | leucyl-tRNA synthetase |
Protein accession | YP_482980 |
Protein GI | 86742580 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0495] Leucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACGAG CGATGAGTGA GACGGCCGAG CCCGGCGCCC GGACCGGCGC CGCGGACACC ACGGTGGCAC CGACCGGTGC GTCTGGCGGG ATCATCCCGG CAGCCGCGGG CACCGCGGGC GGCGCCCCCG CCGGGACCGG GAGCGTCGAG CCGAGCTTCC GGTACGACGC CCGGCTCGCC GCCGACATCG AGCGGCGCTG GCAGCGCCGG TGGGCCGACG AGGGCACGTT CAACTCGCCG AACCCGGTCG GGCCGCTCTC GACGGGCTTC GAGAAGGTCG CCGGCCGGGA GCCGTTCTAC ATCATGGACA TGTTCCCCTA TCCGAGCGGC TCGGGGCTGC ACGTCGGGCA CCCGCTGGGG TACATCGGCA CCGACGTGTT CGCCCGCTAT CTGCGGATGT CCGGCCGGCA CGTGCTGCAT CCGTTCGGAT ACGACTCCTT CGGCCTGCCC GCCGAGCAGT ACGCCATCAA CACGGGCCAG CATCCCCGCG ACACCACCAA CGCCAACATC GCCAACATGC GCCGCCAACT CTCCCGGCTG GGGCTGGGCC ACGACACCCG CCGCGAGATC GCGACGACCG ACGTGGGCTA CTACCGCTGG ACGCAGTGGA TCTTCCAGCA GATCTTCAAC AGCTGGTACG ACCCGCAGGC CGGCCGGGCC CGGCCGATCG CCGAGCTGAT CGAGGAGTTC GCCGCGGGCA CCCGGGCGCC GGTAGCCGGC CCCGCCGGGG GGAACACGGC CGTGTCGGTT GACGCGGTCC GGGCGGCGAA CCCGGCCGGG CTGGCGTGGA CCGAACTCGA CGAGGTGTCC CGCCGGAAGG TCGTCAACGC GCACCGCCTG GCCTACATCT CCGAACAGCT GGTCAACTGG TGCCCGGGGC TGGGCACCGT GCTGGCGAAC GAGGAGGTCA CCGCCGACGG CCGCAGCGAC ATCGGGAACT ACCCAGTGTT TCGCCGGCCG CTGAAGCAGT GGATCCTGCG GATCACCGCC TATGCCGAGC GGCTGATCTC CGACCTCGAC CTGGTCGACT GGCCGGACTC GATCAAGCAG ATGCAGCGCA ACTGGATCAG TCCGAGCGAG GGCGCCAGCG TCGAGTTCAC CGTCGTCGCC CCCGGCGAGG AGGCAGGTGC GTCCGATCCG TCTGGCTCAT CGACCGCCCG GCGTATCGAG GTCTATACCA CCCGCCCGGA CACCCTGGCG GGGGCCACCT TCCTGGTACT CGCGCCCGAA CATCCCCTGG CCGACGCCCT GATCGCCGAC ACCTGGCCGG CGGACACCCC GGTGAGCTGG CGCTTCCCGG CGGGACGGCC GGGCGGCGGC ACGGAACCGG CGGACACCGC CGGGCCGGAG GCGGGCGCCG ATCCGGCGTG GACCCCGCGA GCCGCCGTCG ACGCCTACCG GGAGTTCGCG GCCCACCGCA GCGACCGGCA GCGCGGCGAG GAGGTCATCG ACCGCACCGG CGTGTTCACC GGCTCGTACG TGCGCAACCC GGTCGGTGGC GGGGTCATCC CGGTCTTCCT GGCCGACTAT GTGCTGCTGG GCTACGGCAC CGGGGCAATC ATGGCGGTAC CGGCGCACGA CAGCCGGGAC TTCTCCTTCG CCCGCGCGTT CGACCTGCCG ATCCCCGCCG TGCTGGAGCC GGACGCGGAC TGGTACGCCG CGCACGGGGT AGTGCCCGCG ACTCCATCAG CGCAGTGGCC CGAGGCGTTC AGCGGTGCGG GCGAGTATCG GCCCGGTCCG GCCAGCGCCC CGGTGCTGGT CGGCCTGTCG AAAAGCGAGG CGATCAAGGC CACGGTTCAC TGGCTGGAGG AGATCGGCGC CGGCAGGTCG GCGCGGTCGT ACCGGCTGCG GGACTGGCTG TTCTCCCGCC AGCGGTACTG GGGCGAGCCG TTCCCGATCG TCTTCGATGT CGACGGGCTG CCCCACGCGG TTCCCGACGA GCTGCTGCCG ATCGAACTGC CGGAGATGAC CGACTTCCGG CCCACGGCGA TGGCCGAGGA CGACGCGAGC GACCCGGTGC CCCCCCTGGC CCGGGTGGCC GACTGGGTGA CGGTCACCCT GGATCTCGGC GACGGGCCGA AGCAGTACCG GCGCGAGACG AACACCATGC CGCAGTGGGC CGGTTCGTGT TGGTACTACC TGCGCTACCT GGACCCGACC AACACCGAGC GCTTCGTCGA CCCGACCGTC GAGCGCTACT GGATGGCCAG GCCGGGCGCG GTTCCCGGCG ACGGCGGCGT CGATCTGTAC GTCGGCGGTG TCGAGCACGC CGTGCTGCAC CTGCTCTACG CCCGGTTCTG GCACAAGGTG CTCTACGACC TGGGCCACGT CTCCACCAAG GAGCCGTTCA AGCGGCTGTT CAACCAGGGA TACATCCAGG CGGATGCCTT CACCGACGCC CGGGGCATGT ACGTCCCGGC GGCCGAGGTG ACGGCGACCC CCGACGGCCG GTTCCTCTTC CAGGGCGCCC CGGTCAACCG GCGCTCGGGC AAGATGGGCA AGAGCCTGAA GAACAGCGTC AGCCCGGACG AGATGTACGA CAGGTTCGGC GCCGACACGC TGCGCGTCTA CGAGATGGCG ATGGGCCCGC TCGACGCTGA CCGGCCATGG CACACCGACG ACATCGTCGG TTCGCACCGG TTCCTCCAGC GGCTGTGGCG CACCGTCGTC GACGAAACCA CCGGGGCGGC CGCCGTCGTT GACGAGCCGT TGGACGACGA GGCTCTTCGC GTCCTGCACC GGACGATCCT CACGGTCACC GCCGAATACG CGGGGCTGCG GTTCAACACC GCGGTCGCCC GGCTCATCGA ACTAACCAAC TTCGTCAGCA AGAGCTACGG GAAATCCCCC ACCCCCCGCG CGCTCGCCGA GCCGCTCACC CTGATGGCGG CCCCGCTGGC CCCGCACATC GCCGAGGAAC TGTGGTCCCG CCTCGGTCAC GAGGAGTCGG TCAGCACGGT CGCCTTCCCG ATCGGGGATC CGGCGCTGGC CGCCGAGTCG GTCAGGACGA TCCCGGTCCA GGTGAACGGG AAGGTCCGGT TCACCATCGA GGTTCCGGAC GGTTCAGCGG AGCAGACGGT TCGCGATCTG CTCGCCGCAC ATCCCGAGTT CGCCCGGCAG ACCGATGGTC GGACGATCAA GAAGATCATC GTCGTGCCCG GTCGGATCGT GAATATCGCC ATCTCCCCCG CCTAG
|
Protein sequence | MARAMSETAE PGARTGAADT TVAPTGASGG IIPAAAGTAG GAPAGTGSVE PSFRYDARLA ADIERRWQRR WADEGTFNSP NPVGPLSTGF EKVAGREPFY IMDMFPYPSG SGLHVGHPLG YIGTDVFARY LRMSGRHVLH PFGYDSFGLP AEQYAINTGQ HPRDTTNANI ANMRRQLSRL GLGHDTRREI ATTDVGYYRW TQWIFQQIFN SWYDPQAGRA RPIAELIEEF AAGTRAPVAG PAGGNTAVSV DAVRAANPAG LAWTELDEVS RRKVVNAHRL AYISEQLVNW CPGLGTVLAN EEVTADGRSD IGNYPVFRRP LKQWILRITA YAERLISDLD LVDWPDSIKQ MQRNWISPSE GASVEFTVVA PGEEAGASDP SGSSTARRIE VYTTRPDTLA GATFLVLAPE HPLADALIAD TWPADTPVSW RFPAGRPGGG TEPADTAGPE AGADPAWTPR AAVDAYREFA AHRSDRQRGE EVIDRTGVFT GSYVRNPVGG GVIPVFLADY VLLGYGTGAI MAVPAHDSRD FSFARAFDLP IPAVLEPDAD WYAAHGVVPA TPSAQWPEAF SGAGEYRPGP ASAPVLVGLS KSEAIKATVH WLEEIGAGRS ARSYRLRDWL FSRQRYWGEP FPIVFDVDGL PHAVPDELLP IELPEMTDFR PTAMAEDDAS DPVPPLARVA DWVTVTLDLG DGPKQYRRET NTMPQWAGSC WYYLRYLDPT NTERFVDPTV ERYWMARPGA VPGDGGVDLY VGGVEHAVLH LLYARFWHKV LYDLGHVSTK EPFKRLFNQG YIQADAFTDA RGMYVPAAEV TATPDGRFLF QGAPVNRRSG KMGKSLKNSV SPDEMYDRFG ADTLRVYEMA MGPLDADRPW HTDDIVGSHR FLQRLWRTVV DETTGAAAVV DEPLDDEALR VLHRTILTVT AEYAGLRFNT AVARLIELTN FVSKSYGKSP TPRALAEPLT LMAAPLAPHI AEELWSRLGH EESVSTVAFP IGDPALAAES VRTIPVQVNG KVRFTIEVPD GSAEQTVRDL LAAHPEFARQ TDGRTIKKII VVPGRIVNIA ISPA
|
| |