Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1424 |
Symbol | ileS |
ID | 3903155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 1714744 |
End bp | 1717953 |
Gene Length | 3210 bp |
Protein Length | 1069 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637878761 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_480530 |
Protein GI | 86740130 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.292782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0637422 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCC CTCTGCGGCA TCCCACCTTC GCTCCGCTGC CCGCCCAGGT GGACCTGCCC GCCCTGGAGC GGGAGACGTT GGCCCGCTGG CGCGACACCA AGGTGTTCCA CCGTTCGCTG GAGGCCACCG CGGACCGCCC GCTGTGGGTT TTCTACGAGG GCCCACCCAC CGCCAACGGC AGGCCGGGGG CGCACCACGT GGAGGCCAGG GTCTTCAAGG ACCTGTTCCC CCGCTACCGG ACGATGAAGG GCTACCACGT TCCCCGGCGG GCGGGCTGGG ACTGCCACGG GCTGCCCGTC GAGCTCGCGG TCGAGAAGGA GCTCGGCTTC ACCAGCAAAA ACGACATCGA GGCGTTCGGC ATCGCCGAGT TCAACGCCCG GTGCCGCGAG TCGGTGCTGC GCCATGTCGC GGACTTCTCC GCGATGACCG AGCGGATGGG CTATTGGGTC GACCTCGATG GCGCCTACCG CACCATGGAC ACCAGCTACG TCGAGAGCGT CTGGTGGTCG CTCAAGCAGA TCTTCGACCA GGGCCTGCTG GTCGAGGACT TCCGGGTCAC CCCGTACTGC CCGCGCGACG AAACGCCGCT TAGTGACCAC GAGGTGTCCC AGGGCTACTC GGACGTCGAC GACCCCTCGG TCTACGTCCG TTTCCCGCTC GTCGCCGACG CCCTGGGCCT CGCCGGGCAG GGTGCGCAGC TGCTGGTATG GACGACGACC CCGTGGACGC TGGTGTCGAA CACGGCGGTG GCCGTTCATC CGGAGGTGGA GTACGTCCTG GCCCGCGCGG GGGACGGGGA GTTGTTCGTC GTCGCCGAAC CACTGGTCAC CGCGGCCCTC GGCGAGGACG CCGAGATCGT CGAGCGGTTC CGCGGGGCGG AGCTGGCCGG TGCCCGCTAC ACCCGCCCGT TCGAGCTGCT GGCGGCCGAG CGGTTCGCGG CCGGCACCGG CGTCCCGCAC TCGGTCGTGC TCGCCGACTA CGTGACGACC ACGGATGGCA CGGGCCTGGT CCACCAGGCG CCAGCGTTCG GCGCGGAGGA CCTCGCAGTC TGCCGGGCGA GCGGGCTGCC GGTGGTGAAC CCGATCGGGA CGGACGGTCG CTTCCTCGCC GACGTCCCCC TGGTCGGCGG GATGTTCTTC AAGGATGCCG ACGCGCCGCT GACCGCCGAC CTGCGCGAGC GGGGCCGGCT GTGGCGGGCG TCGACGTATA CGCACAGTTA TCCGCACTGC TGGCGCTGTC ACACACCGTT GATCTACTAT CCGCTGCCGT CCTGGTACAT CCGGACCACC GCCATCCGCG ACGAGCTGCT GGCGCAGAAC GAGCGGACGA CCTGGCACCC CGAGCGGATC AAGACCGGTC GGTACGGCGA GTGGCTGCGC GGCAACGTCG ACTGGGCGCT GTCCCGCAAC CGATACTGGG GGACACCGCT GCCGGTCTGG CGCTGCGACG ACGACCCCAC CCACCTGGTG TGCGTCGGGT CACTCGCGGA GCTCTCCGAG CTCGCCGGGC GGAACCTGGC CGACCTCGAC CCGCACCGCC CGTTCGTCGA CGAGGTCACC GGGACCTGCC CGACCTGTGG CGGCGCCTCG CACCGGGTGC CCGAGGTGAT CGACGTCTGG TATGACAGCG GCGCGATGCC GTTCGCCCAG TGGGGCGCCC CGCACCACAA CCTCGCGGCG TTCACCCGGC AGTACCCGGC GCAGTACATT TGCGAGGCGA TCGACCAGAC CCGCGGCTGG TTCTACACGA TGATGGCGGT CGGCACGCTG GTGTTCGGCC GCTCCTCCTA CGAGACGGTG CTCTGTCTCG GCCTGCTCCT GGACGCCGAC GGCCGCAAGA TGAGCAAGCA TCTCGGCAAC GTGCTCGATC CCTTCGAGCT GTTCGAGCGG CACGGCGCGG ACGCGGTCCG CTGGCTGATG CTCGCCGGCG GCTCGCCGTG GGCGGACCGC CGGGTGAGTC ACGAGGCGAT CGAGGACATC GTCCGCAAGG TCCTGCTCAC CTACTGGAAC ACCTCGTCCT TCTTCGCTCT CTATGCCGGG GCAGCCGGCT GGCGCCCAGG CGCGGACCCG GCCGCGGACC CGCGCGCGAC ACCGCCGGCC CGACGGCACG TGCTGGACCG CTGGGCGCTG TCCGAGCTCG CGGCCACCGT CGCCGAGGTG GACGATGCGC TGGAGAACTT CGACTCGCTG CGGGCCGGGC GGCGGATCGC CCGGTTCGTC GACGACCTGT CCAACTGGTA CGTCCGCCGG TCCCGCCGCC GGTTCTGGGC CGGCGACGCC GACGCCCTGA GCACCCTGCA CACCTGCCTG GACGCGCTGA CCCGGGTGAT GGCGCCGTTC ACGCCGTTCC TCACCGACTG GCTGTGGTCA CGGCTGTTCG CCGACGCATC CCCGCGGACC CCCGACTCGG TGCACCTGGC CGCCTGGCCC GAGCTCCCGG CGGGGCTGCA CACGCCGGAG CTCTCGGAGC AGATGGATCT CGTCCGGCGG ATCGTGGAAC TCGGCCGCGC CGCCCGGGCC GCCAGCGGGG TGCGCACCCG CCAGCCGTTG CCGCGGGCGG TCGTCGGCGC GAGTGCCTTT GACGAGCTCT CCCCCGAGCT GATCGCGCAG ATCACCGAGG AGCTCAACGT GACCACGGTG GAGCCGGCGA CCTCGGAGGT CGTCGACATC TCGGTGAAGC CGAACTTCCG GGCGCTGGGG CGGCGCTTCG GCAGGAACAC CAAGGCGGCC GCCGCGGCCA TCGCGGCCGC CGGTCCTCCC GTCAACGGAC GGCTCACCGT CACCGTTGAC GGGGAGGACG TCGAGCTGTC CGGGGACGAG CTGATCATCA CGGAGACGCC GCGGCAGGGC TGGGCGGTCA CCGCCGAGTC CGGGCTCTCC GTCGCCCTCG ACCTGGAGAT CTCCCCGCAG CTCGCCCGCG CCGGGCTCGC CCGCGACGTC GTCCGGGTGC TCCAGGACGC GCGCAAGGCG GCGGGGCTGG AGATCACCGA CCGGGTGGAC GTCTCCTGGG CGGCGACGCG CGAGGAGACC GCGCTCGCTC TGCGTACCCA CGGTCAGACG GTGGCCGAGG AGGTGCTGGC GGTCTCCTTC ACCGAGGCGG CCCGCACGGA GCTACCTGCG GCGCAGCCGC GCGAGACGGC AGCCCGCTCG GCGGCCGAGG AGCTGGGCCT GGCGTTCACG CTCACCCGGC ACGAGACGAC CGGCGGCTGA
|
Protein sequence | MSTPLRHPTF APLPAQVDLP ALERETLARW RDTKVFHRSL EATADRPLWV FYEGPPTANG RPGAHHVEAR VFKDLFPRYR TMKGYHVPRR AGWDCHGLPV ELAVEKELGF TSKNDIEAFG IAEFNARCRE SVLRHVADFS AMTERMGYWV DLDGAYRTMD TSYVESVWWS LKQIFDQGLL VEDFRVTPYC PRDETPLSDH EVSQGYSDVD DPSVYVRFPL VADALGLAGQ GAQLLVWTTT PWTLVSNTAV AVHPEVEYVL ARAGDGELFV VAEPLVTAAL GEDAEIVERF RGAELAGARY TRPFELLAAE RFAAGTGVPH SVVLADYVTT TDGTGLVHQA PAFGAEDLAV CRASGLPVVN PIGTDGRFLA DVPLVGGMFF KDADAPLTAD LRERGRLWRA STYTHSYPHC WRCHTPLIYY PLPSWYIRTT AIRDELLAQN ERTTWHPERI KTGRYGEWLR GNVDWALSRN RYWGTPLPVW RCDDDPTHLV CVGSLAELSE LAGRNLADLD PHRPFVDEVT GTCPTCGGAS HRVPEVIDVW YDSGAMPFAQ WGAPHHNLAA FTRQYPAQYI CEAIDQTRGW FYTMMAVGTL VFGRSSYETV LCLGLLLDAD GRKMSKHLGN VLDPFELFER HGADAVRWLM LAGGSPWADR RVSHEAIEDI VRKVLLTYWN TSSFFALYAG AAGWRPGADP AADPRATPPA RRHVLDRWAL SELAATVAEV DDALENFDSL RAGRRIARFV DDLSNWYVRR SRRRFWAGDA DALSTLHTCL DALTRVMAPF TPFLTDWLWS RLFADASPRT PDSVHLAAWP ELPAGLHTPE LSEQMDLVRR IVELGRAARA ASGVRTRQPL PRAVVGASAF DELSPELIAQ ITEELNVTTV EPATSEVVDI SVKPNFRALG RRFGRNTKAA AAAIAAAGPP VNGRLTVTVD GEDVELSGDE LIITETPRQG WAVTAESGLS VALDLEISPQ LARAGLARDV VRVLQDARKA AGLEITDRVD VSWAATREET ALALRTHGQT VAEEVLAVSF TEAARTELPA AQPRETAARS AAEELGLAFT LTRHETTGG
|
| |