Gene Information |
Locus tag | Franean1_5088 |
Symbol | ileS |
ID | 5673423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6088793 |
End bp | 6092038 |
Gene Length | 3246 bp |
Protein Length | 1081 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243939 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001509353 |
Protein GI | 158316845 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
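The coordinates and lengths in the table above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6088793, 6092038)
print(length)                  # 3246
print(protein_length(length))  # 1081
```

Both values match the Gene Length and Protein Length rows of the record.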
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00024165 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGTACCC CCGACGCCCC GACGCCCGCG TTCCGCCCAC TGCCCTCGCG GGTCGACCTG CCGGCCTTCG AGCGCGAGAC GCTGGAGCGC TGGAAGGATT CCAAGGTCTT CCACCGCTCG CTGCAGGCCG GCGCAGACCG GCCGCTGTGG GTGTTCTACG AAGGCCCGCC GACCGCCAAC GGCAAGCCGG GCGCCCACCA CGTCGAGGCG CGGGTCTTCA AGGACCTCTT CCCCCGTTAC AGGACGATGA AGGGCTACCA CGTGCCCCGC CGGGCGGGCT GGGACTGCCA CGGCCTGCCT GTCGAGCTCG CCGTCGAGAA GGAACTCGGC TTCACCAGCA AGAGCGACAT TGAGGACTAC GGCATCGCCG AGTTCAACGA GAGGTGCCGC GAGTCCGTCC TGCGCCACGT GGCCGACTTC TCCGCGATGA CGGAGCGGAT GGGCTACTGG GTCGACCTCG ACGGCGCCTA CCGGACGATG GACACCTCGT ACATCGAGAG CGTCTGGTGG TCCCTCAAGC AGATCTTCGA CAAGGGCCTG CTCGTCGAGG ACTTCCGGGT CACCCCGTAC TGCCCCCGGG ACGAGACGCC GCTGTCCGAC CACGAGGTCT CCCAGGGCTA CAGCGACACC GACGACCCGT CGGTCTACGT GCGGTTCCCG GTCGTCGAGG GCCCGCGCGG GCTGGCCGAG CACGGCGCGC ACCTGCTGGT GTGGACGACC ACCCCGTGGA CGCTGGTCTC CAACACCGCG GTCGCCGTCC ATCCCGAGGT CAGCTACGTG CTGGCCCGGG CCGCCGGCGG TGAACTGCTC GTCGTCGCCG AGCCGCTGCT GACCGCGGCC CTGGGCGAGG GCGCCGAGGT CGTCGAGCGC TTCACCGGCG CCGAGCTCGC CGGCACCCGC TATGCCCGGC CGTTCGAGCT GATCGCGCTG GAGCGCTTCG AGGAGGCGGC GGGGAAGACC GCCGTCGGCC GGGGCGCCGG GACATCCGCC CGCCCGCACT CGGTGGTCCT GGCGGACTAC GTGACGACCA CGGACGGCAC CGGGCTCGTG CACCAGGCAC CCGCCTTCGG CGCCGAGGAC CTCGCGGTCT GCCGCGCCAA CGGCCTCGGC GTCGTCAACC CGGTCGGGAC GGACGGGCGT TTCCTGCCCG AGATCCCGCT GGTCGGCGGG CTGTTCTTCA AGGACGCCGA CGCGCCGCTG TCCGCGGACC TCACCGCGCG CGGCCTGATG TGGCGGGCCG CGACCTACAC GCACAGCTAC CCGCGCTGCT GGCGCTGCCA CACACCGCTG ATCTACTACC CGCTGCCGTC CTGGTACATC CGCACGACCG CCGTCCGCGA GCGCCTGCTC GCCGAGAACG CGCGGACGGA CTGGCACCCT GAGCGCATCC GCGAGGGCCG CTACGGCGAG TGGCTGCGCA ACAACGTCGA CTGGGCGCTG TCGCGCAACC GGTACTGGGG CACCCCGCTG CCCGTCTGGC GGTGCGAGGC CGACCCGTCC CACCTGACCT GCGTCGGCTC GCTGGCGGAG CTCTCCGAGC TGACCGGCAC GGACCATTCG ACGCTCGACC CGCACCGCCC GTTCGTCGAC GACGTGACCC TGCCGTGCCC CACCTGTGGG GAGACCGCCC GGCGGGTGCC TGAGGTCATC GATGTCTGGT ACGACAGCGG CGCGATGCCG TTCGCCCAGT GGGGCGCCCC GCACCGCAAC GCCGACGAGT TCGCGAAGCA GTACCCGGCG CAGTACATCT GCGAGGCGAT CGACCAGACC CGGGGCTGGT TCTACACGCT CATGGCCGTG 
GGCACGCTGG TCTTCGACCG GTCGTCCTAC GAGACCGTGC TGTGCCTGGG CCTGCTGCTC GACGCCGAGG GCCGCAAGAT GAGCAAGCAC GTCGGTAACG TGCTGGACCC GTTCGACCTG TTCGACCGGC ACGGGGCGGA CGCCGTCCGG TGGCTGATGC TGGCCGGCGG CTCACCGTGG TCGGACCGCC GGGTCAGCCA CGAGTCGATC GAGGACATCG TCCGCAAGAT CCTGCTCACC TACTGGAACA CCGCGTCGTT CTTCGCGCTC TACGCGGGCG CGGCGGACTG GTGCCCGACC GGCGCCGACG GGTCGGGCGA GGGCGCCGCG GCCCCGGCCG AACGCCCGGT GCTGGACAGG TGGGCGCTCT CCGAGCTCGC CGACACCGTC GCCGAGGTGG ACGCCGCCCT CGAGGGCTTC GACGCGCTGC GCGCCGGGCG CCGCATCGCC CGCTTCGTGG ACGACCTGTC GAACTGGTAC GTCCGGCGTT CCCGCCGCCG CTTCTGGGCG GGCGACCCGA ACGCGCTGGC CACCCTCTAC ACCTGCCTGG ACGGCCTGAC CCGGGTGATG TCGCCGTTCA CCCCGTTCCT GACCGACTGG CTGTGGTCCC GGCTGTTCGC CGGCATCCTG CCCGGCGCCC CGGACTCGGT GCACCTGGCC TCGTGGCCGC AGCTGCCGGC GGACCTCGTC GAGCCGGGCC TGGCCGAGCG GATGGACCTC GTGCGCCGGA TCGTCGAGCT CGGCCGCGCG GCCCGGGCCG CCAGCGGGCT GCGCACCCGT CAGCCGCTGC CGCGTGCGGT GGTCGGCTCC TCGGCGTTCG ACGACCTCTC GGCGGAGCTG CGCGCGCAGA TCGCCGAGGA GCTCAACGTC GTCGCCGTCG AGGCCGCGAC CTCGGACCTC GTCGACATCA GCGTGAAGCC GAACTTCCGG GCGCTGGGAC GGCGCTTCGG CAACCGCACG AAGGCGGTCG CCGCCGCCGT CACCACCGCC GGGGCACCGG TGGACGGCCG GCTGACCGTC ACCGTGGACG GCGAGCGCAT CGAGCTCGGC GGTGAGGATC TGATCGTCAC CGAGACACCG CGCGAGGGCT GGTCGGTGAC GAGTGAGTCG GGCCTGTCGG TCGCCCTCGA CCTGACCGTC ACCCCGGAAC TGGCCCGGAC CGGGCTCGCC CGCGACGTGG TGCGCGTCCT GCAGGACGCG CGGCGGGCCG CGGGGCTGGA GATCACCGAC CGGGTCGAGC TGCGCTGGGT CGCCGCCAAG GAGGAGACGG CGGCAGCGTT GCGTGAGCAC GCGGCGACCG TCGCCGACGA GATCCTGGCC ACAGTGTTCC AGGAGGCACC GGTCGAGCAG ACGGCGGAGC CGGCATGGCA CCGGGGATCA TCCGCGGAGC TCGGCCTGAC CTTCGCGCTG ACCAGGACGA CCCCCGCGCC GCCGACGGTT GGCTGA
|
Protein sequence | MSTPDAPTPA FRPLPSRVDL PAFERETLER WKDSKVFHRS LQAGADRPLW VFYEGPPTAN GKPGAHHVEA RVFKDLFPRY RTMKGYHVPR RAGWDCHGLP VELAVEKELG FTSKSDIEDY GIAEFNERCR ESVLRHVADF SAMTERMGYW VDLDGAYRTM DTSYIESVWW SLKQIFDKGL LVEDFRVTPY CPRDETPLSD HEVSQGYSDT DDPSVYVRFP VVEGPRGLAE HGAHLLVWTT TPWTLVSNTA VAVHPEVSYV LARAAGGELL VVAEPLLTAA LGEGAEVVER FTGAELAGTR YARPFELIAL ERFEEAAGKT AVGRGAGTSA RPHSVVLADY VTTTDGTGLV HQAPAFGAED LAVCRANGLG VVNPVGTDGR FLPEIPLVGG LFFKDADAPL SADLTARGLM WRAATYTHSY PRCWRCHTPL IYYPLPSWYI RTTAVRERLL AENARTDWHP ERIREGRYGE WLRNNVDWAL SRNRYWGTPL PVWRCEADPS HLTCVGSLAE LSELTGTDHS TLDPHRPFVD DVTLPCPTCG ETARRVPEVI DVWYDSGAMP FAQWGAPHRN ADEFAKQYPA QYICEAIDQT RGWFYTLMAV GTLVFDRSSY ETVLCLGLLL DAEGRKMSKH VGNVLDPFDL FDRHGADAVR WLMLAGGSPW SDRRVSHESI EDIVRKILLT YWNTASFFAL YAGAADWCPT GADGSGEGAA APAERPVLDR WALSELADTV AEVDAALEGF DALRAGRRIA RFVDDLSNWY VRRSRRRFWA GDPNALATLY TCLDGLTRVM SPFTPFLTDW LWSRLFAGIL PGAPDSVHLA SWPQLPADLV EPGLAERMDL VRRIVELGRA ARAASGLRTR QPLPRAVVGS SAFDDLSAEL RAQIAEELNV VAVEAATSDL VDISVKPNFR ALGRRFGNRT KAVAAAVTTA GAPVDGRLTV TVDGERIELG GEDLIVTETP REGWSVTSES GLSVALDLTV TPELARTGLA RDVVRVLQDA RRAAGLEITD RVELRWVAAK EETAAALREH AATVADEILA TVFQEAPVEQ TAEPAWHRGS SAELGLTFAL TRTTPAPPTV G
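The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip that formatting and compute a GC fraction for comparison with the GC content row. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
fragment = clean_sequence("ATGAGTACCC CCGACGCCCC GACGCCCGCG")
print(len(fragment))  # 30
print(fragment[:3])   # ATG
print(gc_content(fragment))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length equals the Gene Length value, which is a quick check that no blocks were lost when copying.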