Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1039 |
Symbol | ileS |
ID | 7976822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1087667 |
End bp | 1090438 |
Gene Length | 2772 bp |
Protein Length | 923 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797992 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_002949165 |
Protein GI | 239826541 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0185965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTACA AAGAAACGCT GTTAATGCCA CAAACGGAAT TTCCGATGCG CGGAAACTTA CCGAAGCGTG AACCGGAAAT CCAGAAAAAA TGGGAAGAAA TGGACATTTA TCGAAAAGTG CAAGAACGGA CAAAAGGACG CCCGTTGTTT GTTCTGCATG ACGGGCCGCC GTACGCGAAC GGCGATATTC ATATGGGACA CGCTTTAAAT AAAATTTTAA AAGATATTAT CGTCCGCTAT AAATCGATGA GCGGCTATTG CGCGCCATAC GTTCCTGGAT GGGATACACA TGGACTTCCG ATTGAAACAG CGCTCACGAA AAAGGGAGTT GACCGAAAAT CGATGAGCGT TGCGGAATTT CGCAAGCTTT GTGAACAATA TGCGTATGAA CAAATTAATA ATCAGCGCGA GCAATTTAAG CGTCTTGGCG TGCGCGGCGA TTGGGAAAAT CCATATATTA CGTTGAAACC GGAATATGAA GCACAACAAA TTAAAGTGTT TGGCGAGATG GCGAAAAAAG GACTTATTTA TAAAGGCCTA AAGCCGGTAT ATTGGTCTCC ATCAAGCGAA TCGGCGTTGG CGGAAGCGGA AATTGAGTAT AAAGACAAAC GTTCCCCTTC CATTTATGTT GCATTCCCAG TGAAAGATGG AAAAGGAGTG CTTGACGGGG ATGAAAAAAT CGTCATTTGG ACAACAACGC CTTGGACGAT TCCAGCAAAC TTAGCGATTG CCGTCCATCC AGACCTTGAT TATCAAGTGG TGGAAACAAA CGGTGCAAAA TATGTCGTTG CTGCTGCGTT ATTAGAATCT GTTGCGAAAG AAATCGGCTG GGACGAAGTA ACGGTTGTGA AAACGATCAA AGGAAAAGAC TTAGAATATG TCGTGGCAAA ACATCCGTTT TATGACCGCG ATTCTTTAGT CATTTGCGGT GAACATGTTA CGACAGACGC CGGAACAGGA TGCGTTCATA CCGCGCCTGG ACATGGGGAA GATGACTTTA TTGTCGGGCA AAAATACGGC TTGGATGTGC TATGTCCAGT CGATGAGCGC GGCTATATGA CAAGCGAAGC GCCAGGCTTT GAAGGACTGT TTTACGATGA AGCAAACAAA GCGATTACGC AAAAATTAGA AGAAGTCGGG GCGCTTTTAA AACTCAGTTT CATCACTCAC TCGTATCCGC ACGACTGGCG TACGAAAAAG CCGACGATTT TCCGTGCTAC GACACAATGG TTTGCATCCA TCGATAAAAT TCGCGGTGAA CTGCTTCAGG CCATTAAAGA AACAAAATGG ATTCCGGAAT GGGGCGAAAT CCGCATCCAT AATATGATTC GCGACCGCGG CGATTGGTGC ATTTCTCGTC AACGTGCATG GGGGGTACCG ATTCCAGTCT TTTACGGGGA AAACGGTGAA CCAATTATTA CGGACGAAAC GATTGAACAT GTATCTAACT TGTTCCGTCA ATATGGTTCG AACGTTTGGT TTGAGCGTGA GGCGAAAGAT TTATTACCGG AAGGATTTAC TCATCCATCA AGCCCGAACG GTATCTTTAC GAAAGAAATA GACATTATGG ATGTATGGTT TGATTCCGGT TCTTCCCATC AGGCAGTGCT TGTAGAACGT GATGATTTAC AACGCCCGGC AGATTTATAT TTAGAAGGTT CCGACCAATA TCGCGGCTGG TTTAACTCTT CGCTTTCGAC TGCCGTCGCT GTTACCGGTA AAGCACCGTA TAAAGCTGTA TTAAGCCACG GATTTGTTCT CGATGGAGAA GGACGGAAAA TGAGCAAATC GCTTGGCAAT GTCGTTGTGC CGGCGAAAGT GATGGAACAG CTTGGTGCCG ACATTTTGCG TTTATGGGTT GCTTCTGTTG ATTATCAAGC GGACGTTCGC ATTTCTGACA GCATTTTAAA ACAAGTGGCG GAAGTATATC GCAAAATTCG CAATACGTTC CGCTTTATGC TCGGAAACTT ATTCGACTTT AATCCAGAAA CAGATGCGGT TCCGGTTAAC GAATTACGTG AAGTCGACCG CTACATGATT GTAAAATTAA ACCATTTGAT TGAAAACGTA AAGCATGCCT ATGAAACGTA TGATTTTGCG TCCATTTATC ACGATGTGAA CAATTTCTGT ACTGTTGATT TAAGTGCTTT CTATTTAGAT TTTGCAAAAG ATATTTTATA TATCGAAGCG CCAAACGATC GCGCCCGCCG TTCGATTCAG ACGGTATTGT ACGAAACGGT TGTCGCGTTA ACAAAGCTTG TAGCGCCGAT TTTGCCACAT ACTGCAGAGG AAGTATGGGA GCACATTCCA AATCGAAAGG AAAAAGAAGA AAGCGTTCAA CTTGTTGATA TGCCAGAGAC AATAAACATT GACGAAGAAG ACGCGATTGT TGCAAAATGG GATGCGTTTA TGAACTTGCG CGATGATGTG TTAAAAGCGT TAGAAGTGGC GCGCAACGAA AAAGTGATTG GCAAATCGTT GACCGCAAGC GTTACGGTAT ATCCGACAAA AGAAGCGCGA CAATTGCTTG GATCGATCGA AGAAGATCTA AAACAGTTAT TTATCGTATC CGAGTTTACG ATTGCAGATG ACTATGAACA TGCTCCAGAA GATGCGCAAA AATTGGCCAA TGTGGCCGTT ATCGTCAAAC CGGCGGAAGG AGAAACATGC GAGCGTTGCT GGGTGGTGAC TCCGGAAGTA GGAAAAGATG CGGATCATCC GACATTATGC CCTCGATGCG CCCATATTGT GAAAGAACAT TATTCCGCCT AA
|
Protein sequence | MDYKETLLMP QTEFPMRGNL PKREPEIQKK WEEMDIYRKV QERTKGRPLF VLHDGPPYAN GDIHMGHALN KILKDIIVRY KSMSGYCAPY VPGWDTHGLP IETALTKKGV DRKSMSVAEF RKLCEQYAYE QINNQREQFK RLGVRGDWEN PYITLKPEYE AQQIKVFGEM AKKGLIYKGL KPVYWSPSSE SALAEAEIEY KDKRSPSIYV AFPVKDGKGV LDGDEKIVIW TTTPWTIPAN LAIAVHPDLD YQVVETNGAK YVVAAALLES VAKEIGWDEV TVVKTIKGKD LEYVVAKHPF YDRDSLVICG EHVTTDAGTG CVHTAPGHGE DDFIVGQKYG LDVLCPVDER GYMTSEAPGF EGLFYDEANK AITQKLEEVG ALLKLSFITH SYPHDWRTKK PTIFRATTQW FASIDKIRGE LLQAIKETKW IPEWGEIRIH NMIRDRGDWC ISRQRAWGVP IPVFYGENGE PIITDETIEH VSNLFRQYGS NVWFEREAKD LLPEGFTHPS SPNGIFTKEI DIMDVWFDSG SSHQAVLVER DDLQRPADLY LEGSDQYRGW FNSSLSTAVA VTGKAPYKAV LSHGFVLDGE GRKMSKSLGN VVVPAKVMEQ LGADILRLWV ASVDYQADVR ISDSILKQVA EVYRKIRNTF RFMLGNLFDF NPETDAVPVN ELREVDRYMI VKLNHLIENV KHAYETYDFA SIYHDVNNFC TVDLSAFYLD FAKDILYIEA PNDRARRSIQ TVLYETVVAL TKLVAPILPH TAEEVWEHIP NRKEKEESVQ LVDMPETINI DEEDAIVAKW DAFMNLRDDV LKALEVARNE KVIGKSLTAS VTVYPTKEAR QLLGSIEEDL KQLFIVSEFT IADDYEHAPE DAQKLANVAV IVKPAEGETC ERCWVVTPEV GKDADHPTLC PRCAHIVKEH YSA
|
| |