Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_5819 |
Symbol | ileS |
ID | 7380605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011988 |
Strand | - |
Start bp | 838925 |
End bp | 841888 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643649355 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_002547592 |
Protein GI | 222106801 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGA CGACTGATAA AAAAGACTAT TCCGCCACGC TGGCCCTGCC GCAGACGGAG TTTCCAATGC GCGCAGGCCT GCCGCAAAAA GAGCCGGAAA TTGTTGCCCG CTGGCAGCAG ATGGGGCTTT ACAAGAAGCT CCGCGCCTCC GCCGCTGGCC GCGAAAAATT CGTGCTGCAC GATGGCCCTC CCTACGCCAA CGGCAATATC CACATCGGCC ACGCGCTGAA CAAGATCCTC AAAGACGTGA TCACCCGCTC ATTCCAGATG CGCGGGTTTG ATAGCAATTA CGTCCCCGGA TGGGATTGCC ACGGCCTTCC GATCGAGTGG AAGATCGAGG AAAAATACCG CGAAAAGGGC AAGGACAAGA ACGAAGTTCC GATCAACGAA TTCCGTCAGG AATGCCGTGA TTTTGCCGCT GGTTGGATCA AGGTGCAATC GGAAGAGTTC AAACGCCTCG GCATCGAAGG CGATTTTGAA AATCCCTACA CCACGATGAA TTTCCACGCG GAAGCCCGCA TCGCTGGCGA ATTGCTGAAA ATCGCCAAGA GTGGCCAGCT CTATCGTGGC TCCAAGCCGA TCATGTGGTC GGTGGTGGAA CGCACCGCCT TGGCCGAGGC CGAGGTGGAG TATCACGAGG TTGAAAGTGA TGCGATCTGG GTGAAGTTTC CGGTTAACGT TAAATCACCA ATCGTGGGTA GAGACGAAGA CGGTCGCATT GGTAAAGGCA GGACGGAAGC CGAATGGCAG CTTAATAAAT CATCCGTCGT CATCTGGACC ACCACGCCCT GGACCATCCC CGGAAACCGT GCGATCTCGT TTTCGTCTAA GATCGAATAT GGCCTGTATG AAGTCACCGA GGCTGCCAAT GATTTTGGCC CACAGCCGGG TGAAAAGCTG ATCTTTGCAG CGAAGCTTGC AGAGGAAGCC GCCAAGAAAG CGAAGCTGAC ATTCAATCTA GTTCGTCCTG TTACAGCAGA CGAACTCGCG TCGATCACCT GCGCCCATCC ACTCGCCGAT CTCGGCTACG ACTTCAAAGT CCCACTCATC GACGGCGATC ACGTCACCGA TGATGCGGGT ACTGGCTTCG TCCATACAGC GCCAAGCCAT GGTCGTGAAG ACTTTGACGC ATGGATGTCG GCTGCCCGCG CCTTGGAAGC CAGTGACATC TCCACCAAAA TTCCTTTCAC CGTCGATGAC GCTGGCTTCT ACACCGAAGA TGCCCCCGGC TTTGGCCCAT CGGCTGAAGG CGGCGCTGCC CGCGTGATGG ATGACAATGG CAAGAAGGGC GATGCCAACG AGCGCGTCAT CAAGGCCCTG ATCGCTGCCA ACAACCTGTT TGCCCGTGGC CGGATCAAGC ATGATTATCC GCATTCATGG CGCTCCAAGA AGCCGGTGAT CTTCCGCAAC ACGCCGCAAT GGTTTGTCTA TATGGACAAG GAATTGGGCG ATGGCACCAC GCTGCGCGCG CGCGCGCTTG GCGCCATCGA TGACACCCGT TTCGTGCCCG CCGCTGGCCA GAACCGTCTG CGTGCGATGA TCGAAGGCCG TCCCGATTGG GTGCTGTCGC GTCAGCGCGC ATGGGGCGTT CCGATCTGCG TGTTTGCCGA TGAGCAGGGT AATATTTTTC CAGACAAAGA CGGAAGCGTC GAGAAGCGCA TCCTCCAAGC CTTTGAGGTG GAAGGTGCGG ATGCATGGTT TGCTGACGGT GCCAAGGAGC GCTTCCTTGA AGGCGTGCCA AACCAAGAGC GCTGGACGCA GGTTCGTGAC ATTCTCGATG TGTGGTTCGA TTCGGGCTGC ACCCACACGT TTACGCTGGA AGACCGCCCG GACTTGAAAT GGCCTGCCGA TGTCTATCTC GAAGGCTCGG ATCAGCATCG CGGCTGGTTC CACTCATCCC TGTTGGAATC CTGCGCTACC CGTGGCCGTG CGCCCTATAA TGCCGTCATC ACCCATGGTT TCACCATGGC GGAAGATGGT CGCAAGATGT CGAAATCCAT CGGCAATACC ATTTCGCCTC AGGATGTTAT GGCCCAATCC GGTGCCGATA TCCTGCGCCT TTGGGTGATG AACACCGATT ATTGGGAAGA CCAGCGTCTG GGCAAAGCCA TCATCCAGAC CAATGTCGAT GCCTATCGCA AGATCCGCAA TACGGTCCGC TGGATGCTCG GCACGCTTGC CCATGACCAT GGCGAAGATA TTGCCTATGA GGCCTTGCCG GAGCTGGAAA AGCTGATGCT GCATCGGCTA GCCGAGCTGG ATGTGCTTGT GCGCGACAGC TATGACGCTT TCGAGTTCAA GAAGATCACC CGTGCGCTGA CCGATTTTGC CAATGTCGAG CTGTCGGCCT TCTATTTCGA TATCCGCAAG GATGCGCTCT ATTGCGACGC GCCATCATCG CCGCGCCGCC GTGCGTCCCT GTTCGTGATC CGCAAGCTGT TCGATTGCAT GGTGCTGTGG CTGGCGCCAA TGCTGCCCTT CACCACAGAG GAAGCCTGGC TGTCGCGCAA CCCGGATGCT GTTTCCGTGC ATCTGGAACA GTTCCCGTCA ATCCCGGCGC AGTGGCTGAA CCAAGTGCTG GACGGCAAAT GGGCAAAGAT CCGCAAGGTG CGCAGCGTCG TGACCGGCGC GCTGGAAGTG GAGCGCAAGG ACAAGCGCAT CGGCTCCTCG CTGGAAGCAG CTCCCGTCGT GCATATTGCC GATGCCGATC TGCTGGCAGC CCTTGAGGGT CAGGATTTCG CCGAAATCTG CATCACCTCG GCCATTTCGG TGGTTCAGGG CGAGGGTCCG TCCGATGCCT TCCGGTTGTC AGATGTAGGG GCGGTCTCCG TCGAGCCGAA ATTGGCGCAG GGCCGCAAAT GCGCCCGCTC CTGGCGGATC ACCGACGATG TCGGCTCCGA CCCTGATTAT CCCGATGTTT CGGCACGGGA TGCGGCGGCC TTGCGTGAAC TGACACTTGG TTGA
|
Protein sequence | MTETTDKKDY SATLALPQTE FPMRAGLPQK EPEIVARWQQ MGLYKKLRAS AAGREKFVLH DGPPYANGNI HIGHALNKIL KDVITRSFQM RGFDSNYVPG WDCHGLPIEW KIEEKYREKG KDKNEVPINE FRQECRDFAA GWIKVQSEEF KRLGIEGDFE NPYTTMNFHA EARIAGELLK IAKSGQLYRG SKPIMWSVVE RTALAEAEVE YHEVESDAIW VKFPVNVKSP IVGRDEDGRI GKGRTEAEWQ LNKSSVVIWT TTPWTIPGNR AISFSSKIEY GLYEVTEAAN DFGPQPGEKL IFAAKLAEEA AKKAKLTFNL VRPVTADELA SITCAHPLAD LGYDFKVPLI DGDHVTDDAG TGFVHTAPSH GREDFDAWMS AARALEASDI STKIPFTVDD AGFYTEDAPG FGPSAEGGAA RVMDDNGKKG DANERVIKAL IAANNLFARG RIKHDYPHSW RSKKPVIFRN TPQWFVYMDK ELGDGTTLRA RALGAIDDTR FVPAAGQNRL RAMIEGRPDW VLSRQRAWGV PICVFADEQG NIFPDKDGSV EKRILQAFEV EGADAWFADG AKERFLEGVP NQERWTQVRD ILDVWFDSGC THTFTLEDRP DLKWPADVYL EGSDQHRGWF HSSLLESCAT RGRAPYNAVI THGFTMAEDG RKMSKSIGNT ISPQDVMAQS GADILRLWVM NTDYWEDQRL GKAIIQTNVD AYRKIRNTVR WMLGTLAHDH GEDIAYEALP ELEKLMLHRL AELDVLVRDS YDAFEFKKIT RALTDFANVE LSAFYFDIRK DALYCDAPSS PRRRASLFVI RKLFDCMVLW LAPMLPFTTE EAWLSRNPDA VSVHLEQFPS IPAQWLNQVL DGKWAKIRKV RSVVTGALEV ERKDKRIGSS LEAAPVVHIA DADLLAALEG QDFAEICITS AISVVQGEGP SDAFRLSDVG AVSVEPKLAQ GRKCARSWRI TDDVGSDPDY PDVSARDAAA LRELTLG
|
| |