Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4057 |
Symbol | ileS |
ID | 5901519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4389177 |
End bp | 4392086 |
Gene Length | 2910 bp |
Protein Length | 969 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564578 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001685680 |
Protein GI | 167648017 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.446405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACG ACGCCACGAC GACCGCCCGC GACTACCGCG AGACCGTCTT CCTCCCCGAC ACCCCGTTCC CGATGCGCGG CGGCCTGCCC AAGAAGGAGC CCGAAATCCT CGAGGCCTGG GCGGCCCTGT CGGACAAGGG CCTGTACGGC GCCGTGCGCG CCGCGCGCCA GGCCGCCGGC CGCCCGCTGT TCGTGCTGCA CGACGGCCCG CCCTACGCCA ACGGCGCGAT CCACATCGGC CACGCCCTGA ACAAGACGCT GAAGGACTTC GTGGTCCGCT CGCGCTTCGC CCTGGGCTAC GACGTCGACT ACGTGCCCGG CTGGGACTGC CACGGCCTGC CGATCGAGTG GAAGATCGAG GAAGAATTCC GCGCCAAGGG CCGCCGCAAG GACGAGGTGC CCGCCGCTGA GTTCCGCAAG CGCTGCCGTG AATACGCCGC CGGCTGGATC GAGGCCCAGA AGGTCGAGTT CCAGCGCCTG GGCGTGCTGG GTGACTGGTG GAACCGCTAC GCCACCATGG ACTACGCCAG CGAGGCGACC ATCGTCGCCG AGTTCCACAA GTTCGCCACC AGCGGCCAGC TCTATCGCGG CTCCAAGCCG GTGATGTGGA GCCCCGTCGA GCGCACGGCC CTGGCCGACG CCGAGATCGA GTACCACGAC CACACCAGCC CGACGGTGTG GGTGAAGTTC CCGGTCAAGA GCACCGACGC CAGCATCGTG ATCTGGACCA CCACGCCGTG GACGATCCCG GCCAACCGGG CAATCAGCTT CAACCCCAAG GTCGAGTACG GTCTCTATGA AGTGGCGGCG CTCGAGGAGA ACCTCGAATT CGCCCCCTGG TCCAAGCCGG GCGACAGGCT GGTGGTGGCC GACAAGCTGG CCGAGGACGT GCGCAAGGCC GCCAAGGTGG CGAGCTGGAA GCGCTTGAGC GCCTTCGACC CGACCGATCT CATCGCCGCC CACCCTCTGG CCCTGTTCGA TGACGGCTAT GACTTCGACG TGCCGCTGCT GGCCGGCGAC CACGTGACCG ACGACGCCGG CACCGGCTTC GTCCACACAG CTCCAGGCCA CGGCGCCGAC GACTATCTGG TGTGGCTCAA GAGCGGCTAC GCCCTGGACG CCATCCCCGA CACCGTCGAC CCCGACGGGG CCTACTATCC GCACGTGCCG CTGTTCGCGG GCCTGAAGGT CATCGAGACC GAGGGCAAGA AGGCCGGCAA GTTCGGTCCG GCCAACGGCG CGGTCATGGA CAAGCTGATC GAGGCCGGAA ACCTGCTGGC GCGCGGCCGG GTCGAGCACA GCTATCCGCA CAGCTGGCGC TCCAAGGCCC CGGTGATCTT CCGCAACACC CCGCAGTGGT TCATCCGCAT GGACCAACCC GTCCCGACCC TGGGCGGCAA GACCCTGCGC GAGGTGGCGG TCAACGCCAT CGCACAGACC GCCTTCCACC CCGAGGCCGG GCGCAACCGT ATCGGTTCGA TGGTCGAGTC GCGCCCCGAC TGGCTGATCA GCCGCCAGCG CGCCTGGGGC ACGCCGCTGG CCATGTTCGT CGACAAGGAG ACCGGCGTCC CGCTGATGGA CGAGGCGGTC AACCGCCGCA TCCTCGACGC CATCCAGGAC GGCGGCAGCG ACGCCTGGTT CGAGCTGCCG GACGAGCACT TCCTGGGCGA CCGCGACCCG GCCCAGTACG AGAAGGTCGT CGACATCCTC GACGTCTGGT TCGACAGCGG CGCCACCCAC GCCTTCACCC TGGAAGGCCG CAACGACAGC CGCTCGCCCG CCGACCTCTA TCTGGAGGGC AGCGACCAGC ACCGCGGCTG GTTCCAGTCC AGCTTGCTGG AGAGCTGCGG CACGCGCGGC CGCGCGCCGT ACAAGGGCGT CCTGACCCAC GGGTTCACCC AGGACGAGAA CGGCGAGAAG ATGTCCAAGT CCAAGGGCAA CACCGTCGAG CCCCAGACCA TCACCAAGGA AAGCGGCGCC GAGATCCTGC GGCTGTGGGC GGCGATGGTC GACTATTCCG AGGATCAGCG GATCGGCAAG ACGATCCTGG CCACGACGAC CGACGCCTAT CGCAAGCTGC GCAACACCAC CCGCTACCTG CTGGGCGCCC TGGCCGGCTT CGACGAGGCC GAGCGGGTCA CCGACTACGC CGACTTCCCG CCGCTGGAGA GGTACATCCT GCACCGCCTG TGGGAGCTGG ATGGCCAGGT GAAGGCCGCC TACGAGGCCT ATCGCTTCAG CGACGTGATC CGGCCGCTGA TCGACTTCTG CCAGGGCGAC CTGTCCAGCC TGTTCTTCGA CATCCGCAAG GACAGCCTCT ATTGCGACGC GCCCCCGGCT CTGCGCCGCC GAGCCTATCG CACGGTGCTC GATTACGTGT TCGAGCGCCT GACGGTGTGG CTGTCGCCGC TGACGAGCTT CACCATGGAA GAGGCCTGGA CGACGCGCTT CCCCGAGGCG GGCAGCAACG TGCTGCGGGT GATGCCGGAG ACGCCGGACG CCTGGCGCAA CGACGCCGAG GCCGCGCGGT GGGCCAAGGT CGAGACCGTC ACCTCGGTGG TGACCTCGGC CCTGGAGGTC GAGCGCCGCG ACAAGCGCAT CGGCTCGGCC CTCGAAGCCG CGCCGGTGGT GCACATCAGC GAGCCCGCCC TGCTGGCCGC CTTCGACGGC CTGGACGCCG CCGAGGTGTT CCGCACCAGC GCCGCGACCC TGGTCGCGGG TGACGCGGCG AACGCCTTCG CGCTGGACGA GGTCAAGGGC GTGGCCGTCG AGGTCAAGCT GGCCCAAGGC AAGAAGTGCG CCCGCTCGTG GCGCATCCTG CCGGAGGTGG GAACCGATCC CCGCTATCCG GAGCTGTCCT TGCGCGACGC CGAAGCGGTG GCGTGGTGGG ATGGCCGGCA CGCTTCCTAG
|
Protein sequence | MADDATTTAR DYRETVFLPD TPFPMRGGLP KKEPEILEAW AALSDKGLYG AVRAARQAAG RPLFVLHDGP PYANGAIHIG HALNKTLKDF VVRSRFALGY DVDYVPGWDC HGLPIEWKIE EEFRAKGRRK DEVPAAEFRK RCREYAAGWI EAQKVEFQRL GVLGDWWNRY ATMDYASEAT IVAEFHKFAT SGQLYRGSKP VMWSPVERTA LADAEIEYHD HTSPTVWVKF PVKSTDASIV IWTTTPWTIP ANRAISFNPK VEYGLYEVAA LEENLEFAPW SKPGDRLVVA DKLAEDVRKA AKVASWKRLS AFDPTDLIAA HPLALFDDGY DFDVPLLAGD HVTDDAGTGF VHTAPGHGAD DYLVWLKSGY ALDAIPDTVD PDGAYYPHVP LFAGLKVIET EGKKAGKFGP ANGAVMDKLI EAGNLLARGR VEHSYPHSWR SKAPVIFRNT PQWFIRMDQP VPTLGGKTLR EVAVNAIAQT AFHPEAGRNR IGSMVESRPD WLISRQRAWG TPLAMFVDKE TGVPLMDEAV NRRILDAIQD GGSDAWFELP DEHFLGDRDP AQYEKVVDIL DVWFDSGATH AFTLEGRNDS RSPADLYLEG SDQHRGWFQS SLLESCGTRG RAPYKGVLTH GFTQDENGEK MSKSKGNTVE PQTITKESGA EILRLWAAMV DYSEDQRIGK TILATTTDAY RKLRNTTRYL LGALAGFDEA ERVTDYADFP PLERYILHRL WELDGQVKAA YEAYRFSDVI RPLIDFCQGD LSSLFFDIRK DSLYCDAPPA LRRRAYRTVL DYVFERLTVW LSPLTSFTME EAWTTRFPEA GSNVLRVMPE TPDAWRNDAE AARWAKVETV TSVVTSALEV ERRDKRIGSA LEAAPVVHIS EPALLAAFDG LDAAEVFRTS AATLVAGDAA NAFALDEVKG VAVEVKLAQG KKCARSWRIL PEVGTDPRYP ELSLRDAEAV AWWDGRHAS
|
| |