Gene Caul_4057 details

Gene Information

Locus tag: Caul_4057
Symbol: ileS
ID: 5901519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4389177
End bp: 4392086
Gene Length: 2910 bp
Protein Length: 969 aa
Translation table: 11
GC content: 69%
IMG OID: 641564578
Product: isoleucyl-tRNA synthetase
Protein accession: YP_001685680
Protein GI: 167648017
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
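
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4389177
end_bp = 4392086
gene_length_bp = 2910
protein_length_aa = 969

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so a CDS of n codons
# encodes n - 1 amino acids.
codons, remainder = divmod(span, 3)

print(span == gene_length_bp)            # span agrees with Gene Length
print(remainder == 0)                    # length is a whole number of codons
print(codons - 1 == protein_length_aa)   # codons minus stop agrees with Protein Length
```

Under these assumptions the three fields are mutually consistent: 2910 bp is 970 codons, one of which is the stop.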


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.446405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACG ACGCCACGAC GACCGCCCGC GACTACCGCG AGACCGTCTT CCTCCCCGAC 
ACCCCGTTCC CGATGCGCGG CGGCCTGCCC AAGAAGGAGC CCGAAATCCT CGAGGCCTGG
GCGGCCCTGT CGGACAAGGG CCTGTACGGC GCCGTGCGCG CCGCGCGCCA GGCCGCCGGC
CGCCCGCTGT TCGTGCTGCA CGACGGCCCG CCCTACGCCA ACGGCGCGAT CCACATCGGC
CACGCCCTGA ACAAGACGCT GAAGGACTTC GTGGTCCGCT CGCGCTTCGC CCTGGGCTAC
GACGTCGACT ACGTGCCCGG CTGGGACTGC CACGGCCTGC CGATCGAGTG GAAGATCGAG
GAAGAATTCC GCGCCAAGGG CCGCCGCAAG GACGAGGTGC CCGCCGCTGA GTTCCGCAAG
CGCTGCCGTG AATACGCCGC CGGCTGGATC GAGGCCCAGA AGGTCGAGTT CCAGCGCCTG
GGCGTGCTGG GTGACTGGTG GAACCGCTAC GCCACCATGG ACTACGCCAG CGAGGCGACC
ATCGTCGCCG AGTTCCACAA GTTCGCCACC AGCGGCCAGC TCTATCGCGG CTCCAAGCCG
GTGATGTGGA GCCCCGTCGA GCGCACGGCC CTGGCCGACG CCGAGATCGA GTACCACGAC
CACACCAGCC CGACGGTGTG GGTGAAGTTC CCGGTCAAGA GCACCGACGC CAGCATCGTG
ATCTGGACCA CCACGCCGTG GACGATCCCG GCCAACCGGG CAATCAGCTT CAACCCCAAG
GTCGAGTACG GTCTCTATGA AGTGGCGGCG CTCGAGGAGA ACCTCGAATT CGCCCCCTGG
TCCAAGCCGG GCGACAGGCT GGTGGTGGCC GACAAGCTGG CCGAGGACGT GCGCAAGGCC
GCCAAGGTGG CGAGCTGGAA GCGCTTGAGC GCCTTCGACC CGACCGATCT CATCGCCGCC
CACCCTCTGG CCCTGTTCGA TGACGGCTAT GACTTCGACG TGCCGCTGCT GGCCGGCGAC
CACGTGACCG ACGACGCCGG CACCGGCTTC GTCCACACAG CTCCAGGCCA CGGCGCCGAC
GACTATCTGG TGTGGCTCAA GAGCGGCTAC GCCCTGGACG CCATCCCCGA CACCGTCGAC
CCCGACGGGG CCTACTATCC GCACGTGCCG CTGTTCGCGG GCCTGAAGGT CATCGAGACC
GAGGGCAAGA AGGCCGGCAA GTTCGGTCCG GCCAACGGCG CGGTCATGGA CAAGCTGATC
GAGGCCGGAA ACCTGCTGGC GCGCGGCCGG GTCGAGCACA GCTATCCGCA CAGCTGGCGC
TCCAAGGCCC CGGTGATCTT CCGCAACACC CCGCAGTGGT TCATCCGCAT GGACCAACCC
GTCCCGACCC TGGGCGGCAA GACCCTGCGC GAGGTGGCGG TCAACGCCAT CGCACAGACC
GCCTTCCACC CCGAGGCCGG GCGCAACCGT ATCGGTTCGA TGGTCGAGTC GCGCCCCGAC
TGGCTGATCA GCCGCCAGCG CGCCTGGGGC ACGCCGCTGG CCATGTTCGT CGACAAGGAG
ACCGGCGTCC CGCTGATGGA CGAGGCGGTC AACCGCCGCA TCCTCGACGC CATCCAGGAC
GGCGGCAGCG ACGCCTGGTT CGAGCTGCCG GACGAGCACT TCCTGGGCGA CCGCGACCCG
GCCCAGTACG AGAAGGTCGT CGACATCCTC GACGTCTGGT TCGACAGCGG CGCCACCCAC
GCCTTCACCC TGGAAGGCCG CAACGACAGC CGCTCGCCCG CCGACCTCTA TCTGGAGGGC
AGCGACCAGC ACCGCGGCTG GTTCCAGTCC AGCTTGCTGG AGAGCTGCGG CACGCGCGGC
CGCGCGCCGT ACAAGGGCGT CCTGACCCAC GGGTTCACCC AGGACGAGAA CGGCGAGAAG
ATGTCCAAGT CCAAGGGCAA CACCGTCGAG CCCCAGACCA TCACCAAGGA AAGCGGCGCC
GAGATCCTGC GGCTGTGGGC GGCGATGGTC GACTATTCCG AGGATCAGCG GATCGGCAAG
ACGATCCTGG CCACGACGAC CGACGCCTAT CGCAAGCTGC GCAACACCAC CCGCTACCTG
CTGGGCGCCC TGGCCGGCTT CGACGAGGCC GAGCGGGTCA CCGACTACGC CGACTTCCCG
CCGCTGGAGA GGTACATCCT GCACCGCCTG TGGGAGCTGG ATGGCCAGGT GAAGGCCGCC
TACGAGGCCT ATCGCTTCAG CGACGTGATC CGGCCGCTGA TCGACTTCTG CCAGGGCGAC
CTGTCCAGCC TGTTCTTCGA CATCCGCAAG GACAGCCTCT ATTGCGACGC GCCCCCGGCT
CTGCGCCGCC GAGCCTATCG CACGGTGCTC GATTACGTGT TCGAGCGCCT GACGGTGTGG
CTGTCGCCGC TGACGAGCTT CACCATGGAA GAGGCCTGGA CGACGCGCTT CCCCGAGGCG
GGCAGCAACG TGCTGCGGGT GATGCCGGAG ACGCCGGACG CCTGGCGCAA CGACGCCGAG
GCCGCGCGGT GGGCCAAGGT CGAGACCGTC ACCTCGGTGG TGACCTCGGC CCTGGAGGTC
GAGCGCCGCG ACAAGCGCAT CGGCTCGGCC CTCGAAGCCG CGCCGGTGGT GCACATCAGC
GAGCCCGCCC TGCTGGCCGC CTTCGACGGC CTGGACGCCG CCGAGGTGTT CCGCACCAGC
GCCGCGACCC TGGTCGCGGG TGACGCGGCG AACGCCTTCG CGCTGGACGA GGTCAAGGGC
GTGGCCGTCG AGGTCAAGCT GGCCCAAGGC AAGAAGTGCG CCCGCTCGTG GCGCATCCTG
CCGGAGGTGG GAACCGATCC CCGCTATCCG GAGCTGTCCT TGCGCGACGC CGAAGCGGTG
GCGTGGTGGG ATGGCCGGCA CGCTTCCTAG
 
Protein sequence
MADDATTTAR DYRETVFLPD TPFPMRGGLP KKEPEILEAW AALSDKGLYG AVRAARQAAG 
RPLFVLHDGP PYANGAIHIG HALNKTLKDF VVRSRFALGY DVDYVPGWDC HGLPIEWKIE
EEFRAKGRRK DEVPAAEFRK RCREYAAGWI EAQKVEFQRL GVLGDWWNRY ATMDYASEAT
IVAEFHKFAT SGQLYRGSKP VMWSPVERTA LADAEIEYHD HTSPTVWVKF PVKSTDASIV
IWTTTPWTIP ANRAISFNPK VEYGLYEVAA LEENLEFAPW SKPGDRLVVA DKLAEDVRKA
AKVASWKRLS AFDPTDLIAA HPLALFDDGY DFDVPLLAGD HVTDDAGTGF VHTAPGHGAD
DYLVWLKSGY ALDAIPDTVD PDGAYYPHVP LFAGLKVIET EGKKAGKFGP ANGAVMDKLI
EAGNLLARGR VEHSYPHSWR SKAPVIFRNT PQWFIRMDQP VPTLGGKTLR EVAVNAIAQT
AFHPEAGRNR IGSMVESRPD WLISRQRAWG TPLAMFVDKE TGVPLMDEAV NRRILDAIQD
GGSDAWFELP DEHFLGDRDP AQYEKVVDIL DVWFDSGATH AFTLEGRNDS RSPADLYLEG
SDQHRGWFQS SLLESCGTRG RAPYKGVLTH GFTQDENGEK MSKSKGNTVE PQTITKESGA
EILRLWAAMV DYSEDQRIGK TILATTTDAY RKLRNTTRYL LGALAGFDEA ERVTDYADFP
PLERYILHRL WELDGQVKAA YEAYRFSDVI RPLIDFCQGD LSSLFFDIRK DSLYCDAPPA
LRRRAYRTVL DYVFERLTVW LSPLTSFTME EAWTTRFPEA GSNVLRVMPE TPDAWRNDAE
AARWAKVETV TSVVTSALEV ERRDKRIGSA LEAAPVVHIS EPALLAAFDG LDAAEVFRTS
AATLVAGDAA NAFALDEVKG VAVEVKLAQG KKCARSWRIL PEVGTDPRYP ELSLRDAEAV
AWWDGRHAS
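
Both sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before the text can be used for length or composition checks. The sketch below shows one way to do that in plain Python; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCCGACG ACGCCACGAC GACCGCCCGC GACTACCGCG AGACCGTCTT CCTCCCCGAC"

seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on the first line
print(round(gc_fraction(seq), 2))  # GC fraction of this fragment only
```

Applied to the full gene sequence, the same two functions give a length and GC fraction that can be compared with the Gene Length and GC content fields.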