Gene Caul_4525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4525 
SymbolthrS 
ID5901986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4897398 
End bp4899377 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content66% 
IMG OID641565044 
Productthreonyl-tRNA synthetase 
Protein accessionYP_001686143 
Protein GI167648480 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.082511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC TGATTTTCCC CGACGGCTCG GCCCGCCAGT ACGCTGACGG CTCGACGGGC 
CGCGACGTGG CCGCCTCGAT CTCCAAGTCG CTGGAGAAGA AGGCCCTGCT GATCAAGCTG
GACGGCAAGC TGCTGGACCT GGATCGTCCG CTGACGCCAG ATCTGCTGGG CGGCGGCAAC
CGCTTCGAGA TCATCACGCG AGACTCGCCC GACGCCCTGG AGGTGATCCG GCACGACACC
GCCCACGTCC TGGCGGAGGC CGTGCAGGAA CTGTTCCCCG GCACGCAGGT TACCATTGGG
CCGAACGTCG AGGACGGATT CTACTACGAC TTCGCCCGCG ACGAGCCCTT CAGCCTCGAT
GACCTGCCGA AGATCGAGGA GCGCATGCGC CAGATCGTCG ATCGCGATGA GAAGATCCGC
CGCGAGGAAG TCGATCGCGA CGCGGCTATC GCCGACTTCG AGGCCATGGG CGAAAGCTAC
AAGGCCCAGA TCATCCGTGA CCTGCCGGCC AGCGATACGA TCACGGTCTA TCACCAGGGC
GAAAAGTGGA AGGACCTGTG CCGCGGGCCT CACCTTCCCT CGACCAAGGC GGTGGGCAAG
GCCTTCAAGC TGACGAAGCT GGCCGGCGCC TATTGGCGCG GCGACCAGAA CAACGCCCAG
CTGCAGCGCA TCTACGGCAC GTCGTGGGCC ACCGAAGCCG ACCTCGAAGC GCATCTGAAG
CGCATCGAGG AGGCCGAACG GCGTGATCAC CGCAAGTTGG GCAAGACCAT GGACCTCTTC
CACATCCAGG AAGAGGGCAA GGGCATGGTG TTCTGGCACC CCAAGGGCTG GACGCTGTAC
CTGGCGCTGG AGGCCTATAT GCGCCGCCGG CTCGACGCGG CCGGCTATCG CGAGGTCAAG
ACGCCGCAGA TTCTCGACAA GAGCCTGTGG GAGCGCTCGG GCCACGCCGA GAAGTTCGGC
CACGCCATGT TCATGTGCGA GAGCGCCGAG GGCGAGGTCC TGGCCGTCAA GCCGATGAAC
TGCCCCGGCC ACATCCAGAT CTTCAACGTT GGCCAGAAGA GCTATCGCGA GCTGCCGCTG
CGCATGGCCG AGTTCGGGGC CTGCCACCGC TACGAGCCGT CAGGCGCCAT GCACGGCATC
ATGCGGGTGC GGGCCTTCAC CCAGGACGAC GCTCACATCT TCTGCCGCGA AGAGCAGGTC
ACGGAGGAAA GCGCGCGCTT CATCGAACTA TTGCGCAGCG TCTATAGCGA CCTCGGCATG
CATCTGGCCG ACACCAAGTT CTCGACGCGT CCCGAGCTGC GGGCGGGCGA AGACGCGGTC
TGGGACAAGG CCGAGGCCGC CCTGTCCGCC GCCGCCGAGG CGGCCGGCGA GACGCTGGTG
CTGCAAGAGG GCGAAGGCGC CTTCTACGGC CCCAAGCTGG AATTCTCGCT GAAGGACGCC
ATCGGCCGGG TCTGGCAGTG CGGGACGCTG CAGCTGGACT TCGTACTGCC CGAGCGGCTG
GACGCCGAAT ATGTCGCCGA GGACGGCTCC AAGAAGCGCC CCGTCATGCT GCACCGGGCG
ATTCTCGGCT CGTTCGAGCG GTTCATCGGC ATTCTGCTGG AGAACTACGC CGGCCATCTG
CCGCTTTGGC TGGCGCCGGT TCAGGTCGTC GTGGCCACGA TCACGTCCGA TGCCGACGAT
TACGCACAAC GTGTGGCTGA GCGATTGACT TCGATGGGCA TTCGCGCAGA GGTGGATTTC
AGGAACGAGA AGATCAACTA CAAGATCCGC GAGCACAGCC TCGCAAAGGT GCCGGTGATT
GCAGTGGTCG GGCGTAAGGA AGCCGAGAAC GGAGAGGTCG CCCTGCGTCG CCTTGGCGGC
GAGGGGCAGA AGGTGCTGTC GCTGGAAGAC GCCGTGCGTG CGTTGACCGA GGAAGCGACG
CCGCCGGACC TGGCCCGTGA TCGCGCCGTG GCGGCGCCCG CGGAGTTGGC GCAGGCCTGA
 
Protein sequence
MIDLIFPDGS ARQYADGSTG RDVAASISKS LEKKALLIKL DGKLLDLDRP LTPDLLGGGN 
RFEIITRDSP DALEVIRHDT AHVLAEAVQE LFPGTQVTIG PNVEDGFYYD FARDEPFSLD
DLPKIEERMR QIVDRDEKIR REEVDRDAAI ADFEAMGESY KAQIIRDLPA SDTITVYHQG
EKWKDLCRGP HLPSTKAVGK AFKLTKLAGA YWRGDQNNAQ LQRIYGTSWA TEADLEAHLK
RIEEAERRDH RKLGKTMDLF HIQEEGKGMV FWHPKGWTLY LALEAYMRRR LDAAGYREVK
TPQILDKSLW ERSGHAEKFG HAMFMCESAE GEVLAVKPMN CPGHIQIFNV GQKSYRELPL
RMAEFGACHR YEPSGAMHGI MRVRAFTQDD AHIFCREEQV TEESARFIEL LRSVYSDLGM
HLADTKFSTR PELRAGEDAV WDKAEAALSA AAEAAGETLV LQEGEGAFYG PKLEFSLKDA
IGRVWQCGTL QLDFVLPERL DAEYVAEDGS KKRPVMLHRA ILGSFERFIG ILLENYAGHL
PLWLAPVQVV VATITSDADD YAQRVAERLT SMGIRAEVDF RNEKINYKIR EHSLAKVPVI
AVVGRKEAEN GEVALRRLGG EGQKVLSLED AVRALTEEAT PPDLARDRAV AAPAELAQA