Gene Caul_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1747 
Symbol 
ID5899202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1837019 
End bp1838728 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content67% 
IMG OID641562237 
Productmethionyl-tRNA synthetase 
Protein accessionYP_001683374 
Protein GI167645711 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0143] Methionyl-tRNA synthetase 
TIGRFAM ID[TIGR00398] methionyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.733283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGCA TCCTGATCAC CTCCGCCCTG CCGTACATCA ACGGCGTCAA GCACCTGGGC 
AATCTGGCGG GATCGATGTT GCCGGCCGAC GTCTATGCGC GGTTCCAGCG GGCCCGGGGC
AACGACACGC TCTATATCTG CGCCACCGAC GAGCATGGCA CTCCGGCCGA GCTGGCGGCC
GCCGCCGCTG GCCAGGACGT CGCCGAGTAC TGCGCCGAGC AGCATGTTCT GCAGCATGAC
GTCGGCCGGG CGTTCGGCCT GTCGTGGGAC CATTTCGGCC GCTCGTCCTC GCCGCAGAAC
CACCGCCTGA CCCAGCATTT CTGCGAGGTG CTGGAGGAGC GCGGCCTGAT CGAGGAACGC
GTCGATCAGA TGGTCTATTC CATCGACGAC GCGCGCTTCC TGCCCGACCG CTACATCGAG
GGCACCTGCC CGCACTGCGG CTTCGACAAG GCCCGGGGCG ATCAGTGCGA CAACTGCGGC
AACCTGCTGG ACCCGACCGA GCTGAAGGAT CCCTATTCGG TGATCTCCGG CTCGCGGAAC
CTGGAGGTGC GCGACACCCG TCACCTGTAT TTGCTGCAGA CCAAGATGGC CGACAAGATC
CGGGCCTGGA TCGACAGCCA CCCGGACTGG CAGCTGCTGG CCAAGTCGAT CGCCTACAAG
CACCTGGACG AGGGGCTGAT CGATCGCGGC ATCACCCGGG ATCTCGCCTG GGGCATCCCT
GTGACCAAGG GCGAGTTCCC GCGTCCGGGC TTCGAGGACA AGGTGTTCTA CGTCTGGTTC
GACGCGCCCA TCGAATATAT CGCCGCCACC CAGGAATGGG CGGACGAGGG CGCCGGTCGC
GACTGGAAGT CGTGGTGGCG CACCGACGAG GGCGCGGACG ACGTGCGCTA CGTCCAGTTC
ATGGGCAAGG ACAATGTCGC CTTCCACACG GTCAGCTTCC CGGCGACCAT TCTCGGTTCG
CAAGAGCCTT GGAAGAGCGT CGACATGCTC AAGGCCTTCA ACTGGCTGAA CTGGTACGGC
GGCAAGTTCT CGACCAGCAA CAAGCGCGGC GTGTTCATGG ACGCGGCGCT GGAGATCCTG
CCGCCCGACT TCTGGCGCTG GTATCTGACC TCCAATGCGC CAGAAAGCAG CGACACCGCC
TTCACCTGGG AGCAGTTCGC CAGCGCCGTG AACCGCGACC TGGCCGACGT GCTGGGCAAT
TTCGTCAACC GCATCCTGAA GTTCACGGAA GGCAAGTTCG ACGGCGTGAT CCCCGACGGC
GGCGCCCCCG GCCCGCTGGA GGAGAAGCTC TACGCCGACG TCTCGGCCCG CCTGGCCGAC
CTGACCGAGC AGATGGACGC CGTCGAGGTG CGCAAGAGCG CACAGGCCCT GCGCGCCCTG
TGGGTGGTCG GCAACGAGTA CCTGCAGGAA GCCGCCCCGT GGACGGCGAT CAAGACCGAC
CGCGACCGCG CCGCCGTGAT CGTCCGCACC GCGCTGAACC TGGCGGCGTT GTACGCCCGC
ATCTCCGCGC CGTTCATCCC GTTCGCGGCG GAGAAGATCG GCGAGGCGTT TCAGCTGCCC
TGGCCGCCGG TCTGGCCGAC GACGGACGCC GCCGCCGAGC TCTCCAGCCT GCCGGTCGGG
CTGTCGGTGC GGGCGCCGGA GGTGCTGTTC AAGAAGATCG AGGACGAGCA GATCGCCGAA
TGGACGCGGC GCTTCGGCGG AGCGGAGTAG
 
Protein sequence
MARILITSAL PYINGVKHLG NLAGSMLPAD VYARFQRARG NDTLYICATD EHGTPAELAA 
AAAGQDVAEY CAEQHVLQHD VGRAFGLSWD HFGRSSSPQN HRLTQHFCEV LEERGLIEER
VDQMVYSIDD ARFLPDRYIE GTCPHCGFDK ARGDQCDNCG NLLDPTELKD PYSVISGSRN
LEVRDTRHLY LLQTKMADKI RAWIDSHPDW QLLAKSIAYK HLDEGLIDRG ITRDLAWGIP
VTKGEFPRPG FEDKVFYVWF DAPIEYIAAT QEWADEGAGR DWKSWWRTDE GADDVRYVQF
MGKDNVAFHT VSFPATILGS QEPWKSVDML KAFNWLNWYG GKFSTSNKRG VFMDAALEIL
PPDFWRWYLT SNAPESSDTA FTWEQFASAV NRDLADVLGN FVNRILKFTE GKFDGVIPDG
GAPGPLEEKL YADVSARLAD LTEQMDAVEV RKSAQALRAL WVVGNEYLQE AAPWTAIKTD
RDRAAVIVRT ALNLAALYAR ISAPFIPFAA EKIGEAFQLP WPPVWPTTDA AAELSSLPVG
LSVRAPEVLF KKIEDEQIAE WTRRFGGAE