Gene Caul_4394 details


Gene Information

Locus tag: Caul_4394
Symbol:
ID: 5901855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4767931
End bp: 4768893
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 69%
IMG OID: 641564912
Product: homoserine kinase
Protein accession: YP_001686012
Protein GI: 167648349
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID: [TIGR00938] homoserine kinase, Neisseria type
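
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes that the gene length includes the terminal stop codon; both are inferences from the values in this record, not documented behaviour of the source database. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4767931, 4768893)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed 963 bp and 320 aa.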


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.814373
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGTCT ATACCGACAT CACCGACGAC GAACTCGCCG CCTTCCTGGG CGACTACGAC 
CTGGGACAGG CCGTGGCGTT CAAGGGCATC GCCGAGGGGG TGGAGAACTC CAACTTCCTG
CTAGAGACTA CGACCGGCCG GTTCATTCTC ACGGTCTACG AGCGGCGCGC CAAGCCCGAG
GACCTGCCCT ATTTCCTCGA CCTGCTGACT TGGCTGGCCG ATCATGGCTA TCCCAGCGCC
AAGCCGATCG CCGACCGGTC GGGCGCGACG CTGAAGACCA TTCGCGGCAA GCCCGCCGCC
CTGGTCGAGT TCCTGCCCGG TCTGTCGGCC CGTCGCCCCA CGGTCGCCCA CTGTCGCGAG
GCCGGCGAGG GCCTGGCTTG GCTGCACCTG GCCGGCGAGG GCTATCCCGC TCGCCGCGCC
AACGACCTGG GCCAGCCGCA CTGGGCCTCG CTGTTTTCCA AACACCGCAA GGACGCCGAG
GGCCTCAAGC CCGGTCTGGC GGCGACCATC GACAAGGATC TGGCCGAGCT GGCGCTGGCC
TGGCCGCGCG GCCTGCCGTC GGGGGTGATC CACGCCGACT ACTTCCCCGA CAACGTGTTC
TTCAAGCCGG ACGGCAAGTT CGCCGCCGCG ATCGACTTCT ATTTCGCCTG CGACGACGCC
TATGCCTACG ACATCGCCGT GACCCTGAAC GCCTGGTGCT TCGAGTCCGA CGGCAGCTTC
AACATCACCG CCGCCCGGGC CCTGATCGCC GGCTATGAGC GCCGCCGGCC GCTGAGCCCG
GCCGAGCGCG ACGCCATCCC GATCCTGGCG CGCGGCGCGG CCATGCGCTT CTTCCTGACC
CGCCTGGCCG ACTGGGGATC GACCCCGGCG GGCGCCCTGG TCAGGCCGAA GGACCCCCTG
GAATACGAGC GCAAGCTGGC CGTCCATCGC GAGGGCCTCA GCCTGTTCGG AGCCGGCGCG
TGA
 
Protein sequence
MAVYTDITDD ELAAFLGDYD LGQAVAFKGI AEGVENSNFL LETTTGRFIL TVYERRAKPE 
DLPYFLDLLT WLADHGYPSA KPIADRSGAT LKTIRGKPAA LVEFLPGLSA RRPTVAHCRE
AGEGLAWLHL AGEGYPARRA NDLGQPHWAS LFSKHRKDAE GLKPGLAATI DKDLAELALA
WPRGLPSGVI HADYFPDNVF FKPDGKFAAA IDFYFACDDA YAYDIAVTLN AWCFESDGSF
NITAARALIA GYERRRPLSP AERDAIPILA RGAAMRFFLT RLADWGSTPA GALVRPKDPL
EYERKLAVHR EGLSLFGAGA
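
The sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared against the listed protein. All function names are hypothetical. The codon table is the standard one; translation table 11 is assumed here to assign the same amino acids to internal codons and to differ only in which start codons it permits, so this sketch does not handle alternative start codons.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCCGTCT ATACCGACAT CACCGACGAC"))  # MAVYTDITDD
```

The first ten residues produced this way match the start of the protein sequence in this record. Passing the full gene sequence to gc_content and translate would allow the listed GC content and protein sequence to be checked in the same way.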