Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4394 |
Symbol | |
ID | 5901855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4767931 |
End bp | 4768893 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564912 |
Product | homoserine kinase |
Protein accession | YP_001686012 |
Protein GI | 167648349 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.814373 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGTCT ATACCGACAT CACCGACGAC GAACTCGCCG CCTTCCTGGG CGACTACGAC CTGGGACAGG CCGTGGCGTT CAAGGGCATC GCCGAGGGGG TGGAGAACTC CAACTTCCTG CTAGAGACTA CGACCGGCCG GTTCATTCTC ACGGTCTACG AGCGGCGCGC CAAGCCCGAG GACCTGCCCT ATTTCCTCGA CCTGCTGACT TGGCTGGCCG ATCATGGCTA TCCCAGCGCC AAGCCGATCG CCGACCGGTC GGGCGCGACG CTGAAGACCA TTCGCGGCAA GCCCGCCGCC CTGGTCGAGT TCCTGCCCGG TCTGTCGGCC CGTCGCCCCA CGGTCGCCCA CTGTCGCGAG GCCGGCGAGG GCCTGGCTTG GCTGCACCTG GCCGGCGAGG GCTATCCCGC TCGCCGCGCC AACGACCTGG GCCAGCCGCA CTGGGCCTCG CTGTTTTCCA AACACCGCAA GGACGCCGAG GGCCTCAAGC CCGGTCTGGC GGCGACCATC GACAAGGATC TGGCCGAGCT GGCGCTGGCC TGGCCGCGCG GCCTGCCGTC GGGGGTGATC CACGCCGACT ACTTCCCCGA CAACGTGTTC TTCAAGCCGG ACGGCAAGTT CGCCGCCGCG ATCGACTTCT ATTTCGCCTG CGACGACGCC TATGCCTACG ACATCGCCGT GACCCTGAAC GCCTGGTGCT TCGAGTCCGA CGGCAGCTTC AACATCACCG CCGCCCGGGC CCTGATCGCC GGCTATGAGC GCCGCCGGCC GCTGAGCCCG GCCGAGCGCG ACGCCATCCC GATCCTGGCG CGCGGCGCGG CCATGCGCTT CTTCCTGACC CGCCTGGCCG ACTGGGGATC GACCCCGGCG GGCGCCCTGG TCAGGCCGAA GGACCCCCTG GAATACGAGC GCAAGCTGGC CGTCCATCGC GAGGGCCTCA GCCTGTTCGG AGCCGGCGCG TGA
|
Protein sequence | MAVYTDITDD ELAAFLGDYD LGQAVAFKGI AEGVENSNFL LETTTGRFIL TVYERRAKPE DLPYFLDLLT WLADHGYPSA KPIADRSGAT LKTIRGKPAA LVEFLPGLSA RRPTVAHCRE AGEGLAWLHL AGEGYPARRA NDLGQPHWAS LFSKHRKDAE GLKPGLAATI DKDLAELALA WPRGLPSGVI HADYFPDNVF FKPDGKFAAA IDFYFACDDA YAYDIAVTLN AWCFESDGSF NITAARALIA GYERRRPLSP AERDAIPILA RGAAMRFFLT RLADWGSTPA GALVRPKDPL EYERKLAVHR EGLSLFGAGA
|
| |