Gene Information |
Locus tag | Nmul_A0565 |
Symbol | |
ID | 3784785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 651780 |
End bp | 652730 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810647 |
Product | homoserine kinase |
Protein accession | YP_411265 |
Protein GI | 82701699 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
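The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length of 951 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


gene_len = feature_length(651780, 652730)
print(gene_len)                  # 951
print(protein_length(gene_len))  # 316
```

Both values match the Gene Length and Protein Length rows of this record.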
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.588642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTCT TCACCATAGT TACCCATGAG CAGCTTTCCG CATGGCTGAG AAATTATTCC ATCGGCAAAT TGGTCAATCT GCAGGGTATT TCTTCCGGCA TTGAAAATAC GAATTATTTT GTCACTACCA GCCACGGCAA ATATGTTCTG ACACTATTTG AAAAACTGAC TTCGGCGGAA CTACCCTACT ATTTGAATCT GATGGCCTAC CTGGCGCGCC ACGGCCTTCC CTGCCCCAGC CCAGTGGCGG ATCTCGCCAA TAAATTCCTG GGAGAGCTCA ATGGCAAGCC GGCGAGCATC GTCACCTGCC TTCCCGGAAA GTCGCAGGAA TCTCCCACGG CCACCCATTG CGCCGAGGTT GGCGAACTCC TGGCCAACAT GCATTTATCC GGCTTATCCT ATCCGGAAAA AATGGAGAAT TTGCGAGGGC CCAGATGGTG GAAAGCTGCG GCGCAGGAAG TCATGCCGTT TCTTTCGGAG GATGAGGCTG CGATTCTCGG GGAGGAATTA CGCTTTCAAT CATCGCATAG AACGGAAAGC CTTCCCCGGG GCGTGATCCA TGCCGACCTG TTTCGTGACA ATGTCCTTTT CAAGGATGGC GCAATGGGCG GAGTGATCGA TTTCTACTTT GCCTGCAACG ATGTGCTGCT GTACGACCTG GCGATTACCG CGAACGACTG GTGTCTCAAC GAAAATGCGG AATTGGATCC GGAGCGGACT TTATCGCTGC TCGAAGCCTA TCACCGCACC CGGCCGCTGC TGGAGATTGA GCGTGACGCC TGGCCAGTGA TGCTGCGCGC CGGCGCGCTG CGCTTCTGGC TTTCACGCCT GCAGGACTAT CACCTGCCGC GCGCCGGAGA ACTTACGCAC GTGAAGGATC CTGCTCACTT CATGCGTATA CTGCAAAGCC ATGCAGCAGC ACGCTCGAAG CTTGCCGAGG TGTGGATCTG A
|
Protein sequence | MSVFTIVTHE QLSAWLRNYS IGKLVNLQGI SSGIENTNYF VTTSHGKYVL TLFEKLTSAE LPYYLNLMAY LARHGLPCPS PVADLANKFL GELNGKPASI VTCLPGKSQE SPTATHCAEV GELLANMHLS GLSYPEKMEN LRGPRWWKAA AQEVMPFLSE DEAAILGEEL RFQSSHRTES LPRGVIHADL FRDNVLFKDG AMGGVIDFYF ACNDVLLYDL AITANDWCLN ENAELDPERT LSLLEAYHRT RPLLEIERDA WPVMLRAGAL RFWLSRLQDY HLPRAGELTH VKDPAHFMRI LQSHAAARSK LAEVWI
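The gene sequence is shown in blocks of 10 bases and already reads in the coding direction (it begins with ATG and ends with TGA), even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, translate the sequence, and compute GC content. It assumes the amino acid assignments of the standard codon table, which translation table 11 shares (table 11 differs in which start codons are permitted); it does not handle alternative start codons, which is sufficient here only because this gene starts with ATG. All function names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGTCTGTCT TCACCATAGT TACCCATGAG"))  # MSVFTIVTHE
# Final four codons of the gene sequence above, including the stop codon.
print(translate("GTGTGGATCTG A"))                     # VWI
```

The two printed strings correspond to the first ten and the last three residues of the protein sequence listed in this record.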