Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0858 |
Symbol | |
ID | 6262580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | - |
Start bp | 947227 |
End bp | 948180 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642611336 |
Product | homoserine kinase |
Protein accession | YP_001875750 |
Protein GI | 187251268 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000127482 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000000719161 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCTTT ATGTAAAGTT AAATAAAGAC GAAATAGAGG CTTTTATAGC GGACTATAAC CTTAAGCTTA TTGATTTTAA GGGTATTATA GAAGGCGTGC AAAACACAAA TTATTTTTTG CTTACTACTT CCGGTAAATA TATTCTTACC GTTTGTGAGG AAGAAATTAA CCCCACCGAC CTTCCTTTTT TTAATTCGGC TATGTTATAC GCCGCTCTGC ATGGCGTTCC GTGTCCGGTT CCTTTAAAAA ACAAATATGG CGCCTTTACG GGTCGGCTTA AAAATAAACC GGCCGGAATA GTAACTTTTT TAGAAGGCAA ATCCGTAACG GACATTACTT TTTCCCATCT TGAAAATCTG GGGCGTTTTT TAGGTAAGCT GCATATTCAA ACTAAAGATT TTAAAGAAGA AAGGGCAAAC CCGTTATGCT TGGATAATGT GACTGAGCTT ATAAGGAAAA ATAAAAAAAT TGATAATATC TCGCCAAATC TTTCTGCCGA AATTAACAAA GAATTAAACC TTGTGTCTGA AGAGCTTAAA AGCTTTTATA ATTTACCCAA AGGTTTTGTG CATGCGGATA TTTTCCCCGA CAATATGTTT TTTGAAGGAA ATAATGTAAG CGGCATTATT GATTTTTATT TTTGCTGCAG CGATTACTTG GCATATGATT TGGCGGTTAC CGCCAACGCC TGGTGTTTTG ATAATAAAGG TTTTGATTAT AACGAGAAAA AAATAAAAAT TTTATTGGAT TCTTACCAAA AAATACGCCC GCTTGAACAA GCAGAAAAAT ATGCTTTTAA CGCGCTTTTG CGCCGCGCGG CCTTAAGGTT TTTTGCAACA AGAGCGTGGG ATATGAAGTA CCCCAAACCA AACGCCGTAG TAGGCGTAAA AGACCCTATG GAATATGTGG CTAAATTAAG GGCTTTTAAG TCGGCGGGGG ATTTGTTTAA ATAA
|
Protein sequence | MALYVKLNKD EIEAFIADYN LKLIDFKGII EGVQNTNYFL LTTSGKYILT VCEEEINPTD LPFFNSAMLY AALHGVPCPV PLKNKYGAFT GRLKNKPAGI VTFLEGKSVT DITFSHLENL GRFLGKLHIQ TKDFKEERAN PLCLDNVTEL IRKNKKIDNI SPNLSAEINK ELNLVSEELK SFYNLPKGFV HADIFPDNMF FEGNNVSGII DFYFCCSDYL AYDLAVTANA WCFDNKGFDY NEKKIKILLD SYQKIRPLEQ AEKYAFNALL RRAALRFFAT RAWDMKYPKP NAVVGVKDPM EYVAKLRAFK SAGDLFK
|
| |