Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3817 |
Symbol | |
ID | 7874059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4212592 |
End bp | 4213548 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700759 |
Product | homoserine kinase |
Protein accession | YP_002890783 |
Protein GI | 237654469 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.248965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTCT TCACCCCCGT TCCCGAATCC GTGCTCGCCG ACTGGCTCAA GGACTACGCC ATCGGCCGCC TGGTGGAACT CGAGGGCATC TCGGCCGGCG TGCAGAACAG CAACTTTTTC GTCACCACCA CGCTCGGGCG CTACGTGCTG ACGCTCTTCG AGGGCATCCC GCGCGCCGAG CTGCCCTACT ACCTGCACCT GATGGCGCAC CTGTCGCGCC ACGGCCTGCC CGTGCCGGGC CCGATCGCCA ACCGCCACAA CGAATACCTC GGCACCCTGC AGGAGCGCCC CGCCGCACTG GTGGTACGGC TCTCGGGCCG CTCCGAGATG AGCCCCGGCC TGGCGCACTG CGGGAGGGTC GGCGCCATGC TCGCCGGCCT GCACCTCGCC GGCCAGTCCT ACGGTCGCCG CCAGGAAAAC CCGCGCGGCG CCGCCTGGCG CGCCGCCACC GCGCAGGTGC TGCGGCCCTT CCTGTCCGCC GACGAACAGA CCCTGCTCGA CGCCGAGATC GCCTTCCAGG CCACGGTCGA CGTCGCCGCC CTGCCCGCCG GCGCGATCCA CGCCGACCTC TTCCGCGACA ACGTGCTGTG GGACACCGAT GCCGACGGCG GGGTGCGCAT CGGCGGCGTC ATCGACTTCT ACTTCGCCGG CTACGACGCG CTGCTGTTCG ACGTCGCCGT CACCGTCAAC GACTGGTGCT CCACCCCCGA CGGCGGGCTG GATGCCGAGC GTGCCGCCGC GCTGCTCGAC GCCTACCACG CCGAGCGCCC TTTCACCGAC GCCGAGCGCG CCGCCTGGCC GGCCATGCTG CGCGCGGCCG CGCTGCGCTT CTGGATGTCG CGCGCGGCTG ACTTCCACCT GCCGCGCGCG GGCGAGATGG TGCTGGTGAA GGACCCGAAC GAGTACCGCG ACATCCTGCG CCTGCGCATC GCCACGGCGC CGCCCCTGCC GCGCTGA
|
Protein sequence | MSVFTPVPES VLADWLKDYA IGRLVELEGI SAGVQNSNFF VTTTLGRYVL TLFEGIPRAE LPYYLHLMAH LSRHGLPVPG PIANRHNEYL GTLQERPAAL VVRLSGRSEM SPGLAHCGRV GAMLAGLHLA GQSYGRRQEN PRGAAWRAAT AQVLRPFLSA DEQTLLDAEI AFQATVDVAA LPAGAIHADL FRDNVLWDTD ADGGVRIGGV IDFYFAGYDA LLFDVAVTVN DWCSTPDGGL DAERAAALLD AYHAERPFTD AERAAWPAML RAAALRFWMS RAADFHLPRA GEMVLVKDPN EYRDILRLRI ATAPPLPR
|
| |