Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0672 |
Symbol | |
ID | 5105278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 615041 |
End bp | 615976 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506576 |
Product | homoserine kinase |
Protein accession | YP_001190771 |
Protein GI | 146303455 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000904789 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTTCAG TGAAAGCCAC AGCATTCTCA TCTTCTGCTA ACCTTGGTGC CGGTTACGAC GTGCTTTCCA TGAGCCATTT GGCCTTCAGC GACACTGTTT ACGCGGAGCT TATCAACGAA AAGGAACGCA GGGTCATGAT GGAGTCCAAC TCGAATGTCC CTTTAGATCC AGCTAGAAAC TCAGCAGGCG CACCAGTTGA GGCATTGCTC AAGGAATTTC AATTAAATTA CACGATAAAA CTGAATGTGA TAAAGGGAGT GCCCCACGGA TTAGGCCTAG GAAGTAGCGG AGCATCTGCA GTGGCTGCAG TGGCTGCGGT TAACGAGCTG CTTAACCTCC ACCTGACGCT CGAGGACATC GTGAAGTTTG CTGTGATTGG GGAACAAGCC GTTAGTGGAT CCCCTCACCC AGATAACGTG GCAGCTAGTG CCTTTGGTGG AATAGTGGCA GTGACGTCGC ACGATCCCAT TAGGGTAGTG AGGATACCAA TTAACCTGAA CTTCAGGTTA ATGTTGGTCA TACCGAGGGT TAATACGGGT GAAGGGAAGA CCAAGAAGGC AAGGGAGCTA GTTCCCAAGC AGATAGAGGT TTCAAAGATG GTCGAAAACA CAAGGTTCCT ATCCTCATTC ATTCTAGGAT TAGTTAAGGG AGATAGGGCA CTGGTTAGGG AGGGGCTTAA CGACGCCGTT GTGGAAAGGT CCAGGGAACC AATGTATCCA CACTACCCCA AGTTGAAGGA AATAGCACTG AGGTACGATG CTATTGGAGC TTGCGTAAGT GGAGCCGGAC CTACGGTGTT AATCCTTTAC GATGATTCGA CCAAACTGGG AGAGATCAAA CAGGAGGGTG GGAAAGTATG TGCCCAGCAC GGTTTTCAAT GCGATTTCAT TACCACGGAA ATAGGGGAGG GAGTCAGGGT TGAGGGACTC AACTAA
|
Protein sequence | MRSVKATAFS SSANLGAGYD VLSMSHLAFS DTVYAELINE KERRVMMESN SNVPLDPARN SAGAPVEALL KEFQLNYTIK LNVIKGVPHG LGLGSSGASA VAAVAAVNEL LNLHLTLEDI VKFAVIGEQA VSGSPHPDNV AASAFGGIVA VTSHDPIRVV RIPINLNFRL MLVIPRVNTG EGKTKKAREL VPKQIEVSKM VENTRFLSSF ILGLVKGDRA LVREGLNDAV VERSREPMYP HYPKLKEIAL RYDAIGACVS GAGPTVLILY DDSTKLGEIK QEGGKVCAQH GFQCDFITTE IGEGVRVEGL N
|
| |