Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1479 |
Symbol | |
ID | 4071649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1789269 |
End bp | 1790216 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983488 |
Product | homoserine kinase |
Protein accession | YP_590555 |
Protein GI | 94968507 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC TCGCCGAACG CGGAGAAATC GAGATTACGG TCCCGGCCTC GGTGGCAAAC CTGGGGCCGG GATTCGACGT GCTTGCCGTG GCGGTGCAGC TTTATTTGCG GCTGAAAGTG CGGGCGATCG AGGGCAGAAA CGAGCTTCGT TTCAATTTCA TTGGGCAGCA ACTCAAGGGC GACAACTATA TCGAGCGCGC GTTCAATTTC CTCGCGCGGC AGCACAGCGG GTCGTTCCCT TCCCTTGAAG TGGATGTGCA TTGCGACATT CCTATACGTT CGGGTCTGGG AAGCAGTGCC GCAGCCACGG TTGCGGGTTT GCGCTTGTAC GAGGCAATCA TGGAGCCGAT GCAGGCGCGC GATCTGCTGA ATGCCGCGGT GGCGCTGGAA GGACATCCCG ATAATGTTTC TGCTGCGCTG CTCGGTGGCA TGACTGCAAG TTGCCAGCTT CCCGATGGTT CGACTTCAGC GGTCTCGATG CCCTGGCCGG CATCGCTGTG CCTGATCGTT GCGACACCCG AATACAATCT CTCGACGTCG GCAGCGCGGA GCGTGCTGCC GGAGAGAGTT TCGCGACACG ATGCGGTATT TAACCTCCAG CGAATGGCGC ATCTGTTGCA CGCGTTGCAA AGCGAAGACT TCTCTCTTCT TCACGAAGCG CTTTCTGATC GCCTGCATCA GCCACATCGG CAGAAACTCA TTCCCGGACT CGACCAAGCG TTGATGCTTG ATCATCCTGA CATTCTTGGC GTGTGCCTGA GCGGCGCGGG ACCGTCCATT GTGTGCTTCG CGACGCAGAG CTTTAGCGAG ATCGAGCGAA TGCTAGCCAA TATCTACGAG GGGCTTGGAC TTCCCTACCA GGTTCGTACG CTGGCTGTGC ACCGCAATGA CGAAGCTCCG GTCAGCGAGG TCCCTCCAAA CGAACCGGAT CCTTCGGTTT TCGCGTAA
|
Protein sequence | MSALAERGEI EITVPASVAN LGPGFDVLAV AVQLYLRLKV RAIEGRNELR FNFIGQQLKG DNYIERAFNF LARQHSGSFP SLEVDVHCDI PIRSGLGSSA AATVAGLRLY EAIMEPMQAR DLLNAAVALE GHPDNVSAAL LGGMTASCQL PDGSTSAVSM PWPASLCLIV ATPEYNLSTS AARSVLPERV SRHDAVFNLQ RMAHLLHALQ SEDFSLLHEA LSDRLHQPHR QKLIPGLDQA LMLDHPDILG VCLSGAGPSI VCFATQSFSE IERMLANIYE GLGLPYQVRT LAVHRNDEAP VSEVPPNEPD PSVFA
|
| |