Gene Information |
Locus tag | Daud_1058 |
Symbol | |
ID | 6026596 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 1114298 |
End bp | 1115515 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641593870 |
Product | proposed homoserine kinase |
Protein accession | YP_001717202 |
Protein GI | 169831220 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3635] Predicted phosphoglycerate mutase, AP superfamily |
TIGRFAM ID | [TIGR00306] 2,3-bisphosphoglycerate-independent phosphoglycerate mutase, archaeal form; [TIGR02535] proposed homoserine kinase |
|
|
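The coordinate and length fields above can be cross-checked against one another. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions agree with the values reported in this record. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1114298, 1115515)
print(length, protein_length(length))  # prints "1218 405"
```

Both numbers match the Gene Length and Protein Length fields listed for Daud_1058.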
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000198822 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTTTGA AATACCTGGT GTTGGTGGGA GACGGGATGG CCGACGAGCC CCGGCCGGAA CTGGATGGGA TGACGCCGCT GCAATACGCC CGGACGCCAC ACATGGACCT CGTGGCGGCT TGTGGGGAGA TCGGGCAGGT ACGGACGGTG CCGGCCGGGT ACCAACCGGG CAGTGACGTG GCCAATCTCT CGGTGCTGGG TTACGACCCG CAGAAGTATT ATACCGGACG GGCGCCGCTG GAGGCGGTGA GCATGGGGAT CGACTTGCAG GAACGGGACG TCGCCTTCCG CTGTAACCTG GTTACCCTCA CCGACGCGGA ACCCTACGAG GAACGGGAAA TGGTCGACTA CAGCGCCGGT GAAATCAGCA CGGCCGAGGC CCGCGAACTG ATTCTCTTTT TGGATCGGGA GCTTGGTTCC GAAGCGCTCC GCTTCTATCC GGGCGTCAGC TACCGGCACC TGCTGGTATG GCGGGAAGGA CCCGTGGAAA CCCGGCTCAC CCCGCCGCAC GACATTCCGG GAATGGAGGT GCGCGCGCAC TTGCCGGCGG GGGTTGGTGA CGGGGTACTG AAATCGCTGA TGGTACGGAG CGCCGAACTG CTCGCCGGAC ATCCCGTGAA CCGGGCACGC CGGCAGCGAA ACGAGCGCAC GGCAAACTCC ATCTGGTTCT GGGGGCAGGG CCGCCGGCCG GCCCTGCCTC CGTTCCGCGA CCTCTTCGGT CTGCAGGGAG CGGTGATCTC GGCCGTCGAC CTGATCAAGG GCATCGGGCT GTGCGCCGGC CTGGAAAGCA TCGAGGTCGA AGGCGCGACC GGGACGATTC ACACCAACTT CCGCGGAAAA GCGGTGGCGG CGCTGAACGC CCTTGCTTCC GGAGCCGATT TCGTCCTGAT CCACGTGGAG GCTCCGGACG AGGCCAGTCA CCACGGGGAT TTAGAAACGA AAATACGGGC GATCGAAGAG ATCGACCAGC GAGTGTTGGG GGAGGTCCTC CGCGGGGCGA GGGACATCGG CCCTCTCAGG ATAATGGTTC TTCCAGACCA CCCCACGCCG CTCCGAACCA GGACCCATTC CGCCGACCCG GTGCCCTTCG CCATACTGCG GGAAGGAACC GGGGGTGGTC GTAAGGCGGA ACGGGGGTTT GACGAAGTCT CAGCCGCGCG AAGCGGTGTC TATTTTAGAA TCGGCTGCAA GCTGATGCCT TACTTCATCA GGCGCTAG
|
Protein sequence | MLLKYLVLVG DGMADEPRPE LDGMTPLQYA RTPHMDLVAA CGEIGQVRTV PAGYQPGSDV ANLSVLGYDP QKYYTGRAPL EAVSMGIDLQ ERDVAFRCNL VTLTDAEPYE EREMVDYSAG EISTAEAREL ILFLDRELGS EALRFYPGVS YRHLLVWREG PVETRLTPPH DIPGMEVRAH LPAGVGDGVL KSLMVRSAEL LAGHPVNRAR RQRNERTANS IWFWGQGRRP ALPPFRDLFG LQGAVISAVD LIKGIGLCAG LESIEVEGAT GTIHTNFRGK AVAALNALAS GADFVLIHVE APDEASHHGD LETKIRAIEE IDQRVLGEVL RGARDIGPLR IMVLPDHPTP LRTRTHSADP VPFAILREGT GGGRKAERGF DEVSAARSGV YFRIGCKLMP YFIRR
|
| |