Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2431 |
Symbol | |
ID | 4072865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2870504 |
End bp | 2871535 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984447 |
Product | GHMP kinase |
Protein accession | YP_591506 |
Protein GI | 94969458 |
COG category | [R] General function prediction only |
COG ID | [COG2605] Predicted kinase related to galactokinase and mevalonate kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.696498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.378909 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCGTA AGAAACCCGG CTCTCCTCAA CAGGTAATTG CCGAGGCGTG CTGTCGCGTG GACCTCGCCG GCGGCACCCT CGATCTGTGG CCTCTTTACC TTTTTCATAA AAACTCCGTC ACGGTGAATT TTGGGGTCAA TATCATGACC CGCTGCCAGA TCACCGCCCG CGACGACGAC CACATTTCGC TGATCTCAAA AGACACGCTG CGTGGCGACG ACTTCGAAGA CCTGAAGACG CTGCGTGCGG CGAAAGAACA CCGTCATGCA CTCGCCGCGC AACTGCTGCG CTTCTTCGAG CCGGACTGCG GCTTGAACCT GGAGACGAAT TCCGAATCGC CCGCGGGCGC GGGAATCTCC GGTTCGTCGG CGCTGATGAT CGCCATTACC GCGGCGCTGG CGCGGTTCAC CGGTCGCAAG CTCACGCTGG AGCAGATTCG CACCATCTCG CAAAACGTTG AAGCGCAGGT GATCAACGTT CCTACCGGAT GCCAGGATTA CTATCCGGCG CTTTATGGCG GCGTGAACGC GGTGCATCTG CAGCCGGATG GAATAATCCG CGAGGCGATT GATGTTGCAC CCGAGGAGAT CGAGAAGCGC TTCGTGCTGA TCTATACCGG CGCGCCGCGG CAATCGGGGA CCAACAACTG GGAGGTCTTC AAAGCGCACA TCGACGGCGA CAGCATTGTG CAGCGCAACT TCGACCGCAT CGCCGACATC GCCGACAGCA TGCACCACGC GCTCGCCGCC CACGATTGGG ATGAAGTCGC GCGCCTGCTG CGCGAAGAGT GGAAGCAGCG TCGAACGAAC GCGCCGAACA TCACGACGAA GTTCATTGAT GAACTGATCG AAGTAGCCCG GAAGAAGGGC GCCCGCGCAG CGAAAGTCTG CGGCGCCGGC GGCGGCGGCT GCGTGATCAT CATGACCCAC GAAGATTCCC GCGATAAAGT AAGCGCGGCG CTGGCCGAAG CGGGAGCTAC GGTGTTGCCG TTGCAGGTGG CCCGGAAGGG GCTGCAGGTT CGGAGTAAGT AG
|
Protein sequence | MARKKPGSPQ QVIAEACCRV DLAGGTLDLW PLYLFHKNSV TVNFGVNIMT RCQITARDDD HISLISKDTL RGDDFEDLKT LRAAKEHRHA LAAQLLRFFE PDCGLNLETN SESPAGAGIS GSSALMIAIT AALARFTGRK LTLEQIRTIS QNVEAQVINV PTGCQDYYPA LYGGVNAVHL QPDGIIREAI DVAPEEIEKR FVLIYTGAPR QSGTNNWEVF KAHIDGDSIV QRNFDRIADI ADSMHHALAA HDWDEVARLL REEWKQRRTN APNITTKFID ELIEVARKKG ARAAKVCGAG GGGCVIIMTH EDSRDKVSAA LAEAGATVLP LQVARKGLQV RSK
|
| |