Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0003 |
Symbol | |
ID | 4070013 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2996 |
End bp | 3931 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637982003 |
Product | glucokinase |
Protein accession | YP_589082 |
Protein GI | 94967034 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000615827 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.192124 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGGG CTGTCGATAT CGGCGGAACA AAGATCGCTG TGGGCGTGGT GGACGCAGAC GGCGTGGTGA TTGCGAGCGA CGAATGTCCC ACCGAAGCGA AGCGTGGGTA TGCTGATGCG CTGAACCGGA TCAGTGCGAT GTTGCGTGCC TGTGCCGAGA AAAGCGGCGA GGTGATCACG GGGGTTGGAA TCGGCAGCAC CGGCCCAGTC GATCCGCTTA CGGGCGAAAT CGGCAACGCC GAGTTCATCA AGGAGTGGAT GGGCTGCAAT CCGGTGCGCG ACCTGGCCGA ACGGTTCGGC GTGAAGGTCG CAATGGAGAA CGACGCGGAT GCCGCTGCTC TTGGCGAGGC AGCATGGGGT GCTGGCCGCG GTCGCAAGCA CATGATCTTC GTAACCGTGG GAACCGGGAT CGGTGGCGGC ATTATTCTTG GCGGCAGGCT CTATCGTGGC GCAGATGGTG CGCACCCGGA GATTGGACAC TACACGATGG ATTCTTCTGG CCCTCTCTGC TTCTGCGGCA TCCATGGTTG CTGGGAGGTA CTGTGCGCAG GACCGGCGAT GGGCGCGTGG ATGACTTCGC AAGCGCCTGC CGATTGGCCG CCTGAAGACT TCTCTGCCAA GCGCATTTGC GAACGCGCGC GTGAGGGCGA TCCTATTGCG AAACGGGGGG TGGAGCGGGA AGCACACTAT CTCGGGCTGG GCGTCGCGAA CCTGATCACG CTATTTACGC CGGAGGTCAT TGTTCTCGGA GGCAACGTGA TGCGAAGTGC GGATTTGTTC ATGGAACAGA TCCACGCCGA GGTCCGTCGC TGCTGCACCC AGGTTCCCTA CGAGAAGACG GATATCCGGC TCGCCTCGCT GGGACCTCAA ACCGGACTGG TCGGCGCCGC GCGGGTTTGG CATCATCGAT TTCGGCAAGA TGGGGAGGTC GCGTGA
|
Protein sequence | MIGAVDIGGT KIAVGVVDAD GVVIASDECP TEAKRGYADA LNRISAMLRA CAEKSGEVIT GVGIGSTGPV DPLTGEIGNA EFIKEWMGCN PVRDLAERFG VKVAMENDAD AAALGEAAWG AGRGRKHMIF VTVGTGIGGG IILGGRLYRG ADGAHPEIGH YTMDSSGPLC FCGIHGCWEV LCAGPAMGAW MTSQAPADWP PEDFSAKRIC ERAREGDPIA KRGVEREAHY LGLGVANLIT LFTPEVIVLG GNVMRSADLF MEQIHAEVRR CCTQVPYEKT DIRLASLGPQ TGLVGAARVW HHRFRQDGEV A
|
| |