Gene Information |
Locus tag | Arth_1548 |
Symbol | |
ID | 4445915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1725726 |
End bp | 1726817 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639689363 |
Product | glucokinase |
Protein accession | YP_831042 |
Protein GI | 116670109 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
| |
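The coordinate and length fields in the Gene Information block can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information block of this record.
START_BP = 1725726
END_BP = 1726817
GENE_LENGTH_BP = 1092
PROTEIN_LENGTH_AA = 363


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both computed values should agree with the fields listed in the record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1726817 - 1725726 + 1 gives the listed 1092 bp, and 1092 / 3 - 1 gives the listed 363 aa.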
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0569057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACACAC CTTCGCCCGG CGACCTGCCC AGACCCTACC GACGGACGGC CGCGTGGCGG CGCAGGAAGC CTTCCGGGCA GGATGCGCTG GCCGCCAGCC AGGGCTTCCG GGAGCACCTC CGGCTCGGGC GCAAAGGGTT GGCGATCGGT GTGGACATCG GAGGCACCAA GGTCGCGGCC GGTGTGGTGG ACGCCGACGG CCGGATCCTC AGCCAGGCCA GGCGCTCCAC ACCCGGGAAC GACCCCCGGG CCGTGGAACA GGTGATCGTC GAACTCGTGG AGGAGCTGAG CCGCGGCCAC CGCATCTGGT CGGTGGGAAT CGGCGCGGCC GGGTGGATGG ACCTCGACGG CGGCACGGTG CTGTTCAGCC CGCACCTCGC GTGGCGCAAC GAACCGTTGC GGGACAACCT CCAGCGACTG CTGCGCCGTC CGGTCCTGCT GACCAACGAC GCCGATGCCG CCGCGTGGGC GGAATGGCGC TTTGGTGCCG GGCAGGGGCA AAGCAGGCTC GTCTGCATCA CCCTGGGCAC GGGGATCGGC GGTGCCATGG TGATGGACGG CCGGCTGGAA CGCGGCCGCT TTGGGGTTGC GGGAGAATTC GGCCACCAGA TCATCATGCC CGGCGGGCAC CGTTGCGAAT GCGGAAACCG CGGCTGCTGG GAACAGTATG CGTCTGGAAA CGCGCTGGGC CGCGAAGCCC GCGAACTGGC TGCTGCCAAT TCGCCCGTGG CGCAGGAACT CCTGAAGGCC GTGGACGGTC AGGTGGACCG CATCACCGGG GCCATCGTGA CGGAACTGGC CAAGGCGGGG GACCCCACGT CCCGGGAACT GCTGGAGGAC GTTGGGGAGT GGCTGGGCCT GGGGCTGGCC AACCTGGCTG CCGCGCTGGA CCCTGGAAAG TTCGTCATCG GTGGAGGCCT GTGCGATGCG GGTGAACTGC TGGTGGCGCC GGCACGGAAG GCCTTTGCGC GGAACCTGAC AGGGCGCGGA TTCCGGCCGG CTGCCGAAAT CGCGCTCGCA GCCCTGGGCC CCAACGCCGG GCTGATCGGG GCCGCAGACC TGTCCCGTGT CAGCAGCAGG ATGCACGGCT GA
|
Protein sequence | MHTPSPGDLP RPYRRTAAWR RRKPSGQDAL AASQGFREHL RLGRKGLAIG VDIGGTKVAA GVVDADGRIL SQARRSTPGN DPRAVEQVIV ELVEELSRGH RIWSVGIGAA GWMDLDGGTV LFSPHLAWRN EPLRDNLQRL LRRPVLLTND ADAAAWAEWR FGAGQGQSRL VCITLGTGIG GAMVMDGRLE RGRFGVAGEF GHQIIMPGGH RCECGNRGCW EQYASGNALG REARELAAAN SPVAQELLKA VDGQVDRITG AIVTELAKAG DPTSRELLED VGEWLGLGLA NLAAALDPGK FVIGGGLCDA GELLVAPARK AFARNLTGRG FRPAAEIALA ALGPNAGLIG AADLSRVSSR MHG
|
| |
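The sequences above are printed in space-separated blocks of 10 characters, so they need the whitespace removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names and the short example string are hypothetical; the example string is not taken from this record, and the rounding used by the database for its GC content value is not stated here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Hypothetical toy input in the same blocked layout, for illustration only.
toy = "ATGC GGCC"
toy_clean = clean_sequence(toy)
toy_gc = gc_fraction(toy)
print(toy_clean, toy_gc)
```

To apply this to the record, paste the full Gene sequence value into `gc_fraction` and compare the result, as a percentage, with the GC content field.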