Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1896 |
Symbol | |
ID | 3917117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2009128 |
End bp | 2010096 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444640 |
Product | glucokinase |
Protein accession | YP_497170 |
Protein GI | 87199913 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.355899 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGTAG TTGCCGTCGA TATTGGCGGA ACCCACGCCC GCTTCGCGAT TGCCGAAGTC GCGGAAGGCC GCGTGGTCTC GCTTGGCGAG GCCGTTACGC TCAAGACGGC GGAGCATGGC TCGTTCCAGC TTGCCTGGGA GGATTTCGAG CGCGTTCGTG GAGAGCCATT GCCCAAGGCC GCGGCTATTG CCGTCGCCGG GCCTGTCGGC GGCGAGATCA TCAAGTTCAC GAACAATCCG TGGATCATCC GCCCGGCGTT GATCCCGGAG AAGCTGGGGG CGGAGCAATA TGTCGTCGTC AACGACTTCG CCGCCGTGGC CCATGCTGTC GCGCAGGCGG ACCAGAGCCA CTTTCTACAC CTGAGCGGCC CGGACGAACC GCTGCCCGCC AGCGGCGTGA CCAGCGTGGT CGGACCAGGG ACAGGGCTCG GCGTGGCGCA GTTGTGGCGT GACGGGAACA ACTACCGGGT CCAGCCCACC GAAGGCGGGC ACATCGACTT CGCGCCGCTA GATTCGATCG AGGACGCGAT CCTTGCCGGT CTGCGCAAGC GCCACCGCCG CGTTTCGGCG GAACGCGTCG TGGCCGGGCC GGGCATCGTC GATATCTACG AGGCGCTTGC TCTGATCGAA GGACGACCGT TTACACCCCG GTCTGACCGC GAGTTGTGGG AACTCGGGAC TTCCGGAGCG GACAGCCTTG CCGCCGCTGC AGTAGACAGG TTCTGCCTCT CGCTCGGCAG CGTGGCAGGC GATCTTGCCC TGGCTCACGG CGCCAACGGC GTCGTCATGG CCGGCGGGCT TGGCCTGCGG ATCAAGGACA CGCTGGTTCG GTCTGGCTTC TCGGACAGGT TCAGGGCAAA AGGGCGGTTC GAGGCCCTCA TGGCGGCCAT TCCGGTCAAG CTGATCACGC ATCCGCAGCC CGGCCTGTTC GGTGCGGCAG CGGCCTTTGC GCAAGCGCAT ACGTCGTGA
|
Protein sequence | MQVVAVDIGG THARFAIAEV AEGRVVSLGE AVTLKTAEHG SFQLAWEDFE RVRGEPLPKA AAIAVAGPVG GEIIKFTNNP WIIRPALIPE KLGAEQYVVV NDFAAVAHAV AQADQSHFLH LSGPDEPLPA SGVTSVVGPG TGLGVAQLWR DGNNYRVQPT EGGHIDFAPL DSIEDAILAG LRKRHRRVSA ERVVAGPGIV DIYEALALIE GRPFTPRSDR ELWELGTSGA DSLAAAAVDR FCLSLGSVAG DLALAHGANG VVMAGGLGLR IKDTLVRSGF SDRFRAKGRF EALMAAIPVK LITHPQPGLF GAAAAFAQAH TS
|
| |