Gene Information |
Locus tag | Spro_2004 |
Symbol | |
ID | 5602693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 2195001 |
End bp | 2195921 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640937542 |
Product | N-acetyl-D-glucosamine kinase |
Protein accession | YP_001478235 |
Protein GI | 157370246 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
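The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that agrees with the 921 bp length reported here) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2195001
END_BP = 2195921


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 921
print(protein_length(length))  # 306
```

Both results match the Gene Length and Protein Length fields in the record.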
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.738789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0249015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTACTACG GTTTCGATAT GGGCGGCACC AAGATTGAGC TGGGCGTATT CGACGCAGAC CTGCAGCGCA TTTGGCAAAA AAGAGTGCCG ACGCCGCGTG AAGATTATCA GCAACTGCTG GCGACGTTGC GCGACCTGAC CTTTGAAGCG GATGCGTTCT GCGGCCAGAA GGGCATGGTC GGCATCGGCA TTCCGGGTCT GCCCAACGAT GACGACGGCA CGGTGTTCAC CGCCAACGTG CCGGCGGCGA TGGGGCAGAA GCTGCCGCAT GACCTGGCCG AACTGATTGG CCGCGAAGTA CGCATTGATA ATGACGCCAA CTGCTTTGCA CTGTCGGAAG CCTGGGACGA AGAATTCTGT CACTATCCGA CGGTGTTGGG CATCATTCTC GGCACCGGGG TTGGCGGCGG GCTAATCGTC GATGGCAAGG TGGTCTCCGG GCGTAATTAT ATCGCCGGTG AATTCGGCCA CTTCCGCCTG CCGGTAGATG CCCTTGAGGT GCTGGGGCGT GATATTCCTC GCGTTCCTTG TGGTTGCGGT CATCAGGGTT GCATCGAAAA TTACATTTCC GGCCGCGGTT TTGAGTGGAT GTACGCCCAC TTTTACCAGC AGCACTTGCC TGCGCAGCAG ATCATTGCGC ATTATCAGTC GGGGGAGCCG CAGGCCGTGG CTCACGTCGA ACGTTTTATG GACGTGCTGG CGATATGTCT GGGTAATCTG CTGACCATCA TCGACCCTCA TCTGGTGGTG ATTGGTGGCG GTTTGTCGAA CTTTGAAGCG ATCTACCAGG AATTACCGCA GCGTTTGCCG GCGCATCTGT TGCGGGTGGC CAAGTTGCCA CGGATTGAAA AGGCGCGCTA CGGCGATGCC GGTGGGGTCC GCGGAGCTGC GTTCCTCAAT TTGGTCAACA GGGAAAAGTA A
Protein sequence | MYYGFDMGGT KIELGVFDAD LQRIWQKRVP TPREDYQQLL ATLRDLTFEA DAFCGQKGMV GIGIPGLPND DDGTVFTANV PAAMGQKLPH DLAELIGREV RIDNDANCFA LSEAWDEEFC HYPTVLGIIL GTGVGGGLIV DGKVVSGRNY IAGEFGHFRL PVDALEVLGR DIPRVPCGCG HQGCIENYIS GRGFEWMYAH FYQQHLPAQQ IIAHYQSGEP QAVAHVERFM DVLAICLGNL LTIIDPHLVV IGGGLSNFEA IYQELPQRLP AHLLRVAKLP RIEKARYGDA GGVRGAAFLN LVNREK
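The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the pipeline that produced this record: it uses the amino-acid assignments shared by the standard code and translation table 11, does not special-case alternative start codons, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the same amino-acid
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above, as formatted in the record.
prefix = "ATGTACTACG GTTTCGATAT GGGCGGCACC"
print(translate(prefix))  # MYYGFDMGGT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would let a reader compare the results with the protein sequence and the GC content reported in the record.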