Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0780 |
Symbol | galK |
ID | 6145907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 780841 |
End bp | 781989 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615668 |
Product | galactokinase |
Protein accession | YP_001742860 |
Protein GI | 170679731 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCTGA AAGAAAAAAC ACAATCTCTG TTTGCCAACG CATTTGGCTA CCCTGCCACT CACACCATTC AGGCGCCTGG CCGCGTGAAT TTGATTGGTG AACACACCGA CTACAACGAC GGTTTCGTTC TGCCCTGCGC GATTGATTAT CAAACCGTGA TCAGTTGTGC ACCACGCGAT GACCGTAAAG TTCGCGTGAT GGCAGCCGAT TATGAAAATC AGCTCGATGA GTTTTCCCTC GATGCGCCCA TTGTCGCACA TGAAAGCTAT CAATGGGCTA ACTACGTTCG TGGCGTGGTG AAACATCTGC AACTGCGTAA CAACAACTTC GGCGGTGTGG ACATGGTGAT CAGCGGCAAT GTGCCGCAGG GTGCCGGGTT AAGTTCTTCC GCTTCACTGG AAGTCGCGGT CGGAACCGTA TTGCAGCAGC TTTATCATCT GCCGCTGGAC GGCGCACAAA TCGCGCTTAA TGGTCAGGAA GCAGAAAACC AGTTTGTAGG CTGTAACTGT GGAATCATGG ATCAACTGAT TTCCGCGCTC GGCAAGAAAG ATCATGCCTT GCTGATCGAT TGCCGCTCAC TGGGGACCAA AGCTGTTTCC ATGCCGAAAG GCGTGGCTGT CGTCATCATC AACAGTAACT TCAAACGTAC CCTGGTTGGC AGCGAATACA ACACCCGTCG TGAACAGTGC GAAACCGGTG CGCGTTTCTT CCAGCAGCCA GCGCTGCGCG ATGTCACCAT TGAAGAGTTC AACGCTGTTG CGCATGAACT GGACCCAATC GTGGCGAAAC GCGTGCGTCA TATCCTGACT GAAAACGCCC GCACCGTTGA AGCTGCCAGC GCGCTGGAGC AAGGCGACCT GAAACGTATG GGCGAGTTGA TGGCGGAGTC TCATGCCTCT ATGCGCGATG ATTTCGAAAT CACCGTGCCG CAAATTGACA CTCTGGTAGA AATCGTCAAA GCTGTGATTG GCGACAAAGG TGGCGTACGC ATGACCGGCG GCGGATTTGG CGGCTGCATC GTCGCGCTGA TCCCGGAAGA GCTGGTGCCT GCAGTACAGC AAGCTGTCGC TGAACAATAT GAAGCAAAAA CAGGTATTAA AGAGACTTTT TACGTTTGTA AACCATCACA AGGAGCAGGA CAGTGCTGA
|
Protein sequence | MSLKEKTQSL FANAFGYPAT HTIQAPGRVN LIGEHTDYND GFVLPCAIDY QTVISCAPRD DRKVRVMAAD YENQLDEFSL DAPIVAHESY QWANYVRGVV KHLQLRNNNF GGVDMVISGN VPQGAGLSSS ASLEVAVGTV LQQLYHLPLD GAQIALNGQE AENQFVGCNC GIMDQLISAL GKKDHALLID CRSLGTKAVS MPKGVAVVII NSNFKRTLVG SEYNTRREQC ETGARFFQQP ALRDVTIEEF NAVAHELDPI VAKRVRHILT ENARTVEAAS ALEQGDLKRM GELMAESHAS MRDDFEITVP QIDTLVEIVK AVIGDKGGVR MTGGGFGGCI VALIPEELVP AVQQAVAEQY EAKTGIKETF YVCKPSQGAG QC
|
| |