Gene Information |
Locus tag | Nmul_A1938 |
Symbol | |
ID | 3784234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2228720 |
End bp | 2229829 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812024 |
Product | galactokinase |
Protein accession | YP_412625 |
Protein GI | 82703059 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
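The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2228720
end_bp = 2229829

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1110 0 369
```

Under these assumptions the result matches the listed Gene Length (1110 bp) and Protein Length (369 aa).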
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.491701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCAGTT TCGAGTCTGT ATTTGGAGAT TCGCCTGAGT TTGTGGCACG CGCCCCAGGC CGCGTTAACC TGCTGGGAGA GCATACTGAC TACAACGGAG GTTATGTACT GCCGATTGCA ATCGAGCAGC AGACGAGTGT CAGCATGAGC CATAGCAACA GCAAGCAATA CGCATTGTAT TCGGAAATGC TCGATGACAT CGTTTACTTT ACTCTCGGCA AATCTCCAAC TGAGCACTTC GCCACCTACG TATATGGATG TTTGATGGAA GCTCGCGCGG TTGGGATGAA GGTGCCGGTG CTGGATATCT ACATTCAATC AGATGTGCCG ATGGGAGCAG GGCTATCCTC CAGTGCAGCG CTTGAGGTAG CGACGTTGCG AGTCCTGCGT GCGCTGACCG GTTTCCCCCT GGATGATGTG CAAATCGCAC AACTTGCTCA GCGCGCAGAA ATCCAGTACG CAGGAGTGCG CTGTGGCATC ATGGATCAGA TGGCATCGAG CCTTGCGGGA ACCAGAGAGG CCTTGCTGCT GGATACCCTT ACGCTTGAGC GGCGCCTGGT TCCCTTACCG CCTGCTTCGG CTGTGCTGGT ACTTGATTCA GGCGTTGTCC GCACGCTTGC GACAAGCGGT TACAATCAGC GCCGGACAGA GTGTGAAGAG GCGGCACATC AACTTGGAGT GCAATCACTT CGCGAAGTCC ATGACATTTC ACTGGCGGAC GCTTTGCCTG AGCCGCTGGG CCGCCGGGTA CGCCATATCG TAAGCGAGAA CGCGCGCGTG CTGCGGGCGG CGGAATGCAA CAACGCCGCG GAGTTCGGGA TGCTGATGAA CGCATCCCAT ACCAGTTTGC GCGACGATTA TGAGGTCTCG GTTCCCCAAC TTGATCAATT GGTGACCCTG CTTCAGGCTC ATCCTGATGT TTATGGAGCG CGCTTGACAG GAGCAGGCTT TGGAGGCGCG TGCGTCGCCT TATGCAAGCC GGAGTCCTTG CATCAGATTT CCGAAGCGGT GCTGCAAGAT TATTCAAGCA TGGGCTTGAA GGGACGCATC CTTGTGCCAC CACATTGGGT CGATAGAACT GCTACTGTAG GGGGAGGTTC AGCATCTTGA
|
Protein sequence | MSSFESVFGD SPEFVARAPG RVNLLGEHTD YNGGYVLPIA IEQQTSVSMS HSNSKQYALY SEMLDDIVYF TLGKSPTEHF ATYVYGCLME ARAVGMKVPV LDIYIQSDVP MGAGLSSSAA LEVATLRVLR ALTGFPLDDV QIAQLAQRAE IQYAGVRCGI MDQMASSLAG TREALLLDTL TLERRLVPLP PASAVLVLDS GVVRTLATSG YNQRRTECEE AAHQLGVQSL REVHDISLAD ALPEPLGRRV RHIVSENARV LRAAECNNAA EFGMLMNASH TSLRDDYEVS VPQLDQLVTL LQAHPDVYGA RLTGAGFGGA CVALCKPESL HQISEAVLQD YSSMGLKGRI LVPPHWVDRT ATVGGGSAS
|
| |
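The gene and protein sequences above can be related through translation table 11, under which an alternative start codon such as GTG is read as methionine when it initiates the CDS. The following is a minimal sketch, not a full implementation of the table: the names `CODON_TABLE`, `START_CODONS`, `translate`, and `gc_percent` are hypothetical, and `START_CODONS` is an assumed subset of the start codons the table permits. The sequence is used as printed, without reverse-complementing, because it already reads in coding orientation even though the record lists the strand as "-".

```python
# Standard amino acid assignments in NCBI "TCAG" codon order.
# Table 11 uses the same assignments; it differs in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: a common subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 12 bases of the gene sequence in this record.
first_30 = "GTGAGCAGTT TCGAGTCTGT ATTTGGAGAT"
last_12 = "TCAGCATCTTGA"

print(translate(first_30))  # prints: MSSFESVFGD
print(translate(last_12))   # prints: SAS
```

The two printed fragments agree with the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.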