Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3717 |
Symbol | |
ID | 5735581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4675809 |
End bp | 4676939 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280869 |
Product | galactokinase |
Protein accession | YP_001546481 |
Protein GI | 159900234 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0664685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAGG TTAAAGCTGC ATTTTTTGTT CGTTCTCCAG GTCGGGTAAA TTTAATTGGC GAGCATACCG ATTACAACGC TGGGTTTGTT ATGCCACTGG CGCTTGAACG TGGTACGACG TTTCAGGTTC AACCCCGTGA TGATCAGCAG TTGATTGTGC ATGCCCTACG TTTCAACGCA CATGATCAAG CCGATTTGGC TAATTTGGCG GCTGGCACGC ACGGCGATTG GCGTGATTAT GTGCGCGGAA CAGCCCAATC CTTGCTTGAT GCTGGCTACG CGTTGCAAGG CGCTGAGATT AATATTGATG GTGATTTGCC GCTGAGTGGC GGATTAAGCT CGTCAGCATC ATTAGAAGTT GGCTTGGCTT TTAGCTTACT CTACGCCCAA GGCATCACGA TTGCTCCCGC TGAATTAGCT AAAATTGCCC AACGCGCCGA AATTGAATAT GCCCATGTTA ATTGTGGAAT TATGGATCAG CTTGCGATTG CCGCAGGCGT TGCCGGCCAT GCCACATTAA TCGATTGCCG CTCGTTGGAA ATTGAGGCCG TGCCGATTCC GGCTGAAGTG GCAGTTTTGG TGATTGATAG TGGCGTGCCA CGCACCTTGG CTGGCTCGGC TTATAATCAA CGCCGCGCCG AATGTGAACA AGCTGTAGCA ATCTTGCGTC AACTCGACCC AAACATCAAC GATTTGCGCG ATGTTAACAG CGATTTGCTG GCCCAAGCCG TTGAACAAGA TCGCTTTGAA GAAGTGATTT ATCGACGTGC CCGCCATGTT GTCAGCGAAA ATGAGCGGGT GCATAAAGCC GCCGCCGCGT TTCGGGCAGG CGATTTTGGC TACGTTGGCG AGTTGATGAA CGAATCGCAT TGGAGCCTGC GCGATGATTA TGAAGTTAGC GGCCCTGAGC TTGATCAACT AACTGAGTTG TTGCGCGATA TGCCTGGGGT TTGGGGTGCT CGCCTAACTG GCGCTGGCTT TGGTGGCTGC TGCGTGGCCT TGGTCGAAGC CAGCCACGTT GATGCGGTGA TTGTGGCCTT AAGTCCAGCC TATCATGCCG CAACTGGCCG CACCTGCGAA GCCTTTAGCA CCAAAGCCTC AGCATTAACC ATTGAAGAAC CTAGAGCATA G
|
Protein sequence | MTEVKAAFFV RSPGRVNLIG EHTDYNAGFV MPLALERGTT FQVQPRDDQQ LIVHALRFNA HDQADLANLA AGTHGDWRDY VRGTAQSLLD AGYALQGAEI NIDGDLPLSG GLSSSASLEV GLAFSLLYAQ GITIAPAELA KIAQRAEIEY AHVNCGIMDQ LAIAAGVAGH ATLIDCRSLE IEAVPIPAEV AVLVIDSGVP RTLAGSAYNQ RRAECEQAVA ILRQLDPNIN DLRDVNSDLL AQAVEQDRFE EVIYRRARHV VSENERVHKA AAAFRAGDFG YVGELMNESH WSLRDDYEVS GPELDQLTEL LRDMPGVWGA RLTGAGFGGC CVALVEASHV DAVIVALSPA YHAATGRTCE AFSTKASALT IEEPRA
|
| |