Gene Information |
Locus tag | RoseRS_3819 |
Symbol | |
ID | 5210801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4777593 |
End bp | 4778768 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597415 |
Product | galactokinase |
Protein accession | YP_001278123 |
Protein GI | 148657918 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.388206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0156398 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGATA CCGGAGAACT GCGCGAGCGT TTTCAGCAGC ATTACGGGAT ACATCCGCAT GTCATCGTTC GTGCGCCAGG GCGCGTTAAC CTCATTGGCG AACATACTGA TTACAATGAC GGGTTTGTGT TTCCGGTCGC TATTGATCGC GCCACCTGCG TCGCGGCCCG TCCGCGCACT GATCGAATAG TGCGCGTCAT GGCGGCGGAT CTCCATGATG AGGATCTCTT TTCAATCGAC CAGATCGAAC GCAGCAACCG GGCATGGCAC AACTATATTC GTGGCGTCGT GCTGGCGCTG CGCACCGCGG GGCATACGCT GTCGGGCGCC GACATGTTGA TCGCCAGCGA TGTGCCGCGC GGCGCCGGGC TTTCGTCATC GGCAGCGCTT GAGGTGGCCG TCGCATACAC GTTTCAGGTG CTCAACCGGC TCAACATTCT CGGCGAAGAA CTGGCGCTGC TGGCGCAGGG CGCCGAAAAT ACCTTCGTCG GTGTGCAGTG CGGCATTATG GATCAGTTGA TCGCTGTGTT CGGGCGCGCC GATCATGCGT TGCTGATCGA TTGCCGCGAC CTGACGTATC GCGCAGTTCC TCTGCCGCCA TCGGTTGCAG TCGTTGTCTG TGACAGTCAT ATCGCGCGAA CGCTGGCGGC ATCGGCGTAC AATCAGCGCC GTCAGGAGTG CGATGCCGCA GTTCGGGCGC TGCAACAGTG GTATCCCGGC ATCCGCGCGC TGCGTGACGT GAGCGAAGAT CAACTGGCAG CGCATCAGCA CGAACTTCCT GAACCACTGC GCGCTCGCGC ACGGCACGTC GTCAGCGAAA ACCGGCGCGC GCTCCAGGGC GCTGCGGCGC TCGAAGCCGG CGACATAGCC ACATTTGGGC GACTGATGAA TGAATCACAC GCCAGCCTGC GTGATGATTA TCAGGTCAGC CTGCCAGACA TTGATTTTCT CGTTACAACA GCGCAGAGTC TGGCAGGATG TTACGGATCG CGGTTGACCG GCGCCGGGTT TGGCGGATGC ACTGTCAGCC TGGTCGAGCG GAGCAGTGTG GAAACGTTTC GCCACGACCT GGCACAGGCT TACCACGATG CGACCGGTCG AACGGCAACC ATCTATGTAT GTCGCGCCAG CGACGGAGTT GGGCGCGTCA TGGACAATGC ACGTCCACAG GAATGA
|
Protein sequence | MLDTGELRER FQQHYGIHPH VIVRAPGRVN LIGEHTDYND GFVFPVAIDR ATCVAARPRT DRIVRVMAAD LHDEDLFSID QIERSNRAWH NYIRGVVLAL RTAGHTLSGA DMLIASDVPR GAGLSSSAAL EVAVAYTFQV LNRLNILGEE LALLAQGAEN TFVGVQCGIM DQLIAVFGRA DHALLIDCRD LTYRAVPLPP SVAVVVCDSH IARTLAASAY NQRRQECDAA VRALQQWYPG IRALRDVSED QLAAHQHELP EPLRARARHV VSENRRALQG AAALEAGDIA TFGRLMNESH ASLRDDYQVS LPDIDFLVTT AQSLAGCYGS RLTGAGFGGC TVSLVERSSV ETFRHDLAQA YHDATGRTAT IYVCRASDGV GRVMDNARPQ E
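The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon (the gene sequence above ends in TGA). The `gc_percent` helper shows how a GC content figure could be recomputed from the gene sequence; whitespace between the 10-base blocks is ignored.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(4777593, 4778768)
    print(length)                  # 1176, matches Gene Length
    print(protein_length(length))  # 391, matches Protein Length
```

Under these assumptions the computed values agree with the Gene Length (1176 bp) and Protein Length (391 aa) fields. Passing the full gene sequence string to `gc_percent` would give a value to compare with the GC content field.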