Gene RoseRS_3819 details

Gene Information

Locus tag: RoseRS_3819
Symbol:
ID: 5210801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 4777593
End bp: 4778768
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 61%
IMG OID: 640597415
Product: galactokinase
Protein accession: YP_001278123
Protein GI: 148657918
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
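
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The sketch below is only an illustration: it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon, and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the stop codon.

    Assumes the gene length is a whole number of codons and that the
    last codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(4777593, 4778768)
print(length, "bp")
print(protein_length(length), "aa")
```

With the values in this record, the computed lengths agree with the listed 1176 bp and 391 aa.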


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.388206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0156398
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGATA CCGGAGAACT GCGCGAGCGT TTTCAGCAGC ATTACGGGAT ACATCCGCAT 
GTCATCGTTC GTGCGCCAGG GCGCGTTAAC CTCATTGGCG AACATACTGA TTACAATGAC
GGGTTTGTGT TTCCGGTCGC TATTGATCGC GCCACCTGCG TCGCGGCCCG TCCGCGCACT
GATCGAATAG TGCGCGTCAT GGCGGCGGAT CTCCATGATG AGGATCTCTT TTCAATCGAC
CAGATCGAAC GCAGCAACCG GGCATGGCAC AACTATATTC GTGGCGTCGT GCTGGCGCTG
CGCACCGCGG GGCATACGCT GTCGGGCGCC GACATGTTGA TCGCCAGCGA TGTGCCGCGC
GGCGCCGGGC TTTCGTCATC GGCAGCGCTT GAGGTGGCCG TCGCATACAC GTTTCAGGTG
CTCAACCGGC TCAACATTCT CGGCGAAGAA CTGGCGCTGC TGGCGCAGGG CGCCGAAAAT
ACCTTCGTCG GTGTGCAGTG CGGCATTATG GATCAGTTGA TCGCTGTGTT CGGGCGCGCC
GATCATGCGT TGCTGATCGA TTGCCGCGAC CTGACGTATC GCGCAGTTCC TCTGCCGCCA
TCGGTTGCAG TCGTTGTCTG TGACAGTCAT ATCGCGCGAA CGCTGGCGGC ATCGGCGTAC
AATCAGCGCC GTCAGGAGTG CGATGCCGCA GTTCGGGCGC TGCAACAGTG GTATCCCGGC
ATCCGCGCGC TGCGTGACGT GAGCGAAGAT CAACTGGCAG CGCATCAGCA CGAACTTCCT
GAACCACTGC GCGCTCGCGC ACGGCACGTC GTCAGCGAAA ACCGGCGCGC GCTCCAGGGC
GCTGCGGCGC TCGAAGCCGG CGACATAGCC ACATTTGGGC GACTGATGAA TGAATCACAC
GCCAGCCTGC GTGATGATTA TCAGGTCAGC CTGCCAGACA TTGATTTTCT CGTTACAACA
GCGCAGAGTC TGGCAGGATG TTACGGATCG CGGTTGACCG GCGCCGGGTT TGGCGGATGC
ACTGTCAGCC TGGTCGAGCG GAGCAGTGTG GAAACGTTTC GCCACGACCT GGCACAGGCT
TACCACGATG CGACCGGTCG AACGGCAACC ATCTATGTAT GTCGCGCCAG CGACGGAGTT
GGGCGCGTCA TGGACAATGC ACGTCCACAG GAATGA
 
Protein sequence
MLDTGELRER FQQHYGIHPH VIVRAPGRVN LIGEHTDYND GFVFPVAIDR ATCVAARPRT 
DRIVRVMAAD LHDEDLFSID QIERSNRAWH NYIRGVVLAL RTAGHTLSGA DMLIASDVPR
GAGLSSSAAL EVAVAYTFQV LNRLNILGEE LALLAQGAEN TFVGVQCGIM DQLIAVFGRA
DHALLIDCRD LTYRAVPLPP SVAVVVCDSH IARTLAASAY NQRRQECDAA VRALQQWYPG
IRALRDVSED QLAAHQHELP EPLRARARHV VSENRRALQG AAALEAGDIA TFGRLMNESH
ASLRDDYQVS LPDIDFLVTT AQSLAGCYGS RLTGAGFGGC TVSLVERSSV ETFRHDLAQA
YHDATGRTAT IYVCRASDGV GRVMDNARPQ E
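
The sequences above are displayed in space-separated groups of 10 characters, so they need to be cleaned before any computation such as sequence length or GC content. A minimal sketch, assuming the text is pasted in as a plain string; the helper names are hypothetical and the demo input is just the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and newlines used for display grouping; uppercase."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) nucleotide string."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGCTTGATA CCGGAGAACT GCGCGAGCGT TTTCAGCAGC ATTACGGGAT ACATCCGCAT"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field of the record.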