Gene Saro_0544 details


Gene Information

Locus tag: Saro_0544
Symbol:
ID: 3918674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 593656
End bp: 594798
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 69%
IMG OID: 640443274
Product: galactokinase
Protein accession: YP_495825
Protein GI: 87198568
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0153] Galactokinase
TIGRFAM ID: [TIGR00131] galactokinase
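
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 593656
end_bp = 594798

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# One codon per residue; the last codon is assumed to be a stop codon
# and therefore contributes no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the listed 1143 bp and 380 aa.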


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCGCGC TGCATGACCG GCTCCTTGCC GGCTTTGCGC AAGCGTTCGG GGGAGAGCCG 
GAGCTTGTCG TGCGCGCGCC CGGCCGGGTG AACCTGATCG GCGAACATAC CGACTACAAC
GACGGCTTCG CCATGCCCGT GGCAATAGGC CAGGAAACGC GCGTGGCCTT CCGCCCGGGC
GGAACCGGGC TCAGGGTGGC CGCTCTGGAC TTTGCCGAAG ATGACGCGTT CGACAGCGCG
GCCCCGCAAA GGGCCGGCGG CGGTTGGCGC GATTATGTGC GCGGCGTGGT GGACGAACTC
GTCCGCGCGG GGATTTCCGT CCCTCCGGGC CAGCTTGCGA TCGCAGGATC GATTGCCAAG
GGGACCGGCC TGTCATCCTC GGCCTCGCTC GAGGTTGCCG TCGCGCGTGT CCTGCTCGAT
GCAGCGGGTG AACGGATGGA CCCCGTCAGC CTCGCGCTCC TTGCCCAGCG AGCGGAATGC
GATTTCGTCG GCGTTCGCTG CGGCAATCTC GATCAGATTG CCAGTGCTGC CACGACGCGC
GGCCACGCGC TGCTGATCGA TTGCCGCACC CTGGCGCTCA GGCAGATCGC CATGCCCGCC
GACGTGGCGG TGATGATCGT GCAGTCAGGC GTGGTGCGCG GATTGGTGGA CGGCGAATAC
AACCAGCGCC GGCAGGAATG CGAACGCGCC GCCCGGACGC TCGGCGTGCC GGCGCTGCGC
GACGTCGACG AGGGGATGCT CGACGAGGCG TGCGGGCGGC TTGACGATCT TGCCTTCCTT
CGCGCCCGCC ATGTCTGCGG CGACAATCGC CGGACGCGGG AGGCTGCCCG CGCGCTGGCC
TCGGGCGATC TGGTCGCGAT GGGGGCGCTC ATGCGCGAAA GCCATGTCTC GCAGGGTCGG
GACTTCGGCA TCACCGTGCC CCATACCGAC GTGCTGGCAG CGCTGATGAA CGAAGCGATC
GGCGAAGACG GCGGCGCTCG GCAGACGGGC GGCGGCTTTG GCGGCGCCGT CGTCGGCCTC
ATGCGACAGG ACCGCGTCGC GGCTGTGCGC GAAGCGGTCC TTGCCGTGTA TCGGACGCCT
GCCGGAGACG TGCCTGAAAT CTGTATAGAG GTTCCTTCGG ATGGGGCGGG ACCGGTCGGC
TGA
 
Protein sequence
MTALHDRLLA GFAQAFGGEP ELVVRAPGRV NLIGEHTDYN DGFAMPVAIG QETRVAFRPG 
GTGLRVAALD FAEDDAFDSA APQRAGGGWR DYVRGVVDEL VRAGISVPPG QLAIAGSIAK
GTGLSSSASL EVAVARVLLD AAGERMDPVS LALLAQRAEC DFVGVRCGNL DQIASAATTR
GHALLIDCRT LALRQIAMPA DVAVMIVQSG VVRGLVDGEY NQRRQECERA ARTLGVPALR
DVDEGMLDEA CGRLDDLAFL RARHVCGDNR RTREAARALA SGDLVAMGAL MRESHVSQGR
DFGITVPHTD VLAALMNEAI GEDGGARQTG GGFGGAVVGL MRQDRVAAVR EAVLAVYRTP
AGDVPEICIE VPSDGAGPVG
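
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this on the first 60 nucleotides only. It assumes the amino-acid assignments of table 11 match the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position; `translate`, `gc_content`, and `START_CODONS` are hypothetical helper names, and the start-codon set is a deliberately small subset.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of bacterial initiation codons, enough for this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG/TTG is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a sequence."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from the record.
first_60_nt = (
    "GTGACCGCGC TGCATGACCG GCTCCTTGCC GGCTTTGCGC AAGCGTTCGG GGGAGAGCCG"
)
print(translate(first_60_nt))
```

Passing the full gene sequence to the same helpers would be one way to check the listed protein sequence and GC content against the nucleotide data.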