Gene Namu_1768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1768 
Symbol 
ID8447370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1936546 
End bp1937703 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID645040894 
Productgalactokinase 
Protein accessionYP_003201147 
Protein GI258651991 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.862041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.168773 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCA CGACCGTGTC CGACCTGTTC CGTCGCAGTT ACGGCCGGAA ACCTGACGGG 
GTGTTCTCCG CGCCCGGCCG GGTCAACCTG ATCGGCGAGC ACACCGACTA CAACGGCGGC
CTGGTGCTGC CGTTCGCCAT CGACGCCCGC GCTCACCTCG CCGCCGGCCG GGCCGACTCC
GGGGCCATCC GAATCATGTC CGCGCAGAAA CCGGGTGAGT TCAGCCAGGT CCACCTTGAC
GATGTTCGGC CTGGCTCCCC GGCCGTCGCC GGGTGGCCCG GCTACCTGCT GGGGGCCATT
TGGTCGCTGC AGCAGACCGG TCGCCCGATC GAATCGGTCG ACCTGGTCCT GGACTCCCAG
ATACCCGCCG GAGCTGGTCT GTCATCCTCG GCCGCGGTGG AATGCGCCAC CGTGCTGGCA
GTTTCCGCGC TGTCCGGCTA CTCGATGGAC CCGCTCACCA TCGCCCGCAT CGCGCAACGG
GCCGAGAACG ACTTCGTCGG GGTGCCCTGC GGCCCGATGG ACCAGACCGC GTCCGCCGCC
TGCGCCGAAG GCTCGGTACT GCTGTTCGAC ACCCGATCAG GTAGCACCGA GAACATCTCG
TTCGATCCCG CCGCGCACGA CCTCACGGTG CTGGTCGTCG ACACCCAGGT CGCTCACTCC
CTCGCCGACG GCGAGTACGG CAAGCGGCGC ACCTCCTGCG AGTTGGCCGC CGAGATCCTG
GGCGTCACCC AACTTCGGGA GGTCACCGTC GACGACCTGC CCGCCGCGCT GAGCAAGCTC
CCCGACGACG AGCTGCGCCG CCGGCTCCGG CACGTCGTCA CCGAGAACGA CCGGGTGGAA
AGCACCGTCG AGCTGCTGCG GGCCGGTCGG ATCACCGACA TCGGCCCGTT GCTCACCGCC
TCGCACGCCT CCCTGCGCGA CGACTACGAC GTCTCCTGTG CCGAACTCGA CGCTGCTGTG
GACACGGCAC TGACGGCCGG CGCCCTCGGC GCTCGGATGA CCGGCGGAGG GTTCGGTGGG
TCGGTCATCG CGCTCGTCCC GACCGATCTG ACCGACCACG TCGGTGCCGC GGTCGTCTCC
GCCTTCGCCG GGCACGGCTT TCGACCGCCG GTCCTACGCC GGGTCAGCCC CGCGCAGGGT
GCCCGGCGGG AGAACTGA
 
Protein sequence
MNTTTVSDLF RRSYGRKPDG VFSAPGRVNL IGEHTDYNGG LVLPFAIDAR AHLAAGRADS 
GAIRIMSAQK PGEFSQVHLD DVRPGSPAVA GWPGYLLGAI WSLQQTGRPI ESVDLVLDSQ
IPAGAGLSSS AAVECATVLA VSALSGYSMD PLTIARIAQR AENDFVGVPC GPMDQTASAA
CAEGSVLLFD TRSGSTENIS FDPAAHDLTV LVVDTQVAHS LADGEYGKRR TSCELAAEIL
GVTQLREVTV DDLPAALSKL PDDELRRRLR HVVTENDRVE STVELLRAGR ITDIGPLLTA
SHASLRDDYD VSCAELDAAV DTALTAGALG ARMTGGGFGG SVIALVPTDL TDHVGAAVVS
AFAGHGFRPP VLRRVSPAQG ARREN