Gene Namu_4646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4646 
Symbol 
ID8450275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5169891 
End bp5171108 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content73% 
IMG OID645043686 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_003203912 
Protein GI258654756 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTGG TCGCCGGCCT CGCCGTCATC GGCGCGCTGG CCGCCTGTTC AGGGGCCGGC 
GCACCCCTAC CGGCCAGCAC CACCATCGGG CTGAGCGCGC CCGCTTCGGC GGCCACCTCG
CCGACCGATC CGCCGACTGA CCCGTCGGCC GTGCCGTCCA CGGTGTTGTC CACCGTGCCC
TCCCCCGCCG GCACCGTGCA GGTGAGCACC GTGGCCACCG GCCTGGCCTC ACCGTGGGGC
CTGGATTTCC TGCCCGACGG GCGAGCCGTG GTCACCAGCC GCGACACCGC CACGATCAGC
CTGGTCGACC CCGACGGCAC CATCACGCCG GTGGGCACCG TCGACGGGGT GGTGCCCGGT
GGCGAGGGCG GGTTGCTCGG GATCGCCGTC TCCCCGGCGT TCAGCACCGA CCACCGGCTG
TACGTCTACT ACACCGCCGC ACAGGACAAC CGGATCGCCA CCGTCGAACT CGTCGACGGG
GCAATCGGCA ATCAGCAGGT CGGCTTCACC GGCATTCCCA AGGCCGGCAT CCACAACGGC
GGCCGGATCG TCTTCGGCCC GGACGGGCTG CTCTACGTCG GCACCGGGGA CGCCGGGGAC
CGGCCCCAGG CCCAGGACCC GGACGCGCTC GGCGGCAAAA TCCTGCGCCT GGACTCCCAG
CTCCGGCCGG CCGCCGGCAA CCCGGACGAT CCGGTCCTGG CCGGCGGCGC CGGCTACAGC
CTGGGCCATC GCAACGTGCA GGGCCTGGCC TTCGACGACC GGGGCCGGCT CTGGGCCGCC
GAGTTCGGCC AGAACACCTG GGACGAGTTG AACCTGGTCC AAGCCGGCGA CAACGACGGC
TGGCCGGTCG TCGAAGGAAT CGGCGACAAC CCCGACGGCG TCAATCCCGA ATTCGTCAAC
CCGCAGCGGC AGTGGTCGAC CGCGGACGCC TCCCCCAGCG GCATCGCCTT CTGGCAGGGC
TCCATCTGGA TGGCCGGCCT GCGCGGGCAG CAGCTGTGGC AGATCCCGCT GACCGAGTCC
GGGGCGGAAT CGACCGGGGA GTCGAACGGG GAGCTGACCG GGGAGCCGGT CGGCCATCTC
AACGGCGTCT ACGGCCGGCT GCGGACCGTG GTCGCCGCGC CCGACGGCAG CCTGTGGCTG
ATCACCTCGA ACACCGACGG TCGCGGCGAC GTCCGGGACG GCGACGACCG CATCCTGCGC
CTGCAGCCGG CCGCCTGA
 
Protein sequence
MGLVAGLAVI GALAACSGAG APLPASTTIG LSAPASAATS PTDPPTDPSA VPSTVLSTVP 
SPAGTVQVST VATGLASPWG LDFLPDGRAV VTSRDTATIS LVDPDGTITP VGTVDGVVPG
GEGGLLGIAV SPAFSTDHRL YVYYTAAQDN RIATVELVDG AIGNQQVGFT GIPKAGIHNG
GRIVFGPDGL LYVGTGDAGD RPQAQDPDAL GGKILRLDSQ LRPAAGNPDD PVLAGGAGYS
LGHRNVQGLA FDDRGRLWAA EFGQNTWDEL NLVQAGDNDG WPVVEGIGDN PDGVNPEFVN
PQRQWSTADA SPSGIAFWQG SIWMAGLRGQ QLWQIPLTES GAESTGESNG ELTGEPVGHL
NGVYGRLRTV VAAPDGSLWL ITSNTDGRGD VRDGDDRILR LQPAA