Gene Namu_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1463 
Symbol 
ID8447059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1615007 
End bp1616230 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content74% 
IMG OID645040592 
ProductGlucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_003200851 
Protein GI258651695 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.843339 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.476592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCCG TGCCCGCCCC CGACCGGCTG CGCCCCCGAC GGGCGCGCCG GGTCCTGCCG 
ATCGCCCTGC TGGCGTTCGG CCTGCCGTTG GCGGCCTGCG CCGACTTCTC CACCGAGCCC
CCGGCCTTCA CCGTGCAGCC GTCGCTGACC GCGCGTGCGG TCGTGCCGGA CGAGAGCACC
CCCGGCGCCA TCCCGTCCGC CGTGCCGCCC TCGGGCAGCT CCCCGTCGCC GTCGCCGAGC
GGGTCGGCCG AGGCGACCGA CCCGTGCGCG CCGACCGACG AGGCGGTCAT CGCCACCTGT
CTGACCGCGC CGTGGGGGCT GGCCCCGCTG CCCGACGGCC AGTCCGCCCT GGTCGGGGAG
CGCACCACCG GTCGCATCCT CAAGGTCCGC AAGGGAGCAC AGCCGGAGCT GGTCGCCCAG
ATCGACGGGG TGGACGGCAG CGGCGACGGC GGCCTGCTGG GCCTGGTGCT GTCCCCGTCC
TATGACGAGG ACGGCCTGAT CTACGCCTAC GTCACCACCG CCGAGGACAA CCGGATCATC
CGCATCGCTT CCGGCGACGT GCCCAAGGCC ATCTTCACCG GCATCCCGAA GGGCACCAGC
CACAACGGTG GCCGGATCGC CTTCGCCCCG GACCGCAGCC TGTATGTCGG CACCGGCGAC
ACGGGCAACC CCGCACTGAC CGCCGACGCC ACTTCCCTGG CCGGAAAGGT GTTGCGGCTG
GACGAGTTCG GCAAGCCCGA CGCGACCAAC CCGGTCCCGG GCTCACCGAT CTACGCCTCG
GGCTTCAGCC AGGTGACCGG CATGTGCCCG TTGACCGACG GCACGATGGC CGCCCTGGAC
CACCGCAGCA CCGCCGACGT GCTGCTGCCG CTGGCCGCCG GCAAGAACTA CGGCACGCCG
GCCGCCGGCG ACGCGCTGTG GACCTGGAGC GCGGCCGAGG GCGGCGCGGC CAGCTGCGCG
ATCGCCGCCA ACGTGCTGGC CACCACCTCG TTGGACAAGC AGGAGCTCAC CGGGCTGCAG
ATGGCCGGGC CGACCACGTT CACCGGTTCG CCCAGCGTGC TGCTGGACAA CCGGTACGGC
CGGCTGCTGA CTGTCGTCGG CGACAAGACC GGCCTGCTCT GGCTGACCAC GTCGAACAAG
GACGGGCTCG GTCAGCCCGT GCCCAGCGAC GACCGGGTGC TGGTCGTCCC CTCGGGGGGC
GGCGGCGAGG GCGGGCTCGA CTGA
 
Protein sequence
MPSVPAPDRL RPRRARRVLP IALLAFGLPL AACADFSTEP PAFTVQPSLT ARAVVPDEST 
PGAIPSAVPP SGSSPSPSPS GSAEATDPCA PTDEAVIATC LTAPWGLAPL PDGQSALVGE
RTTGRILKVR KGAQPELVAQ IDGVDGSGDG GLLGLVLSPS YDEDGLIYAY VTTAEDNRII
RIASGDVPKA IFTGIPKGTS HNGGRIAFAP DRSLYVGTGD TGNPALTADA TSLAGKVLRL
DEFGKPDATN PVPGSPIYAS GFSQVTGMCP LTDGTMAALD HRSTADVLLP LAAGKNYGTP
AAGDALWTWS AAEGGAASCA IAANVLATTS LDKQELTGLQ MAGPTTFTGS PSVLLDNRYG
RLLTVVGDKT GLLWLTTSNK DGLGQPVPSD DRVLVVPSGG GGEGGLD