Gene Namu_4309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4309 
Symbol 
ID8449935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4790948 
End bp4791829 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content77% 
IMG OID645043357 
Product1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D- gluc opyranosidedeacetylase 
Protein accessionYP_003203586 
Protein GI258654430 
COG category[S] Function unknown 
COG ID[COG2120] Uncharacterized proteins, LmbE homologs 
TIGRFAM ID[TIGR03445] 1D-myo-inosityl-2-acetamido-2-deoxy-alpha-D-glucopyranoside deacetylase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.414925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACGTTG CCGACGTGCC TGCCCGCCTG CTTGCCGTCC ACGCGCATCC CGACGACGAG 
TCGCTGACCA TGGCCGGCAC GCTGGCCGGG GCGGCCCTGG CCGGTGCCGA GGTCACTCTG
GTCACCGCCA CCCTCGGCGA GGAGGGGGAG GTGATCGGCG ACGAGCTGCA GGGCCTGATC
GCCGCCCGGG CCGACCAGCT CGGCGGGTAC CGGCTGACCG AGCTGGCGGC CGCCGGCGCG
GCGCTGGGCG TGCGGGAACG GGTCATGCTC GGTGGGCTGG GCGCGTTCCG GGACTCCGGC
ATGGCCGGCA CACCGTCGGC CGAGCATCCG CGGGCGTTCA TCCGGGCGCA GCGCGGCGGC
CCCGACCATG ATCGGGCGGC CCGGGCGCTG GCCCGGGAGA TCGACCGGGT CCGCCCACAT
GTGCTGCTCA CCTACGACGA GGACGGCGGC TACGGCCATC CCGACCACGT GGCCGTGCAT
CAGGTCGTGC TGGCCGCGCT GCCGTTGGCC GCCTGGCCGG TGCCCCGGGT GCTGGCGGTG
ATCCGCCCCC GGACGGTCAC CCAGGCCGAT TTCGCAGCGC TGACGACCCC GCCCGGGTAT
CTGGCCGCGG CGGCCGACGA GGTCGGGTTC CTGGCGGCCG ACGACTCGGT CGCGGTGGCC
GTGCCCGTCA CCGCGGCCGC CGCGCGGCGT CGCGCCGCGC TGGCCGCGCA CGCCACCCAG
GTCGAGCTGC TGCCCGGCGA GGTGTTCGCC CTGTCCAATC GGATTGCCCA GCCGCTGCCC
GCGGCCGAGT ACTTCCGGGT GCTGGCCGGC TCGCCGGTCC CGGTCGGGCC GGACTGGACG
GTGCCGGCCG ACGTGGCCGC CGGGCTGGAC CTGGACCGGT GA
 
Protein sequence
MYVADVPARL LAVHAHPDDE SLTMAGTLAG AALAGAEVTL VTATLGEEGE VIGDELQGLI 
AARADQLGGY RLTELAAAGA ALGVRERVML GGLGAFRDSG MAGTPSAEHP RAFIRAQRGG
PDHDRAARAL AREIDRVRPH VLLTYDEDGG YGHPDHVAVH QVVLAALPLA AWPVPRVLAV
IRPRTVTQAD FAALTTPPGY LAAAADEVGF LAADDSVAVA VPVTAAAARR RAALAAHATQ
VELLPGEVFA LSNRIAQPLP AAEYFRVLAG SPVPVGPDWT VPADVAAGLD LDR