Gene Namu_4076 details

Gene Information

Locus tag: Namu_4076
Symbol:
ID: 8449696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4491421
End bp: 4492521
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 75%
IMG OID: 645043120
Product: Haloacid dehalogenase domain protein hydrolase
Protein accession: YP_003203355
Protein GI: 258654199
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0647] Predicted sugar phosphatases of the HAD superfamily
TIGRFAM ID: [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
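
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4491421, 4492521)
print(length_bp)                  # 1101
print(protein_length(length_bp))  # 366
```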


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.125935
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0189831
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCC TGGCGCAGGC CCACGATGCA CTGCTGCTCG ACCTGGACGG CACCGTGTAC 
CTGGGCGGGC AGCCCATCGA CCACGTGGCT CCGGCGCTGG TCCGGGCCGG GGTGCTCGGG
GCGCGGTCCG TCTTCGTCAC CAACAACGCC TCCCGCCCGC CGGCCGAGGT GGCGGCCGCG
CTGACCTCGA TGGGCGTTGC CGCCGAGGCC GACGACGTGT TGACCTCCCC GCAGGCGGCC
GCCGTCATGC TCGCCGACCG GCATCCGGCC GGCGCGAAGG TGCTGGTCAT CGGGGCGCCC
TGGCTGGAGG AGTCGGTCCG GCAGGCCGGT CTGCAACCGG TCCGGCTGGC CGAGGACGAG
CCGGTGGCCG TGGTGCAGGG CCATTCGCCC GACACCGGTT GGCGCAACCT GGCCGAGGGG
TGCATCGCGT TGCGGGCCGG GGCCGATTGG GTCGCCTGCA ACGTGGACAG CACCCTGCCG
ACGGACCGGG GCATGCTCCC CGGCAACGGC TCGATGGTGG CCGCCCTGGT CGCGGCCACC
GGCCTGCACC CGCGGGTGGC CGGCAAGCCC GAACGCCCGC TGCTCGACGC CGCCGTCCGG
CTCGTTGGCA GCACCCGCCC GCTGGTCGTC GGGGACCGGC TGGACACCGA CATCGCCTGC
GCGGTGGGCG CGTCCACGCC GAGCCTGATG GTGCTCACCG GCGTCTCGAC CGCCTCCGAC
CTGCTCGCCG CCGACCCCGG CCAGCGGCCG ACCTACGTGG CCTTCGACAT GCGCGGGTTG
GTCGAGGAGG ACCGGGCGGT CCGCATTCCC GGCCCGGGAT CACCCCGGAG CAGCCGGGAG
TGGACGGTCG GCACCGACTC CTCCGGTCTG GTCCTCAGCT CGGCCGCAGA CGCCGAACCC
AGCTCGGAAC CGGGCCCCGC GGCCGTCTCC GACGGTGCCG CCCTGCGGGC CCTGGCGCTG
CTCACGGCAG CCGCCTGGGC GTCCGGCCTC ACCGCCGTCC GGCCGCACGA CGCGCTCGCC
GCGGCCGCAC TGACCCGGTG CGGCATCCCG ATTCGACCGA CCCAATCTGG CCAGCGGGCA
TTCACCGACT CGCTCGACTG A
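
The GC content field in the gene information is presumably the fraction of G and C bases in this sequence. A minimal sketch of that calculation follows, assuming the input is plain DNA text grouped in whitespace-separated blocks as shown above. The function name is illustrative, and the example uses only the first two blocks of the gene sequence.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence: 13 of 20 bases are G or C.
print(gc_content("ATGACCATCC TGGCGCAGGC"))  # 0.65
```

Passing the whole gene sequence as one string gives the value for the full CDS.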
 
Protein sequence
MTILAQAHDA LLLDLDGTVY LGGQPIDHVA PALVRAGVLG ARSVFVTNNA SRPPAEVAAA 
LTSMGVAAEA DDVLTSPQAA AVMLADRHPA GAKVLVIGAP WLEESVRQAG LQPVRLAEDE
PVAVVQGHSP DTGWRNLAEG CIALRAGADW VACNVDSTLP TDRGMLPGNG SMVAALVAAT
GLHPRVAGKP ERPLLDAAVR LVGSTRPLVV GDRLDTDIAC AVGASTPSLM VLTGVSTASD
LLAADPGQRP TYVAFDMRGL VEEDRAVRIP GPGSPRSSRE WTVGTDSSGL VLSSAADAEP
SSEPGPAAVS DGAALRALAL LTAAAWASGL TAVRPHDALA AAALTRCGIP IRPTQSGQRA
FTDSLD
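
The protein sequence corresponds to the gene sequence translated with translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code. A minimal sketch of that translation follows, assuming an in-frame, unspliced CDS containing only the bases A, C, G and T. The names are illustrative, and the examples use the first and last rows of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACCATCC TGGCGCAGGC"))    # MTILAQ
print(translate("TTCACCGACT CGCTCGACTG A"))  # FTDSLD
```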