Gene Namu_3972 details


Gene Information

Locus tag: Namu_3972
Symbol:
ID: 8449591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4384410
End bp: 4385549
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 75%
IMG OID: 645043017
Product: HAD-superfamily hydrolase, subfamily IA, variant 3
Protein accession: YP_003203253
Protein GI: 258654097
COG category: [R] General function prediction only
COG ID: [COG0637] Predicted phosphatase/phosphohexomutase
TIGRFAM ID: [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
            [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
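
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4384410, 4385549)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Under these assumptions the computed values agree with the gene length (1140 bp) and protein length (379 aa) listed in the record.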


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.00807242
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0158039
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCTCG CCCAGGAATT GGGCCTGCCC GTGCTCGACC TGGACACCCT GACCAACCCG 
CTTCTCGACG CGTTGCCGGT CTCGCTGCTC GCCGAACACT GGCTCTCGCC GGCCCACGGC
ACCGCCGTCC GCGACGGCCG GTACGCCGCG CTGCGGGCGG TGGCCGCGCA GGTGGTGGCC
ACCGCCGGCG GGGCAGTGCT GGTCGCCCCG TTCACCGCCG AATTGGCCGG CGGTGACGCG
TGGGTCCGGC TCCGAACCGC GCTGGCGCCG ACCGACCCGC TGGTGGTGTG GATCCGCGGG
GATGCCGAGC TGTTCGCGCG GCGGCGGGGG CAGCGGGCGG CGGAGCGCGA CCGGCACCGG
CCGGACGGGC CCGCGCCGGT GCCGCCGGCC GTCGATCATC TGGCCGTCGA CGCCGACCTC
ACCACGGCAC AGCAGGTTTT CCGGGTCCGG CGGGCGCTGG GCGACCGGAT TCCCGTCGCA
CCGGACGCGC CGATCCTCGG CCTCGACGTC GACGCGCTGC TGTTCGACCT GGATGGCACG
CTGGCCGACT CGACCGCATC GGTGGCTCGC TGCTGGGACC GGTTGGCCCG CGAGTTCGGG
GCGGCGCCGG CTCTGGTGCA GGCCAATCAC GGACAGCCGG CCGACGTCCT GGTCGGCAAG
CTGGTCGGAC CCGATCCGCA AGCCGCCGGA CGGGCCCGGA TCCGGCAGCT GGAGATCGAG
GACGCGCCCT CGATCGACCG GATACCCGGT GCCGCCGAAC TCTTCTCGTC CGTCCCGGAG
TCGCGCCGGG CCATCGTCAC CTCCGGCGTC CGCGAGCTGG CGGCCGCGCG CCTGCGCGCC
GCCGGGCTGC CGATCCCGGC CACCATGGTC ACCTTCGACG ACGTGATGCG CGGAAAGCCC
GATCCGGAGC CCTATCTGCT GGCCGCGTCG CGGCTGGGGG TGGACCCGGC GCGCTGCCTG
GTGTTCGAAG ACGCGCCGGC CGGCATCGCG TCGGCCCGGG CGGCCGGCTG CCGGGTGGTG
GCGGTCCTGG GCACGGCCCC CGCGGACGAG TTGGTCGGGG CCGAGCTGAT CGTGGACGCA
CTGGACCGGC TGACCGTGGT GCCGCACGGA TCGGCGCTGC GGTTGGCCGC CACCGGCTGA
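
A GC content figure such as the one reported for this gene is the fraction of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from text laid out in 10-base blocks like the sequence above. The helper name `gc_content` is hypothetical, and the example uses only the first 20 bases rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces and newlines between blocks
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 20 bases of the gene sequence, kept in the record's 10-base blocks.
print(gc_content("ATGGCTCTCG CCCAGGAATT"))  # 0.55
```

Passing the complete gene sequence to the same function gives the value for the whole gene.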
 
Protein sequence
MALAQELGLP VLDLDTLTNP LLDALPVSLL AEHWLSPAHG TAVRDGRYAA LRAVAAQVVA 
TAGGAVLVAP FTAELAGGDA WVRLRTALAP TDPLVVWIRG DAELFARRRG QRAAERDRHR
PDGPAPVPPA VDHLAVDADL TTAQQVFRVR RALGDRIPVA PDAPILGLDV DALLFDLDGT
LADSTASVAR CWDRLAREFG AAPALVQANH GQPADVLVGK LVGPDPQAAG RARIRQLEIE
DAPSIDRIPG AAELFSSVPE SRRAIVTSGV RELAAARLRA AGLPIPATMV TFDDVMRGKP
DPEPYLLAAS RLGVDPARCL VFEDAPAGIA SARAAGCRVV AVLGTAPADE LVGAELIVDA
LDRLTVVPHG SALRLAATG
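
The protein sequence corresponds to the gene sequence read codon by codon. As a minimal sketch, the code below builds a codon table and translates the first 30 bases of the gene. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the set of permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
# Table 11 shares these assignments with the standard code; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop the spaces and newlines between blocks
    protein = []
    for i in range(0, len(bases) - 2, 3):  # any trailing partial codon is ignored
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence (10 codons).
print(translate("ATGGCTCTCG CCCAGGAATT GGGCCTGCCC"))  # MALAQELGLP
```

The output matches the first ten residues of the protein sequence above. A library such as Biopython offers the same operation with fuller handling of ambiguity codes and alternative tables.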