Gene Information |
Locus tag | Namu_3972 |
Symbol | |
ID | 8449591 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4384410 |
End bp | 4385549 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645043017 |
Product | HAD-superfamily hydrolase, subfamily IA, variant 3 |
Protein accession | YP_003203253 |
Protein GI | 258654097 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
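
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp rows of this record.
length_bp = gene_length(4384410, 4385549)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Both results match the Gene Length and Protein Length rows of the record.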
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00807242 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0158039 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTCTCG CCCAGGAATT GGGCCTGCCC GTGCTCGACC TGGACACCCT GACCAACCCG CTTCTCGACG CGTTGCCGGT CTCGCTGCTC GCCGAACACT GGCTCTCGCC GGCCCACGGC ACCGCCGTCC GCGACGGCCG GTACGCCGCG CTGCGGGCGG TGGCCGCGCA GGTGGTGGCC ACCGCCGGCG GGGCAGTGCT GGTCGCCCCG TTCACCGCCG AATTGGCCGG CGGTGACGCG TGGGTCCGGC TCCGAACCGC GCTGGCGCCG ACCGACCCGC TGGTGGTGTG GATCCGCGGG GATGCCGAGC TGTTCGCGCG GCGGCGGGGG CAGCGGGCGG CGGAGCGCGA CCGGCACCGG CCGGACGGGC CCGCGCCGGT GCCGCCGGCC GTCGATCATC TGGCCGTCGA CGCCGACCTC ACCACGGCAC AGCAGGTTTT CCGGGTCCGG CGGGCGCTGG GCGACCGGAT TCCCGTCGCA CCGGACGCGC CGATCCTCGG CCTCGACGTC GACGCGCTGC TGTTCGACCT GGATGGCACG CTGGCCGACT CGACCGCATC GGTGGCTCGC TGCTGGGACC GGTTGGCCCG CGAGTTCGGG GCGGCGCCGG CTCTGGTGCA GGCCAATCAC GGACAGCCGG CCGACGTCCT GGTCGGCAAG CTGGTCGGAC CCGATCCGCA AGCCGCCGGA CGGGCCCGGA TCCGGCAGCT GGAGATCGAG GACGCGCCCT CGATCGACCG GATACCCGGT GCCGCCGAAC TCTTCTCGTC CGTCCCGGAG TCGCGCCGGG CCATCGTCAC CTCCGGCGTC CGCGAGCTGG CGGCCGCGCG CCTGCGCGCC GCCGGGCTGC CGATCCCGGC CACCATGGTC ACCTTCGACG ACGTGATGCG CGGAAAGCCC GATCCGGAGC CCTATCTGCT GGCCGCGTCG CGGCTGGGGG TGGACCCGGC GCGCTGCCTG GTGTTCGAAG ACGCGCCGGC CGGCATCGCG TCGGCCCGGG CGGCCGGCTG CCGGGTGGTG GCGGTCCTGG GCACGGCCCC CGCGGACGAG TTGGTCGGGG CCGAGCTGAT CGTGGACGCA CTGGACCGGC TGACCGTGGT GCCGCACGGA TCGGCGCTGC GGTTGGCCGC CACCGGCTGA
Protein sequence | MALAQELGLP VLDLDTLTNP LLDALPVSLL AEHWLSPAHG TAVRDGRYAA LRAVAAQVVA TAGGAVLVAP FTAELAGGDA WVRLRTALAP TDPLVVWIRG DAELFARRRG QRAAERDRHR PDGPAPVPPA VDHLAVDADL TTAQQVFRVR RALGDRIPVA PDAPILGLDV DALLFDLDGT LADSTASVAR CWDRLAREFG AAPALVQANH GQPADVLVGK LVGPDPQAAG RARIRQLEIE DAPSIDRIPG AAELFSSVPE SRRAIVTSGV RELAAARLRA AGLPIPATMV TFDDVMRGKP DPEPYLLAAS RLGVDPARCL VFEDAPAGIA SARAAGCRVV AVLGTAPADE LVGAELIVDA LDRLTVVPHG SALRLAATG
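
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be stripped before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above (illustrative subset only).
seq = clean_sequence("ATGGCTCTCG CCCAGGAATT")
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.55
```

Applied to the full gene sequence, the same two helpers can be used to compare against the Gene Length and GC content rows.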