Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4076 |
Symbol | |
ID | 8449696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4491421 |
End bp | 4492521 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645043120 |
Product | Haloacid dehalogenase domain protein hydrolase |
Protein accession | YP_003203355 |
Protein GI | 258654199 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.125935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0189831 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCC TGGCGCAGGC CCACGATGCA CTGCTGCTCG ACCTGGACGG CACCGTGTAC CTGGGCGGGC AGCCCATCGA CCACGTGGCT CCGGCGCTGG TCCGGGCCGG GGTGCTCGGG GCGCGGTCCG TCTTCGTCAC CAACAACGCC TCCCGCCCGC CGGCCGAGGT GGCGGCCGCG CTGACCTCGA TGGGCGTTGC CGCCGAGGCC GACGACGTGT TGACCTCCCC GCAGGCGGCC GCCGTCATGC TCGCCGACCG GCATCCGGCC GGCGCGAAGG TGCTGGTCAT CGGGGCGCCC TGGCTGGAGG AGTCGGTCCG GCAGGCCGGT CTGCAACCGG TCCGGCTGGC CGAGGACGAG CCGGTGGCCG TGGTGCAGGG CCATTCGCCC GACACCGGTT GGCGCAACCT GGCCGAGGGG TGCATCGCGT TGCGGGCCGG GGCCGATTGG GTCGCCTGCA ACGTGGACAG CACCCTGCCG ACGGACCGGG GCATGCTCCC CGGCAACGGC TCGATGGTGG CCGCCCTGGT CGCGGCCACC GGCCTGCACC CGCGGGTGGC CGGCAAGCCC GAACGCCCGC TGCTCGACGC CGCCGTCCGG CTCGTTGGCA GCACCCGCCC GCTGGTCGTC GGGGACCGGC TGGACACCGA CATCGCCTGC GCGGTGGGCG CGTCCACGCC GAGCCTGATG GTGCTCACCG GCGTCTCGAC CGCCTCCGAC CTGCTCGCCG CCGACCCCGG CCAGCGGCCG ACCTACGTGG CCTTCGACAT GCGCGGGTTG GTCGAGGAGG ACCGGGCGGT CCGCATTCCC GGCCCGGGAT CACCCCGGAG CAGCCGGGAG TGGACGGTCG GCACCGACTC CTCCGGTCTG GTCCTCAGCT CGGCCGCAGA CGCCGAACCC AGCTCGGAAC CGGGCCCCGC GGCCGTCTCC GACGGTGCCG CCCTGCGGGC CCTGGCGCTG CTCACGGCAG CCGCCTGGGC GTCCGGCCTC ACCGCCGTCC GGCCGCACGA CGCGCTCGCC GCGGCCGCAC TGACCCGGTG CGGCATCCCG ATTCGACCGA CCCAATCTGG CCAGCGGGCA TTCACCGACT CGCTCGACTG A
|
Protein sequence | MTILAQAHDA LLLDLDGTVY LGGQPIDHVA PALVRAGVLG ARSVFVTNNA SRPPAEVAAA LTSMGVAAEA DDVLTSPQAA AVMLADRHPA GAKVLVIGAP WLEESVRQAG LQPVRLAEDE PVAVVQGHSP DTGWRNLAEG CIALRAGADW VACNVDSTLP TDRGMLPGNG SMVAALVAAT GLHPRVAGKP ERPLLDAAVR LVGSTRPLVV GDRLDTDIAC AVGASTPSLM VLTGVSTASD LLAADPGQRP TYVAFDMRGL VEEDRAVRIP GPGSPRSSRE WTVGTDSSGL VLSSAADAEP SSEPGPAAVS DGAALRALAL LTAAAWASGL TAVRPHDALA AAALTRCGIP IRPTQSGQRA FTDSLD
|
| |