Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1914 |
Symbol | |
ID | 8447521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2108638 |
End bp | 2109423 |
Gene Length | 786 bp |
Protein Length | 261 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645041044 |
Product | HAD-superfamily hydrolase, subfamily IIA |
Protein accession | YP_003201292 |
Protein GI | 258652136 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01457] HAD-superfamily subfamily IIA hydrolase, TIGR01457 [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.00534341 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.000838349 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACACGCC AGCTCGGGGT CATCTCCGAC ATGGACGGGG TGATCTACCG AGGCAAGCAG GCCGTGCCCG GCGCCCAGGC GTTCATCGAC CGGCTGCGGG AGCGCGGCGT TGGCTTCGTG TTCCTGACCA ACAACTCCGA GCAGACCCCG CTGGATCTGG TCCGCAAGCT CGCCGGGCTG GGCTTTCAGG GCCTGACCGA GCAGAACTTC ATCACCTCGG CGATGGCCAC CGCGAAGTTC CTGCATTCGC AGCGACCCCG CGGCACCGCC TACGTCATCG GCGGCGGCGC GCTGTCGGCC GAGCTGTACA AGGTGGGCTA CTCGATCACC GACAGCAATC CCGATTACGT CGTGGTCGGC AAGACCTCCG GGTTCGCGTT CCCCCAGCTG CGCAAGGCGT CCGCGCTCAT CGACAAGGGG GCCCGGTTCA TCGGCACCAA CCCGGATCTG GTCGATCCGG TGGAGGGCGG CACCGAGCCG GCCGCCGGGG TGCTGCTGGC CTCGATCGAG GCGGCCACCG GGATGAAGCC ATACGTGGTG GGCAAGCCGA ACTCGCTGAT GATGATCTAC GCCCAGGAGA TGCTCGGCGT GCCGGCCCGG GACTGCGTGA TGATCGGCGA CCGGATGGAC ACCGACGTGG TCGGCGGCCT GGAGGCCGGC ATGCGCACCT GCCTGGTGCT GTCCGGGGTG TCCGACGCGC AGACGGTGAA CCGGTTCCCC TACCGGCCCA GCTTCGTCTA CGACAGCGTC GCCGACATCG ATCCGGACGA GCTGGCGGCG GGCTGA
|
Protein sequence | MTRQLGVISD MDGVIYRGKQ AVPGAQAFID RLRERGVGFV FLTNNSEQTP LDLVRKLAGL GFQGLTEQNF ITSAMATAKF LHSQRPRGTA YVIGGGALSA ELYKVGYSIT DSNPDYVVVG KTSGFAFPQL RKASALIDKG ARFIGTNPDL VDPVEGGTEP AAGVLLASIE AATGMKPYVV GKPNSLMMIY AQEMLGVPAR DCVMIGDRMD TDVVGGLEAG MRTCLVLSGV SDAQTVNRFP YRPSFVYDSV ADIDPDELAA G
|
| |