Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4102 |
Symbol | |
ID | 8449725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4529061 |
End bp | 4530212 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645043148 |
Product | Aminocarboxymuconate-semialdehyde decarboxylase |
Protein accession | YP_003203380 |
Protein GI | 258654224 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.13287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0312348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGGTG CCGCCGGGCG GTGGCACAGC GACGCCGGCG CGGAGCCGGC CGTCCAACCG AACACGTGGG AGGTCGCGTT GACGAGCTCG AATCCGGCGG ACACGTCCGT CACTGCAGGA TCGACGGCGG CCGGATCGAC GGCGGCCGCA CCGACCGGGC CGCGGGTGGT CGACGTCCAT GCGCATGCGA TGCCGATGCC GCTGCTGCGC AGCCTGGCCG ACCGCGGCCT GGCCGACGTC TCCGCGGTCG ATCAGGGCAT CGTCCGGCTC GACCCGAAGG TCAGCGGCGT GGGTCCGGGA GCTCCGCTGC CGCTGGCCCG CTCCCAGCAC GACGTGGCCA CCCGGCTGGT CGAGATGGAC GCGGCCGGCG TCGACGTGCA CGCGGTGTCG TTGCCGCCGT TCCTGTTCGC CACCAACGCC GACGACGCGG GCTTCGCCAC CGGCATCGTG GCCCAGGGCA ACGACGAGCT GGCCGGCTAC GTCGCCGGCG CCCCCGATCG GTTGGTCGGC CTGGGCTACG TGCCGCTGGG CTGGCCCGGG GTGGCCGACG AGGCGGTACG CGTGCTCGAC GAGCTGGGCC TGGCCGGCAT CGCGATCGGC AGCCAGGGCG GCGGCAAGGA TCTGGACGAT CCGGTGAACG AGGATCTGTG GGCGTTGCTG GCCGAGCGGA ACACCTTCGT GTTCCTGCAC CCGTCGGGCA TGCCGGCCGG TCCGCGACTC AAGGACTACT GGATGCCGCA GCTGGTCGGG TATCCGATGG AGACGGCGAT CGCGGTGGCC CGGCTGGTGT TCAGCGGCAC CCTGGAGCGG TACCCGATCA CCCTGTGCCT GGCCCACGGC GGCGGCTGCG TGCCCTCGCT GCGCGGGCGG ATGGACATGG GCTGGGAGCG CAAGGACGTC GCCCACACCA ACGACCACCC GCCGACCCAC TACACCGATC GGCTCTACTA CGACACGGCG GTGTTCAACA CGACCGTGCT GAGCCGGATC GTGCAGGACG TGGGCGTCGA GCACGTGCTG ATGGGTACCG ACCACCCGTT CGAGCTGGGC GATCCGACGC CGCGAAAGAC CGTGGGCGAC CTGGGGCTGA GCGAGGCGGA CACCGCGGCC ATCCTGGGCG GCACGGCCAG CCGGTTGCTC GGGTTGGCCT GA
|
Protein sequence | MIGAAGRWHS DAGAEPAVQP NTWEVALTSS NPADTSVTAG STAAGSTAAA PTGPRVVDVH AHAMPMPLLR SLADRGLADV SAVDQGIVRL DPKVSGVGPG APLPLARSQH DVATRLVEMD AAGVDVHAVS LPPFLFATNA DDAGFATGIV AQGNDELAGY VAGAPDRLVG LGYVPLGWPG VADEAVRVLD ELGLAGIAIG SQGGGKDLDD PVNEDLWALL AERNTFVFLH PSGMPAGPRL KDYWMPQLVG YPMETAIAVA RLVFSGTLER YPITLCLAHG GGCVPSLRGR MDMGWERKDV AHTNDHPPTH YTDRLYYDTA VFNTTVLSRI VQDVGVEHVL MGTDHPFELG DPTPRKTVGD LGLSEADTAA ILGGTASRLL GLA
|
| |