Gene Information |
Locus tag | Namu_3221 |
Symbol | |
ID | 8448835 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3550059 |
End bp | 3551300 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645042300 |
Product | amidohydrolase |
Protein accession | YP_003202541 |
Protein GI | 258653385 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
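
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3550059, 3551300)
print(length)                  # 1242
print(protein_length(length))  # 413
```

Under these assumptions the computed values agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields.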
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000000027233 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000156349 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGCCTC GGATGATCGC ACTTCGTTCG TCCTCGCTGT TCGACGGCTC GGTGTTCTTC GACGACGGAG TGACCGTCGT CGTCGACGGC GAGTCGATCG CCGGCGTGCT CCGCGGGCAT CCCGACCTCG GGTCGGACGT CGAGGTGATC GAGTTGGGCG AGGCCACCGT GCTGCCCGGT CTCATCGACA CCCATGTCCA TCTGGTCGCC GGCAGCGGGG TGCGCGCCCT GGATCTGGTG GAGGGCTACT CGGACCAGGA GATCGAGGCC GTGGTCACCC GGTCCCTGGC CGCGCACCTG GCGGCCGGGG TGACGACGGT CCGCGACCTG GGGGACCGGC GGTTCGTGGT GGTCAATCGC CGGGACGACC AGCATGCCCG CCCGCTGACC ACTGCCCGGC CGTGGACACC GACCATCCTG GCCGCCGGAC CACCGCTGAC CACGCCCCGG GGCCATTGTC ACTACCTGGG CGGCGAGGTG TCCGGTCCGG TGGAGATCGA GGCCGCGGTG CAGGAGCGGA TCGACCGCGA GGTGGACGTG GTCAAGGTGA TGGCCAGTGG CGGGATGGCC ACTACCGGCA CCGACGTGAT GATGCCGCAG TTCTCCCTGG CCGAGATGCG GCTGATCGTC GACCTGGCGC ATGCCGCCGG GATCGCCGTG ACCGCGCACG CGCATGCCCT GCCCGCGGTG GAGATCGCGC TCGCCGCCGG GGTCGACGGG CTCGAGCACT GCAGTTGCCT GACCCCGCAG GGGCCACGGG TGTCCGACGA ACTGCTCGCG GTGCTGGCCG AACGTCAGGT GCCGATCGGG GCGGCTCTGA TGGCCCCACC ACCGGAAGCG TTCGAGCACG CTCCGCCCAA TATCAAGAAG GTGATGGCCC AGATGGGCAT GACCCCGGAG ACGATGCTGG AAAGCCGGCG GTCGATGGTG GGCCGGATGC ACGCGGCCGG GGTCCGGTTC GTCGGCGGTT CGGACGCCGG GATCGAGCCG TTCATGGCCC ACGGCCTGAT GCGCTCGGGT CTGGGCTTTC TGCTCTCCGC CGGGGCGTCG GTCAGTCAGA CCCTGGCCGC CGGCACCTCG CTGGCCGCTG CCGCTTGCGG GCTGACCCGA AAAGGGTTCC TGCGTCAGGG TTTCGACGCC GATCTGGTCG TCGTCGACGG ACGCTTCGAT TCCGACCTGG CGCCGCTGGC GCAGGTGCGT CAGGTCATGC TCGGTGGTCG GTTCGCCCCG GCCGGGCTAT GA
Protein sequence | MTPRMIALRS SSLFDGSVFF DDGVTVVVDG ESIAGVLRGH PDLGSDVEVI ELGEATVLPG LIDTHVHLVA GSGVRALDLV EGYSDQEIEA VVTRSLAAHL AAGVTTVRDL GDRRFVVVNR RDDQHARPLT TARPWTPTIL AAGPPLTTPR GHCHYLGGEV SGPVEIEAAV QERIDREVDV VKVMASGGMA TTGTDVMMPQ FSLAEMRLIV DLAHAAGIAV TAHAHALPAV EIALAAGVDG LEHCSCLTPQ GPRVSDELLA VLAERQVPIG AALMAPPPEA FEHAPPNIKK VMAQMGMTPE TMLESRRSMV GRMHAAGVRF VGGSDAGIEP FMAHGLMRSG LGFLLSAGAS VSQTLAAGTS LAAAACGLTR KGFLRQGFDA DLVVVDGRFD SDLAPLAQVR QVMLGGRFAP AGL
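
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the tool used to build the record. It assumes the gene sequence is already given in coding orientation (it begins with ATG although the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. Alternative start codons, which table 11 permits, are not handled here. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGACGCCTC GGATGATCGC ACTTCGTTCG"
print(translate(prefix))  # MTPRMIALRS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values that can be compared against the Protein sequence and GC content fields.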