Gene Information |
Locus tag | Namu_0809 |
Symbol | |
ID | 8446401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 893851 |
End bp | 895728 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645039946 |
Product | peptidase M28 |
Protein accession | YP_003200209 |
Protein GI | 258651053 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
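The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 893851
end_bp = 895728
gene_length_bp = 1878
protein_length_aa = 625


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions, 895728 - 893851 + 1 = 1878 bp and 1878 / 3 - 1 = 625 aa, so the reported coordinates, gene length, and protein length agree with each other.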
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCCA TGGAACTGCC GGCCCGGGCG GACGCCTCCG ACCTGGATCA GGCCCTGCCG CGGCAGCGCA CGGGCGGGCG GGCGCGGCCG GCCCTGCGGC TGATCCTGTT CGATCTGGGC GACACCCTGG AAAGCGGCGA GCAGTTGCGA CCCGGCGCCC TGACAACCCT GCGGGCGATC GAGAAGCTGG GCGACGTCGC GCACGTGGCC CTGCTTTCCG ATGTCGAGCA GCCGGCCTCG GCGCGGGACG AATCGCGGAT CCGCCACGAA TACGAGGAGC TGCTGGGCCG GCTGGGCATC CGGGCGTTCT TCGAGCCACT GGCCCGCTGG ATCACCCTGT CCAGCGAGGT CGGCGTGCGC AAGCCGGCGC CGGCCACCTT TCGGCGGGCG ATGCGCAAGG CGGGCGCCGA CCTCGGCTTC GGCGACGTCA TGTTCATCAC CGAGAACGAG GGTCATGTCC GGCGGGCCCG GGAGCTCGGC ATGCGGGCCG TCCAGGTCCC CGGGCCCGGC GGTCCCGCGG CCGGGGCGGA CATCACCGGG CTGGAGGAGT TGATCGGTGT CGTCGAGCAG TTCGTCAGTG GCCGGGAGGG CGAGGGCCCC GCGGCGGCCA CCGGTGAACA GGACGGGGCG ACGATCCACC TTGCGGTGCT CGCCCCGGCC GGCGCTTCGG ACGAGGCCGG TGGCTCGATC GGGCACCGGG TGCGCCTGGG GGCGCTGTCG GTCCAGATCG ATACCGAACC GATCGGCACC GACTTGATCG ACACCGAACT CACTGGTCCC CAGGCTTCCT CCGACGCGCG CGGGGCGCCG CGGTCCCTGG ACCGGTTGCA CCTGGTGGTG CAGAACGGCC GCACCTTCCA GCAGGAGCAC CCGGACGTGC CGGTGCTGGC GGACCAGGGC CGCTACCTCG TCGTCGACCT CGACCCGGCA ATCGCGCGCG AGCTGGACGG CCCCGATCAG GTCTGTTTCA GCGTGCTGCC CCTGCCGCTG AACACGACTG TCTTCGCCCG GGCGGTGGCG GAGCCGGTGG CCGAGCAGCC GTGGATCCGG GAGCTGGTCG ACCGGGTGGC GGCGGGGCGA TTCCGGATGG ATCTGGACAA GCTGGTCGCC TTCGGCAGCC GGTATTCCAC GAGCAGCGCC TACCGGGCCG CGGCCACCGC CACCCGCGAC GAACTGGCCG CCCAGGGCTA CGCGGCCGTC CTGGTCCCGA TCTCGGTGCA GGGTCGGCAG TCGTGGAACG TCGTCGCCGA CCACCCGGGC AGCGGGCCGC AGCCCCGCCC GGTGGTGCTG GTGACGGCCC ACCTGGACTC GATCAACCTG GCCGGCGGCC CGCAGGCCAT GGCGCCCGGC GCCGACGACA ACGCGTCCGG ATGTGCCGGC CTGCTCACCT TCGCCCGGGT GTTCGGCACC CACCCGGGGG CGGCCGACCT GCGGTTGATC CTGTTCGGCG GGGAGGAGCA GGGCTTGTTC GGCAGCCGCC AGTACGTGGC CGGGCTCGAT CCGGCCGAGC GTGCCCGGAT CGCGGCCGTG GTCAACATGG ACATGATCGG CACGCTGACC ACGCAGCGGC CGACCGTGCT GATCGAGGGC GCCGCGGTGT CCCGGCCGGT GATGGACGGC TTGAGCGCGG CCGCCGCGAC CTACACCAGC CTGATCGTGC AGACCAGCCT GCACCCGTAC AACAGCGACC ACGTGCCGTT CCTGGACGCC GCGATCCCGG CCGTGCTGAC GATCGAGGGG GCGGACGGCG CCAACGACCG GGTGCACACC GACCAGGACC TGGCCCGGTT CGTCGACGAC 
GAGCTGGCCG TGCAGATCCT GCGGATGAAC GTGGCCTTCG TCGCCGAGCA GCTGGGCCGG GCCGGTGACC CCGGCTAA
|
Protein sequence | MAAMELPARA DASDLDQALP RQRTGGRARP ALRLILFDLG DTLESGEQLR PGALTTLRAI EKLGDVAHVA LLSDVEQPAS ARDESRIRHE YEELLGRLGI RAFFEPLARW ITLSSEVGVR KPAPATFRRA MRKAGADLGF GDVMFITENE GHVRRARELG MRAVQVPGPG GPAAGADITG LEELIGVVEQ FVSGREGEGP AAATGEQDGA TIHLAVLAPA GASDEAGGSI GHRVRLGALS VQIDTEPIGT DLIDTELTGP QASSDARGAP RSLDRLHLVV QNGRTFQQEH PDVPVLADQG RYLVVDLDPA IARELDGPDQ VCFSVLPLPL NTTVFARAVA EPVAEQPWIR ELVDRVAAGR FRMDLDKLVA FGSRYSTSSA YRAAATATRD ELAAQGYAAV LVPISVQGRQ SWNVVADHPG SGPQPRPVVL VTAHLDSINL AGGPQAMAPG ADDNASGCAG LLTFARVFGT HPGAADLRLI LFGGEEQGLF GSRQYVAGLD PAERARIAAV VNMDMIGTLT TQRPTVLIEG AAVSRPVMDG LSAAAATYTS LIVQTSLHPY NSDHVPFLDA AIPAVLTIEG ADGANDRVHT DQDLARFVDD ELAVQILRMN VAFVAEQLGR AGDPG
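The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons by hand. The following is a minimal sketch, not the pipeline that produced this record. It builds the codon table from the usual compact 64-letter string; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the same mapping is used here. The function name `translate` and the variables `first_blocks` and `last_blocks` are illustrative; the two strings are the first three and last two 10-base blocks of the gene sequence as printed above.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used on this page) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGGCAGCCA TGGAACTGCC GGCCCGGGCG"  # bases 1-30
last_blocks = "GCCGGTGACC CCGGCTAA"                # bases 1861-1878

print(translate(first_blocks))  # MAAMELPARA
print(translate(last_blocks))   # AGDPG
```

The two results match the first ten and last five residues of the protein sequence, and the final codon TAA is a stop, which is consistent with the gene length including the stop codon.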