Gene Information |
Locus tag | Namu_5199 |
Symbol | |
ID | 8450830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5795525 |
End bp | 5796778 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645044230 |
Product | Formamidase |
Protein accession | YP_003204454 |
Protein GI | 258655298 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
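The coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5795525, 5796778)
print(length)                     # 1254
print(protein_length_aa(length))  # 417
```

Both values agree with the Gene Length and Protein Length fields of this record.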
Plasmid Coverage information |
Num covering plasmid clones | 62 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCGAGG TGATCTTCCC GCTCGATTCG AGCAAGAAGT TCGAGGACCA GGAGAAGGTG GGCCACAACC GGTGGCACCC GGAGATCCCG CCGGTGGCTA CGGTCAAGCC GGGCGACAGT TTCCGGGTGC ACTGCCGGGA ATGGTTCGAC GGGGCCATCG TCAACGACGA CTCGGCCGAC GACATCCTCA ATGCCCCGCT CAAGACGGTG CACAAGCTCT CCGGCCCGTT CCGGGTGGAG GGAGCCAAGC CCGGTGACCT GCTCATCGTC GACATCCTCG ACGTCGGGCC GATTCCCGGA GAGGACTCGG GACCGCTGGC CGGTCAGGGT TGGGGTTACA CCGGCATCTT CGCCAAGCGC AACGGCGGCG GCTTCCTGAC CGACCAGTTC CCCGGCGCCT ACAAGGCCAT CTGGGACTTC CGCGGCCAGA TCGCGACATC CCGCCACGTG CCCGGGGTCT CGTTCGCCGG ACTGATCCAT CCCGGGTTGA TGGGCACCGC GCCCTCGGCC GAGCTGCTGG CCACCTGGAA TCGCCGCGAG CAGGCGCTGA TCGACACCGA CCCGAATCGG GTTCCGCCGC TGGCCCTTCC GCCGGAGCCG GAGTTCGCCG TGCTGGGCTC GCTGCCGGAG TCGGAATACG CCCGGGTGGC CGGGGAGGCC GCGCGGACCG CCCCGCCGCG GGAGAACGGC GGCAACCAGG ACATCAAGAA CCTGTCCAAG GGCACCCGGG TCTTCTACCC GGTGTTCGTG GACGGGGCCA ACCTCTCGCT CGGCGACCTG CACTTCTCCC AGGGCGACGG CGAGATCACC TTCTGCGGCG CCATCGAGAT GGGCGGCTTC ATCGACCTGC ATGTCGACCT GATCAAGGAC GGCATGAGCA CCTACGGGGT GTCGGAGAAC GCGATCTTCC TGCCCGGCAC CGTCGACCCG AGATTCAGCG AATGGATCGC CTTCTCGGGG ACCTCGGTCA CCCTGGACGG CGAACAGCGT TACCTGGACT CGCACCTGTC CTACCAGCGG GCCTGCCTGC ACGCCATCGA CTACCTGACC AAGTTCGGTT ACTCGCCCGA GCAGGCGTAC CTGATCCTGG GCGCCGCGCC GATCGAGGGC CGGCTCTCCG GGGTCGTGGA CATCCCGAAT GCCTGTGCCA CCGTGTACAT CCCGACCTCG ATCTTCGACT TCGACGTGCG ACCCAGCGCG TCCGGGCCGG CGCAGATCGA TCCGGGCCCG GGAGCGCCGC ACTCGGCCGG GTGA
Protein sequence | MPEVIFPLDS SKKFEDQEKV GHNRWHPEIP PVATVKPGDS FRVHCREWFD GAIVNDDSAD DILNAPLKTV HKLSGPFRVE GAKPGDLLIV DILDVGPIPG EDSGPLAGQG WGYTGIFAKR NGGGFLTDQF PGAYKAIWDF RGQIATSRHV PGVSFAGLIH PGLMGTAPSA ELLATWNRRE QALIDTDPNR VPPLALPPEP EFAVLGSLPE SEYARVAGEA ARTAPPRENG GNQDIKNLSK GTRVFYPVFV DGANLSLGDL HFSQGDGEIT FCGAIEMGGF IDLHVDLIKD GMSTYGVSEN AIFLPGTVDP RFSEWIAFSG TSVTLDGEQR YLDSHLSYQR ACLHAIDYLT KFGYSPEQAY LILGAAPIEG RLSGVVDIPN ACATVYIPTS IFDFDVRPSA SGPAQIDPGP GAPHSAG
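The sequences above are displayed in space-separated blocks of ten characters. Before computing a length or a GC fraction, the blocks need to be joined. A minimal sketch, with hypothetical helper names and only the first two blocks of the gene sequence as sample input:

```python
def strip_blocks(seq: str) -> str:
    """Join whitespace-separated display blocks into one uppercase string."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


# First two display blocks of the gene sequence in this record.
sample = "ATGCCCGAGG TGATCTTCCC"
print(strip_blocks(sample))
print(len(strip_blocks(sample)))
print(gc_fraction(sample))
```

Applied to the full gene sequence, the same helpers could be used to compare the result against the Gene Length and GC content fields.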