Gene Namu_0116 details


Gene Information

Locus tag: Namu_0116
Symbol:
ID: 8445696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 127990
End bp: 129945
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 72%
IMG OID: 645039264
Product: Neprilysin
Protein accession: YP_003199539
Protein GI: 258650383
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3590] Predicted metalloendopeptidase
TIGRFAM ID:
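
The coordinates and lengths listed above are consistent with one another, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length_bp(127990, 129945)
print(length)                     # 1956
print(protein_length_aa(length))  # 651
```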


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAT CCCCCGCGGC CGATGTCATC GTCCCGCGAC CGCAGGACGA CCTGTTCCGG 
CACGTCAACG GGCCCTGGTT GGCGACCGCC GAGATCCCGG CCGACCGGTC CGCCGACGGC
GCCTTCTACC AGCTGCGCGA CGAGGCCGAA AAGGACAGTC GGGCGATCAT CGAGGACGCC
GCCGCCGCGG CCGACGGCGC CGAGCCGGGC AGCCCGGTGC AGCTGATCGG CGACCTGTAC
CGCAGCTTCA TGGACGTCGA GGCGGTGGAG CGGCAGGGCC TGGCGCCGAT CGCCGCGCGG
CTGACCGAGG TCGAGGGTGT GGACAGTCCG GCCGCGCTGA TGCGGACCCT GGGCCGGCTG
CGCCGGTCCG GCGTCGGCGG CGCGTTCGCC ATCGACGTGG ACACCGACCC GGGCGACCCC
GACCGGTACG TGCTCAACCT CTACCAGGGC GGCATCGGCC TGCCGGACGA GTCCTACTAC
TCCGACGCGG CGCACGCCGA CGTGCTGAGC GCCTACGCCG CGTTCCTGCC GAGCATCCTG
GAACTGGCCG GGATCCCGGA GAGCGCTGGC GCCGGTGCCG CGGTGGTCGA GCTGGAGACC
GCGGTGGCCG CCGGGCACTG GGACCGGGTC CGCTCGCGCG ACAGCTCGCA GACCTACAAC
CCCAAGGATC GGGCCGGCCT CGACGCGCTG CTGCCCGGCC CCCTCTGGGA CGCCTGGTTG
GACGGGCTGG GCGCCGACCC GTCGGTGCTC GATCAGGTCG TCGTCCGCCA GCCCGACTAC
TTCACCGCGC TGGCCGCGCT GCTCACCCCG GACCACCTGC CGGCCTGGCG GGCCTGGTTG
AGCTGGCAGA TCGTGCGGTC CCTGGCCCCG CTCGGGCCGG CCGAGCTGGT GGAGAAGAAC
TTCGACTTCT ACGGCCGCAC CCTCTCGGGC ACCCCCGAGC TGCGCGAGCG GTGGAAGCGC
GGCGTCGGCT TCGTCGAGAT GGCGGCCAAC GAGGCGGTCG GCCGGCTGTA CGTCGAGCGG
CACTTCCCGC CGGAGTCCAA GCGCCGGATG GACGAGCTGG TGGCCAACCT GCTGGCCGCC
TACCGCACGG AGATCGGCAA GCTGCCTTGG ATGGGGGAGC AGACCCGGGC CCGGGCGCTG
GAGAAGCTGG ACGCCTTCAC TCCCAAGATC GGCTATCCGG CCCGCTGGCG GGACTACACC
GCGCTGACCG TGGCCGCGGA TGACCTGATC GGCAACGCGG CTCGGGCGGC CGCGTTCGAG
CTCGACCGTG AGCTGGGCAA GCTGGGCGGC CCGGTGGACC GCGACGAGTG GTTCATGTCG
CCGCAGACCG TCAACGCCTA CTACAACCCG GGCATGAACG AGATCGTCTT CCCGGCCGCG
ATCCTGCAGC CGCCGTTCTT CGACCCCGAG GCCGACGACG CGGTGAACTA CGGCGGCATC
GGCGCGGTGA TCGGTCACGA GATCGGGCAC GGCTTCGACG ACCAGGGCTC CAAGTACGAC
GGGCGGGGCG CCCTGCAGGA CTGGTGGACT CCGGCCGACC GGGCCGCCTT CGAGCAGCTC
ACCGGCCGGC TGATCGACCA GTACTCGGCG CTGGAGCCGA AGAACACGCC CGGTCATCAC
GTCAACGGGG CGCTGACCAT CGGCGAGAAC ATCGGCGACG TGGGCGGGCT GGGCATCGCC
TACCAGGCGT GGCGGATCTC GCTGGGTGAC CAGTCGGCGC CGGTAATCGA CGGGCTGACC
GGGGCCCAGC GGTTCTTCCG CTCGTGGGCG ACCGTGTGGC GGCTCAAGAT GCGCGAGGCC
GAGCAGGTCC GGATGCTCTC GATCGACCCG CACTCACCGG CGGAATTCCG GTGCAACCAG
GTGGTCCGCA ACATCGCCGA GTTCCACGAG GCCTTCGACA CCCGGCCCAC CGACGGGCTG
TGGCTCGACG AGCAGGACCG GGTGCGGATC TGGTAG
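
The gene sequence is printed in space-separated blocks of 10 bases, so the layout whitespace has to be removed before any calculation. The following is a minimal sketch of that cleanup and of a GC-fraction calculation, applied only to the first row of the listing; the helper names are hypothetical, and pasting in the full sequence is left to the reader.

```python
def strip_layout(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a display-formatted sequence."""
    bases = strip_layout(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence listing (60 bases).
first_row = "ATGACCGCAT CCCCCGCGGC CGATGTCATC GTCCCGCGAC CGCAGGACGA CCTGTTCCGG"
print(len(strip_layout(first_row)))       # 60
print(round(gc_fraction(first_row), 2))   # 0.7
```

The first row alone gives 70% GC; the 72% in the Gene Information table refers to the whole gene.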
 
Protein sequence
MTASPAADVI VPRPQDDLFR HVNGPWLATA EIPADRSADG AFYQLRDEAE KDSRAIIEDA 
AAAADGAEPG SPVQLIGDLY RSFMDVEAVE RQGLAPIAAR LTEVEGVDSP AALMRTLGRL
RRSGVGGAFA IDVDTDPGDP DRYVLNLYQG GIGLPDESYY SDAAHADVLS AYAAFLPSIL
ELAGIPESAG AGAAVVELET AVAAGHWDRV RSRDSSQTYN PKDRAGLDAL LPGPLWDAWL
DGLGADPSVL DQVVVRQPDY FTALAALLTP DHLPAWRAWL SWQIVRSLAP LGPAELVEKN
FDFYGRTLSG TPELRERWKR GVGFVEMAAN EAVGRLYVER HFPPESKRRM DELVANLLAA
YRTEIGKLPW MGEQTRARAL EKLDAFTPKI GYPARWRDYT ALTVAADDLI GNAARAAAFE
LDRELGKLGG PVDRDEWFMS PQTVNAYYNP GMNEIVFPAA ILQPPFFDPE ADDAVNYGGI
GAVIGHEIGH GFDDQGSKYD GRGALQDWWT PADRAAFEQL TGRLIDQYSA LEPKNTPGHH
VNGALTIGEN IGDVGGLGIA YQAWRISLGD QSAPVIDGLT GAQRFFRSWA TVWRLKMREA
EQVRMLSIDP HSPAEFRCNQ VVRNIAEFHE AFDTRPTDGL WLDEQDRVRI W
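
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard assignments are used here. It is illustrative only, handles just unambiguous A/C/G/T input, and is run on the first row of each listing rather than on the full record. All names in it are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_layout(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues for display."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = strip_layout(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence and the matching start of the protein sequence.
first_dna_row = "ATGACCGCAT CCCCCGCGGC CGATGTCATC GTCCCGCGAC CGCAGGACGA CCTGTTCCGG"
protein_start = "MTASPAADVI VPRPQDDLFR"

print(translate(first_dna_row))                                 # MTASPAADVIVPRPQDDLFR
print(translate(first_dna_row) == strip_layout(protein_start))  # True
```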