Gene Namu_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3504 
Symbol 
ID8449123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3847666 
End bp3848823 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID645042582 
Producthypothetical protein 
Protein accessionYP_003202818 
Protein GI258653662 
COG category 
COG ID 
TIGRFAM ID[TIGR03296] M6 family metalloprotease domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00612057 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.208979 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG CGCTGGTCAA CTCCACCCTG GTCCGCCCGG GACAACGGGA CTGGAGCCTG 
CGGGCGATCG CGGCCGAACA TGGCACCCCG GAGTACTCGC TGCGCGCGCT GCTGGCGCAG
CGCGACCGCA CCATCGTGAA GTCGTGGGCG GTCTGCCTGT TGACGTTCTC GCCGGTCACA
CCGCGCCTGA CGATGCCGAT GGACTGGTTC GACCGGATGT TCTTCGAACC GGAAGGGGTG
GCCGCGTTCT TCCGGACGAT GAGCGGCGGC CGGCTGCTGG TGGAGTGGCA GGTTTTCGGC
CCGCTGCCCC TGATGACCTT TCAGCAAAAG CAGCAATCGG CGAAGGCGGG GACCGAGGAC
GCCGACTACA CCAGGCTGGC CAAGGCCCAA GGGGTGCCGT TGGACCAGTT CGACCACGTG
ATGTGGATGC CCGACGACGG TGTCTCCACG GCCGGGACGG CGGCCGGGCA GAACAACCGG
TTTGTGGGTG CGCAGGACGT CGCCCCGCAG CTGGCCTGCC ATGAGATGAC CCACACGTTC
GGCGTCTGCT CGCACGCCGA CCGGTACACC CTCGACGACT ACGCCGACCC GTTCTGCATG
ATGGGTCGAC CGGGGGTTGC CCGCACCTGG GAGAGCCCCA CGCTCGCCTG GCCCGGTCGG
TTCCAGCACG GCATGGTCGG GCCCGGCCTG ATTGCCCCCT ACCTCTTCGT GGCCGGGTGG
TTGGACTACG GCCGCAACGT CACCCACTTC CAGGTGGCGG ACCTGGCCGA CGCGGTCGGC
CTGTCCTACC CGCTGTCGTG CAACGCCGGC GCCCCGCCGA TCGGGGACGG AAGGCGCATC
GCCATCACCG TCGGAGAACT CCCTCGACGG CCCATGGACA ACGCCCAGAT CTGGGTGGAG
TACCGCAGGC CGGAGGGCTT CGACCGTGGC ATTGCCGCAC CGCCGGGAGG AGCGGCGGAC
CTGCCCGCCT CCGGCGGGCT CGTAGTTCAC CGCGTCGGAT TCGGGTCGGC CCGGTGTCAA
AACGCCCTCC GCGCCGCGGT CACCTCGTGG CACCCGGCCG TCGTGGGTCA GACCATCCCG
TTGCCCGGGT ACGGCCAGAC CCTGCAAGTC ACGTCCGTCG ACGACGCGCG TCGTGAGGTC
ATGGTGACCG TGCGGTAG
 
Protein sequence
MDAALVNSTL VRPGQRDWSL RAIAAEHGTP EYSLRALLAQ RDRTIVKSWA VCLLTFSPVT 
PRLTMPMDWF DRMFFEPEGV AAFFRTMSGG RLLVEWQVFG PLPLMTFQQK QQSAKAGTED
ADYTRLAKAQ GVPLDQFDHV MWMPDDGVST AGTAAGQNNR FVGAQDVAPQ LACHEMTHTF
GVCSHADRYT LDDYADPFCM MGRPGVARTW ESPTLAWPGR FQHGMVGPGL IAPYLFVAGW
LDYGRNVTHF QVADLADAVG LSYPLSCNAG APPIGDGRRI AITVGELPRR PMDNAQIWVE
YRRPEGFDRG IAAPPGGAAD LPASGGLVVH RVGFGSARCQ NALRAAVTSW HPAVVGQTIP
LPGYGQTLQV TSVDDARREV MVTVR