Gene Namu_3898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3898 
Symbol 
ID8449517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4298061 
End bp4299389 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID645042944 
ProductPeptidase M23 
Protein accessionYP_003203180 
Protein GI258654024 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0247447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0868297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCC TGCCGCGACG CCTGCTGCTC GCGCTGTCCG CCGCCGCGTC CGTCGCGCTG 
GTGGCCGGTT GCACCTCCGG GACCGAATCC GGTTCCGGCT CCAGTCACGA CGAGGGCGGC
TCGGCCGCCG CCACGGCCAC ATCGTTGACG ACATCTTCGG CGGCGCCGGT GGTCTCGCCG
GCGGAGGCCG AGCTGACCCC GGTGGTCGGC ACCGTGACCA CCGAACCGGC TCCGGTGATG
GGCAGCGACG GCAACGTGTA CCTGGCCTAC GAGCTGTCGA TCGTGAACGC CGCCGGGGCC
CCCGTGGTGA TCAAGGGCGT GCGCGTGCGC GACGCCGACA CCGGTGAGGT GGTCCAGGAA
CGCTCCGGCG CCGCACTGCT GTCGACGTTC AAGGCGACCG GCGCCGGGGC CGCCACGACG
GCGCCGACCG AGGCCACCCT CAACGGTGGG CAACACGGGT TCATCTGGCT CTCGCCCTCT
TTCGACCCGA CCCAGGCGGT GCCGCACGCC CTGGTGCACG AGCTCGAGCT CAGCTACGCC
AACCCGCCGA ACGCGTTGAT CGCCCCCTCG TCCACCGAGA CCATCGCGCC GACCCCGGTG
CGGTCCAAGC CGGCTCCGGT GATCGCCCCG CCGCTGCAGG GCGACAACTG GTTCGACGGC
AACGGATGCT GCGACGAGGT GACCCCGCAC CGCGGGGCGG CCAACCCGGT CGACGGGCAG
TTCTACTTCG CCGAGCGGTT CGGCGTCGAC TGGGTGCAGC TGGACGCGCA GGGCCGGTTG
CTCGTCGGCG ATCCGACCTC CCTGTCCAGT TACCCGTACT ACGGCGCCCC GATCACCGCG
GTGGCCGACG GCGAGATCGT GGCCGTGCAC GACGGCGAGG TGACGCAGAC GCCGGGTTCG
TCACCGGCGG TGGGATCGCT GCAGGTGACC CAGTACGGCG GCAACTACGT GGTGCAGCGC
TTCACCCAGG GCGGCGAGAT CTACTACGCC TTCTACGCGC ACCTGGAACC GGGCAGCATG
GACGCCTTGC AGGTCGGCCA GCAGGTCGCC ACCGGCGGTG CGATCGGCAA GCTGGGCAAC
ACCGGCAACA CCGACTCCCC GCACCTGCAC TTCCACGTGA TGGACGGCCC GGATCCGTTG
GCCAGCAACG GGTTGCCGTA CCGGTTCAGC TCGTTCCAGC TGGTCGGCCG GGCCACCGGC
GACGATGCGC TGCTGCCGCT GTTCACCGGC GGCGCGCTGA CCCTGGCGCC CGGCGCGGCG
TCCGGGCCGC GCACCGACGA TCTGCCGCTG TACCTTGATC TGGTCGACTT CCCGGCCCCG
ACCGGCTGA
 
Protein sequence
MSLLPRRLLL ALSAAASVAL VAGCTSGTES GSGSSHDEGG SAAATATSLT TSSAAPVVSP 
AEAELTPVVG TVTTEPAPVM GSDGNVYLAY ELSIVNAAGA PVVIKGVRVR DADTGEVVQE
RSGAALLSTF KATGAGAATT APTEATLNGG QHGFIWLSPS FDPTQAVPHA LVHELELSYA
NPPNALIAPS STETIAPTPV RSKPAPVIAP PLQGDNWFDG NGCCDEVTPH RGAANPVDGQ
FYFAERFGVD WVQLDAQGRL LVGDPTSLSS YPYYGAPITA VADGEIVAVH DGEVTQTPGS
SPAVGSLQVT QYGGNYVVQR FTQGGEIYYA FYAHLEPGSM DALQVGQQVA TGGAIGKLGN
TGNTDSPHLH FHVMDGPDPL ASNGLPYRFS SFQLVGRATG DDALLPLFTG GALTLAPGAA
SGPRTDDLPL YLDLVDFPAP TG