Gene Namu_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1371 
Symbol 
ID8446967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1515711 
End bp1516820 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content76% 
IMG OID645040503 
ProductPeptidoglycan-binding LysM 
Protein accessionYP_003200762 
Protein GI258651606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0982128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.681396 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACG ATGTCAAACG CAAGGATTCG ACGGCAAAGC AATGGTCGGC ACGCCTGACC 
ATGCTGCTGT GGGTCGGCGG GCTGGCGGTC CTGGTGACGC AACTGCCCCC GCCGGGCCAG
CTCTGGGCCG CGGTTTCCGG ATCGGCCCCG GTGACCGCGC CTGCGCCACT GGCCGCCGTG
GCGTCGGTCC TGGTGACGAC GGCCGGACTG ATCGCCTGGC TCCTGGTCGG CTGGGCGGCG
GTGGTCCTTG CCGTGGGCCT GATCGCCCGG CTCCCCGGAC GCTCGGGCCG CCGGGCCCGG
CGACTGCTGC CGCGGATCGC ACCGTCCGCG GTCGGACGGC TGGTGGCGGC CGCGGTGGGC
GTGTCCCTGC TCGCCGGTAC CGCGGCATGT GCCGCGCCGG CCGGGGCGAG CGCATCGGCC
CGCCCGGCGC CCGCGGTGAC CGCCGGCGCC CCCGTCAACC CCGCCGTCAG CCCCGACGTG
ACGCCGGCGC CCGGCTTCAC CACCGCCGCG CCGACCAGCT CGATCGCCAT CGACTGGCCA
ACCCCGGACC CCGCCATCCC GGGCTCGACC ACCCCGAGCC CGGCCGCGCC AAGCCAGACA
GAACCAAGCC AGACCGCACC GACCTCGACC CCCGGCCCGG TGGCCCCGGA GCCCACCAGC
GCGGCCCCGG CGAGCCCCTC GCCCACCCCG CCCGCGACGA CATCGCCGTC CCCGGTTCCC
TTCTCGCCGA CCTCGGAGTC CACCGTGCCC GGTTCACCGC CCCCCACCGC CACCGCCGGC
ATCCCGGCCC AGACCCCGGC TCCCGCGGCA CCGCCACCCG CGTCGGCGGC ACCCCCACCC
GCCCCGGCCG ATACCGGAGC GCCGGCGAGC ACCGACACGG ACACCGACAC CGACACCGGC
GGCGCGGTCA CGGTGACGGT GCAGCCGGGA GATTCGCTGT GGCGGATCGC GGCCCGCACC
TTGGGCCCCG GCGCGAGCGA CGCCGATATC GACAATTCCT GGCGAGCTTG GTATTTCACC
AACCAGCAGG TGGTCGGCGA CGACCCCGAC CAGATAGTGC CGGGCCAGCA GCTGACGGCC
CCGACGGTGG CCGGTCAGGT GCGGTCATGA
 
Protein sequence
MANDVKRKDS TAKQWSARLT MLLWVGGLAV LVTQLPPPGQ LWAAVSGSAP VTAPAPLAAV 
ASVLVTTAGL IAWLLVGWAA VVLAVGLIAR LPGRSGRRAR RLLPRIAPSA VGRLVAAAVG
VSLLAGTAAC AAPAGASASA RPAPAVTAGA PVNPAVSPDV TPAPGFTTAA PTSSIAIDWP
TPDPAIPGST TPSPAAPSQT EPSQTAPTST PGPVAPEPTS AAPASPSPTP PATTSPSPVP
FSPTSESTVP GSPPPTATAG IPAQTPAPAA PPPASAAPPP APADTGAPAS TDTDTDTDTG
GAVTVTVQPG DSLWRIAART LGPGASDADI DNSWRAWYFT NQQVVGDDPD QIVPGQQLTA
PTVAGQVRS