Gene Namu_3765 details

Gene Information

Locus tag: Namu_3765
Symbol:
ID: 8449384
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4131113
End bp: 4132117
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 76%
IMG OID: 645042816
Product: peptidase M48 Ste24p
Protein accession: YP_003203052
Protein GI: 258653896
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0501] Zn-dependent protease with chaperone function
TIGRFAM ID:
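
The length fields above are consistent with the listed coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the helper names are illustrative, not part of the database):

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 4131113
END_BP = 4132117


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end inclusive."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1005
print(protein_length(length_bp))  # 334
```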


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.511777
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.498171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGCGG ACCTGGCCCC GGCCGCGGGG CCGACGCCGG TCGCCGCGAT CGGCTGCCTG 
CTGGTCCTGA TCGGGGTGAC GCTGGCGGCG CCGGTCAGCC TGGCGCTGGC CCACGCCCGC
TGGCCCCGGC GCGCGCCGGC CTGCAGCCTG CTGCTGTGGC AGGGGGTCTG CCTGGGCGCG
GGATTCTCGG TGGTGGCCGG GCTGGCCCTG CTGGCCGTCG AGCCGTCCGG GCACACCCTG
GTCGGCGCCG GGCTGAGTTG GTGGGGGTCG CTGTTCGCGG GGGAGCTGGC GCCCTGGCGC
GCGGTCTGCG GGCTGCTCGG GCTGGCCGTC GCCGTGCTGT TGCTGACCAC CCTGATCCGC
ACGGCCTGGC GCACGGTGCG CCGGCGCCGC GGGCACCGAC AACTGCTGGA TCTGCTGACC
CGACCGATCC CGCCCCGGGT GCCGATCGGC GCCGGCTGCC AGGTCCGGTT GATGGATCAC
CGCAGCGCGG TCGCGTACAC CCTGCCCGGT CTGCACGCCC GGCTGGTCGT CTCCACCGGG
CTGGTCGACC TGCTCACCCC GGTCGAGCTG GCCGCGGTGG TCGAACACGA ACGGGCCCAC
CTCCGGGCCC GGCACGACAT CCTGATCCTG CCGTTCCAGT CCTGGGTGGC CGCAGTCGGC
CGGGTGCGTG GGGTGCGGGA GGCCGGGCGG TCGGTGACGG AGTTGACCGA GATGCTCGCC
GACGACGTCG CGCGGGCCCG GTCGGCCCCG GGCGCATTGG CCTCGGCGCT CACCCGGGTC
GCCCTGGAGG GACGGACCCG GGCCGGTACC GACGGCCCCG ACACCATCGG CGGCCAGGCC
GCCGGAAGTG ACGCGCATCG GGCCGGATCG GGGACGTGCG TGACCGACCG GGTGCGGCGG
ATGAACGATC CGCGGCCGTT GCCGCTGTTC GCCCGGGCCT TGATCGTGCT CACGGCGGCC
GGCCTGCTGG CGATCCCGGC CACCGTGCTG ACCCTGACCT GGTAG
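
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one way to do it, assuming GC content means the fraction of G and C bases over all A, C, G, T bases; the function name is illustrative, and the example call uses only the first ten bases of the sequence rather than the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A/C/G/T characters in seq.

    Spaces and line breaks (as in the 10-base blocks above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence: 5 G + 1 C out of 10 bases.
print(gc_fraction("ATGGATGCGG"))  # 0.6
```

Passing the whole gene sequence to the same function and multiplying by 100 gives a percentage that can be compared with the listed value.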
 
Protein sequence
MDADLAPAAG PTPVAAIGCL LVLIGVTLAA PVSLALAHAR WPRRAPACSL LLWQGVCLGA 
GFSVVAGLAL LAVEPSGHTL VGAGLSWWGS LFAGELAPWR AVCGLLGLAV AVLLLTTLIR
TAWRTVRRRR GHRQLLDLLT RPIPPRVPIG AGCQVRLMDH RSAVAYTLPG LHARLVVSTG
LVDLLTPVEL AAVVEHERAH LRARHDILIL PFQSWVAAVG RVRGVREAGR SVTELTEMLA
DDVARARSAP GALASALTRV ALEGRTRAGT DGPDTIGGQA AGSDAHRAGS GTCVTDRVRR
MNDPRPLPLF ARALIVLTAA GLLAIPATVL TLTW
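
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1; it does not model the alternative start codons that table 11 allows, which is not an issue here because the gene starts with ATG. The names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGATGCGG ACCTGGCCCC GGCCGCGGGG"))  # MDADLAPAAG
```

The output matches the first ten residues of the protein sequence; the same call on the full gene sequence can be compared with the full 334-residue protein.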