Gene Namu_5026 details

Gene Information

Locus tag: Namu_5026
Symbol:
ID: 8450657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5605607
End bp: 5606887
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 73%
IMG OID: 645044063
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003204287
Protein GI: 258655131
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
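
The coordinates and lengths listed above are mutually consistent, which can be checked with a short calculation. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5605607
end_bp = 5606887

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (as stated) and ends with one stop codon,
# which is counted in the gene length but not in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1281 426
```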


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAGT CGTTGGGGTC GATGATGCTC GCGGCATTGG ACGAGGCCGG CCCCGGAGCC 
GACCCGGAGT TGCCGGTCGT GCTGCAGATC GACGCACTCT CGCCGGGGGC CGACGAGTCG
TGGGGGCACT TCAAGGATCG GGTCGGAGAC CGGCTCGGCC GGCAGACCGA CCTGCTCCGG
GACCGGATCG GCGTGGGCGA CGTGCGCGAG CTGTACGCCG GCAACGCACT GGCCGCCTCG
CTGACCACCG AGCAACTGGC CGCCGTCATC GACGATCCGC AGATCGCCAT CCTGTTCGCC
GATCTCGATC CGGTGCTGCC GGTGGTGCTG ATGCACGAGG TGCACGAGCT GGTCGGCGCC
CCCGCCTTCC GCCGTGCGGG CGGTGGGCTG ACCGGCGCCG GGGTGAGCGT CGCGGTGCTC
GACACCGGCA TCGACCGGCG GCATCCGGCG CTGACCGTCG CGCACAGCAT CCAGACCTGC
GACGAATCCG TCGACATCCC CGGCCACCAC GGCACGCACT GCGCCGGCAT CATCGCCTCG
ACCGACCCGC GGGCGCCCGG CATCGCCCCC GGTGTGGACC TGATCGACGT GAAGGTGCTG
CGGGCCAACG GAACCGGCCG GCACACCGAC ATCACCGCCG GTGTGGACCG GGCCCTGGAC
CGCGCGGCCG ACATCCTGTC CATCAGCCTG GGGTTCAACC ACCTCCCGAT CAGCGTGCCC
GGCGGTCACG GCTGGACCTG CGTGGACGGC GCCTGCCCGC TGTGCACGGC GGTGGACAAC
GCGGTGCTGG AAGGCGCGCT GGTGGTGGTG GCCGCCGGCA ACGAGCATCA GCGCTGCGAA
GGGGTGCGCT CGGCCGGGCA GGGACTGGTC TACGACACCG AGCTGAGCTG CCCCGGCCAG
GCCCGCGGCG CCCTCACGGT CGGCGCCACG CACAAGGCCA CGCATGCGCC GGCCCGCTTC
TCCAGCAACG GGCCGACCGC CTACGACTCG GGCAAGCCGG ATCTGGTCGC CCCCGGGGTG
GACGTCCGGT CCACCGTGCC GCTCCCGCCG GCCAGTCCCG GCGGATCGGC GGTGGCCCCG
CCGCCGTTCG GGATGAAGAG CGGCACCTCG GTCGCCGCCC CTGTAGTCGC CGGGGCCTGC
GCCCTGCTCA TCGAGTCGGC CCGCCGGACC GGCGCCCCCG ACGACCCGGC CGCCATTCGT
CGCATCCTGC TGGACACGTG TGTCGAGCGG ATCGGTGGTC CGGCCAACGT TGTCGGGGCC
GGGCGGCTGC GGCTGCCGTG A
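
A GC content figure like the one reported for this gene can be computed from the sequence text. The sketch below uses a hypothetical helper, `gc_content`, that ignores the spaces separating the 10-base groups; how the source database rounds its reported value is not stated, so the helper simply returns an unrounded percentage. It is applied here to the final row of the gene sequence only.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # drops spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# Final row of the gene sequence above, with its spacing left in place.
last_row = "GGGCGGCTGC GGCTGCCGTG A"
print(round(gc_content(last_row), 1))  # 81.0
```

Passing the full gene sequence, with all rows joined, gives the gene-wide value.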
 
Protein sequence
MIESLGSMML AALDEAGPGA DPELPVVLQI DALSPGADES WGHFKDRVGD RLGRQTDLLR 
DRIGVGDVRE LYAGNALAAS LTTEQLAAVI DDPQIAILFA DLDPVLPVVL MHEVHELVGA
PAFRRAGGGL TGAGVSVAVL DTGIDRRHPA LTVAHSIQTC DESVDIPGHH GTHCAGIIAS
TDPRAPGIAP GVDLIDVKVL RANGTGRHTD ITAGVDRALD RAADILSISL GFNHLPISVP
GGHGWTCVDG ACPLCTAVDN AVLEGALVVV AAGNEHQRCE GVRSAGQGLV YDTELSCPGQ
ARGALTVGAT HKATHAPARF SSNGPTAYDS GKPDLVAPGV DVRSTVPLPP ASPGGSAVAP
PPFGMKSGTS VAAPVVAGAC ALLIESARRT GAPDDPAAIR RILLDTCVER IGGPANVVGA
GRLRLP
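
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in which start codons it permits, and this gene begins with ATG). Only the first 30 bases and the final row of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30_bases = "ATGATCGAGT CGTTGGGGTC GATGATGCTC"
last_row = "GGGCGGCTGC GGCTGCCGTG A"  # starts at base 1261, so it is in frame

print(translate(first_30_bases))  # MIESLGSMML
print(translate(last_row))        # GRLRLP
```

Both results match the start and end of the protein sequence listed above.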