Gene Namu_3304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3304 
Symbol 
ID8448919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3632207 
End bp3633526 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content71% 
IMG OID645042382 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_003202622 
Protein GI258653466 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.1007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0511831 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTCA GCATTCCGGC CCTGCCCGAC CCTGATCCCA GCACGGTCCC GCTGGGCGGC 
ACCCCGGTAC AGACCACCGG CCGCTTCATC GTTGTGTTCG CCCGAGGCAC CGACGCCAAG
GCGACCCTGG CCCGGACCGC CGGCGTCGGC ACCGTGGCCG ATTCGCGCGA CTTCACCGCC
CAGGCCGTCG ATTTCAGTCA GACCGAGGGG TCGGACGCGG TCTGGTTCGA CACCCTGGGC
GTCGCCGTCG TCACGGCCGA ACCGGCTCAA CTCGGGGCGC TGCGGACCGC CGAGGCCGGT
GAGGACGCGA TCATCTCGGT CTCGCCGGAG CTGATTCACC ACATCCTGGA CGGCGACTAC
CTGCAGGGTT ACCGGGACGG GGTCAGTGAC CTGGCCGGGC GGCTCGGCGC GGTCGAGCGG
CCGGCAGGCT CCGGAGTGTC CGCGGCCGCC GCCAATCCCT CGTTCGCCGA CAACGCCCAA
TTCACCTGGG GCCTGCAGGC GACCGGAGTG TCGACCTCAC CCCAGTCCGG CGCCGGCATC
AAGGTCGCCG TCCTGGACAC CGGGTTCGAC GTGGGCCATC CCGACTTCGT CGGCCGTTCG
GTGACCACCC AGTCCTTCGT TGCCGGGGAG ACCGTCCAGG ACGGGCACGG CCACGGCACC
CACTGCATCG GTACCTCCTG CGGGTCGAAG GCGCCGGAGA CCGGACCGAG GTACGGCGTC
GCCTACGGCG CGTCGATCTA CGCCGGAAAG GTGCTGGGCG ACAGCGGCTC CGGCTCCGAC
GGGGGCATCA TCGCCGGCAT CAACTGGGCC GTGGAGAACG GCTGCCACGT GATCTCCATG
TCCCTGGGGG CGGACGTGGC CTCGGTGCAC CCGCCCTACA CGGTGGTGGG CCAGCGCGCT
CTGGACGCCG GTTCGCTGAT CGTCGCTGCG GCCGGCAACA ACGCCGACCG CCGGGTCGGC
AACTTCGGCT TCGTGGGCAC CCCGGCCAAC AGCCCGTTCA TCATGGCCGT CGGCGCCCTG
GACCAGAAGC TGGACATGGC CTACTTCTCG GCCCGCACCC TGGCCGGCAC CCGCGGCGGG
CAGGTCGACA TCGCTGGGCC CGGCTACCAG GTCTACTCGT CGTGGCTGAT GCCGACCCGG
TACAAGACGA TCAGTGGCAC CAGCATGGCC ACTCCGCACG TGGCCGGCGT CGCCGCGCTC
TGGGCCGAGC TCACCGGCTA CCGCGGCCGC GATCTGTGGG CCACCCTGGC CCAGGACTCG
CAGCGCCTGC TGCAGCCGTC GGTGGACGTC GGCGGCGGAT TGGTCCTCGC CCCGCAATGA
 
Protein sequence
MEFSIPALPD PDPSTVPLGG TPVQTTGRFI VVFARGTDAK ATLARTAGVG TVADSRDFTA 
QAVDFSQTEG SDAVWFDTLG VAVVTAEPAQ LGALRTAEAG EDAIISVSPE LIHHILDGDY
LQGYRDGVSD LAGRLGAVER PAGSGVSAAA ANPSFADNAQ FTWGLQATGV STSPQSGAGI
KVAVLDTGFD VGHPDFVGRS VTTQSFVAGE TVQDGHGHGT HCIGTSCGSK APETGPRYGV
AYGASIYAGK VLGDSGSGSD GGIIAGINWA VENGCHVISM SLGADVASVH PPYTVVGQRA
LDAGSLIVAA AGNNADRRVG NFGFVGTPAN SPFIMAVGAL DQKLDMAYFS ARTLAGTRGG
QVDIAGPGYQ VYSSWLMPTR YKTISGTSMA TPHVAGVAAL WAELTGYRGR DLWATLAQDS
QRLLQPSVDV GGGLVLAPQ