Gene Namu_2158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2158 
Symbol 
ID8447769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2378419 
End bp2379942 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content74% 
IMG OID645041281 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003201525 
Protein GI258652369 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0489221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0113989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGGCACT CCCTGACGGT GCCCGAGAAG GACAATCGGT CCGTGCCGAA CGGATCACCC 
ACCAACGGAT CGCCCAGCAA CGGCGTGCGC CGCCCGCCCG CCCCGGTCCC GTCGCCCGCT
CCCGACCCCG TGCCGGCCGG CGCCTACGCC CGGCCGGGTG GGGTGAACGG TTCCTTCGGT
CCCCGGCCCC AGCGCGGGCC GGTCGTCCTC GGTGCGCCGC CGGTGCCCGA ATACCTGGCC
GATGCCTTCG GACGCGCACC CGGCCAGAGC GGCATCGGGG CCCCACCACC CACCGCCGAA
CCACCGGCCG AGCCGCCGGC CCAGGACCCC TGGCGGGACC CGGCATCCGG CGCCCAGCTG
GGCGCCCCGG CCCTGACCCG CGAGTCGGCA CCCCCGCCGC CGGCCCCGCC GGCGCAGCGA
TTCACCCTGC GCCAGGCCCT GTTCGAGCGC CGGCTGCGCC CCTCGGCGAT CATCGGCATC
CTGGTCAGCG CCCTGGTCAT CGGCGCGCTC GGCGCCGGCA TCGGGGTCTT CGCCGCCGGC
CGGCTGCCCG CGCCGACCAC CGATCCCTCC TTCGAACTGG CTCCGGTGTC ACCGGGCATC
ACCCGGGAGC CGGGATCGGT GGCCGACATC GCGGCCAAGG TCCTGCCGTC CGTGGTGTCG
TTGGAGATCC GCACCGGTGA CGTGGGGGAG ACCGGCTCCG GCGTGGTCAT CGACGGCAGC
GGCTACATCC TGACCAACAA CCACGTCGTG TCCAGCGCGG CGACCGACCC GTCGGCGACG
CTCACCGTGA TCTTCGACGA CGCCGCGCAG AGCCGGGTGC CCGCGGTCAT CGTCGGTCGC
GACCCGCTGA CCGACCTGGC CGTGATCAAG GTCGACGTCA CCGACGCCAC GGTCGCGCAG
ATTGGCGACT CCAATGCGCT TGCCGTGGGC GATCCGGTCA TCGCCATCGG CTCGCCGCTC
GGCCTGGCCG GCACCGTCAC CACCGGCATC GTCTCGGCCA AGAACCGCCC GGTGCGGCTG
CAGGGCGGCG GGTCGGACAC CGACGCGGTG ATCGACGCCA TCCAGACCGA CGCCGCGGTC
AACCCGGGCA ACTCCGGTGG CCCGCTGGTC GACGCCTCCG GCGCCGTGGT CGGCATCAAC
TCGGCCATCC GCACCCTCGG CGGGGACTCC AGCGGCTCCA TCGGGCTGGG CTTTGCCATC
CCGATCGCCA CGGCCAAGGA CGTCGCCGAG CAGATCATCC GCTCGGGCAG CGTGCAGCAC
TCGACCATCG GGGTGAACGC CCGGTCGGCC ACCGACGGGA TCACCGACGG GGCCCAGGTG
CAGAACGTCC AGGGCGGCGG GCCGGCCGCG GCGGCCGGCA TCGCCGAGGG GGACGTGATC
ACCAAGGTGG GGGACCGGCA GGTGGGCAAC GCGGACGAGT TGATCGTCGC GGTCCGGCAG
AACCCGGTCG GGGCCACGGT CCCGGTGGTG CTGCTGCGCG ACGGGCGGGC GATGACCGTC
TCGGTCACGC TCGGCTCGGA GTAA
 
Protein sequence
MGHSLTVPEK DNRSVPNGSP TNGSPSNGVR RPPAPVPSPA PDPVPAGAYA RPGGVNGSFG 
PRPQRGPVVL GAPPVPEYLA DAFGRAPGQS GIGAPPPTAE PPAEPPAQDP WRDPASGAQL
GAPALTRESA PPPPAPPAQR FTLRQALFER RLRPSAIIGI LVSALVIGAL GAGIGVFAAG
RLPAPTTDPS FELAPVSPGI TREPGSVADI AAKVLPSVVS LEIRTGDVGE TGSGVVIDGS
GYILTNNHVV SSAATDPSAT LTVIFDDAAQ SRVPAVIVGR DPLTDLAVIK VDVTDATVAQ
IGDSNALAVG DPVIAIGSPL GLAGTVTTGI VSAKNRPVRL QGGGSDTDAV IDAIQTDAAV
NPGNSGGPLV DASGAVVGIN SAIRTLGGDS SGSIGLGFAI PIATAKDVAE QIIRSGSVQH
STIGVNARSA TDGITDGAQV QNVQGGGPAA AAGIAEGDVI TKVGDRQVGN ADELIVAVRQ
NPVGATVPVV LLRDGRAMTV SVTLGSE