Gene Namu_2239 details


Gene Information

Locus tag: Namu_2239
Symbol:
ID: 8447850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2466552
End bp: 2469851
Gene Length: 3300 bp
Protein Length: 1099 aa
Translation table: 11
GC content: 71%
IMG OID: 645041361
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003201605
Protein GI: 258652449
COG category:
COG ID:
TIGRFAM ID:
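
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
gene_len = gene_length_bp(2466552, 2469851)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

With the values from this record, the computed lengths agree with the listed Gene Length (3300 bp) and Protein Length (1099 aa).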


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00131913
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.475839
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGACGG CATCGCTGCT GGCCAGCGCC CTGCTGCCGG CCACCGCCGG GGCCGAGCCG 
GCCGACGCCG GAAACCTCAC CGGCACCGCG ATCGAGGTCG CGGACCGGGT TTCCGTGCCC
AAGGCCGCCA CCAGCCGTTT GGCCGAAAGC GATCCCGCGG TGCTGGCCGC CACCGGCACG
GACACCGTGC CGGTCATGGT CAAGCTCGAC TACGACTCGG TGGCCACCTA CACCGGCGGC
GTCGCCGACC TGGCCCCGAC CAGCCCGGCC GAGACGGACA AGAGCCTGCA GGACAACGCC
GGCGCGGTGC AGGAGTACGA GCAGCACGTC GCCGGGGTCG AGGACTCGTT CGTCGCCGAT
CTGGCCGCCG CCGTCCCCGA CGCCGAGGTC GGAACCCGGC TGCGGACGGT GTACGGCGGT
GTCTCCGTCC GGCTGCCGGC CGACCAGGCC AAGAACCTGC TCGACATCCC CGGTGTCGTC
GCGGTCCAGG CCGACCACCT CAACCAGCCA CTGACCGACT CCAGCCCGGC GTTCATCGGC
GCGCCCACCA TCTACAACGC CTTGGGCTCC TCGACCACGG CCGGTTCCGG GGTGATCGTC
GGCGTGCTGG ACAGCGGCGC CTGGCCCGAG CATCCGTCCT TCGCCGATCC CGGCCTGCCG
GCGCCACCGC CGCTGGCCAA CGGCCAGGCT CGCGTGTGCG ACTTCGGGGA CAACCCGCTC
ACCCCGGCGA ACGACGTGTT CGTCTGCAAC AACAAGCTGA TCTCGGGCCA GCCGTTCCTG
CAGACCTACA ACGCCAACAA CTCCGGTGAG ATCTATCCGG ACTCGGCCCG GGACAGCAAC
GGGCACGGCA CCCACACCGC ATCGACCGCG GCCGGCTCAC CGGTGGCGGA TGCCCAGACC
CTCGGGGTCA GTCGCGGGCC GATCCAGGGC ATCGCCCCGG GCGCGCACGT CGCCGTCTAC
AAGGTCTGCG GCGCCCAGGG CTGCTACAGC TCGGACTCCG CCGACGCCGT GCAGCAGGCC
ATCAACGACG GCGTCGACGT CATCAACTTC TCCATCTCCG GGGGCACCAA CCCGTTCACC
GATCCGGTCG AGCTGGCCTT CCTGGACGCC TACGGGGCCG GCGTGTTCGT CTCCACCTCG
GCCGGCAATG ACGGCCCCGG CGCCGGCACG GCCAACCACC TGGCGCCCTG GGTGACCACG
GTGGCCGCCT CGACCCAGAC CCGGGCCTTC CAGTCCACGC TGACCGTCAC CGGCAGCGGG
GCCGCGCCGT TCGTCGCCAC CGGCGCCTCG ATCACTGCGG GGGCCGGACC GGCACCGGTG
GTGCTGGCTT CCAGTGCGCC CTACGGCAAC GCGTTGTGTT CGGCGCCGGC CCCGGCCGGC
CTGTTCGACG GCAAGATCGT GGCCTGCCAG CGCGGCGGCA ACGCCCGGGT CGACAAGGGC
TACAACGTCA AGCAGGGCGG CGCGGCGGGA ATGATCCTGT ACAACCCGTC CCTGGCCGAC
GTCGAAACCG ACAACCACTG GCTGCCGACG GTGCACCTGG CCGACGGGAC CGCCTTCCTG
GCCTACCTGG GCGCCCACCC GGACGCCACC GCCAGCTTCA CCGCCGGGGT CAAGGCCAAC
GGCCAGGGCG ACGTGATGGC GGCGTTCTCC TCCCGCGGAC CGGCCGGCTC GTCGATCAAG
CCGGACCTGA CCGCCCCCGG CGTGGAGATC CTGGCCGGGC AGACGCCCAC TCCGGAGTCG
ACCACCGAGG GACCCCCCGG CCAGTACTTC CAGGCCATCG CCGGCACCTC GATGTCCTCG
CCGCACGTCG CCGGTTCGGC CGCCCTGCTC CGGGCCCTGC ACCCGGACTG GACGCCGGGC
CAGATCAAGT CGGCCCTGAT GACGACCGCG ATCACCAAGG TGGTCAAGGA GGACACCACC
ACCGCGGCCG ATCCGTTCGA CATGGGGGCC GGACGGATCG ACCTGACCAA GGCGGGCAAC
CCCGGTGTCA CCTTCGACGA ATCCGCCCGC GGGTTCCTCA AGGCCGGGCA GAACCCCGTC
GCCGTTGCCG ATCTGAACCA GCCGTCGATC GACGTGCCGA CCCTGCCGGG CCTGGTGACG
GTCAAGCGGA CCGCCACCAA CGTCACCGGC GGCACCATCC GCTACACCAC CAGCGCCGCG
TCGCCCGACG GCTCGACCAT CACCGTCCGC CCGCGCAGCT TCACCTTGCG CAAGGGACAG
TCGGTGGAGC TGTCGGTCAC CATCGACGCC ACGAAGCTGG CGCCCGGGAC CACCGCGGCC
CAGTTCGGCC AGATTCAGTT GACCGAGATC GGCGGCAAGG CCCGCGATGC GCACCTGCCG
GTCGCGTTCA CCCCGGGATC GGCGGCGGTG ACCCTGACCA ACACCTGCGA CCCGAGCACG
ATCACGCTGC GCCCGGTCAC CGCGTCCACC TGCACGGTGA CCGCGGCCAA CACCGGCGCC
GCCCCGGTCA CCGTCGACCT GCGGACCTCG GTGTCCAACC GCCTGCGAAT CACCGGCGTC
CAGGGGGCCA CCCGGGACAG CACCACCTCG GTCGCCCTGT CCAAGGTGAC CCTGGCCGGC
AACCGACCGG GTACGCCGTC CATCGCGAGC GGCCGATCGC CGGTCGGTTA CCGCGCGCTG
GCCAGTTTGG GCATCGCCCC GCAGGCCATC GGCGACGAGG AGATGGCGAA CTTCACGGTG
CCCGCGTTCC AGTACAACGG CCAGTCGTTC ACCTCGATCG GGGTCAGTTC CAACGGCTAC
CTGGTGGTCG GCGGCGGTAC GTCGGAGGAC CAGGCCTACG AACCGCAGAC CCTGCCCGAC
CCGGCCAAGC CGAACAATGT GCTGGCGCCG TTCTGGACCG ACCTGGACGG CACCGGGGCG
CCCGGCATCT CGGTCGCCTC GCTGACCGAC GGCAAGCACA GCTGGGTCGT CGTGCAATGG
CAGGTGAAGG TCTTCGGCAC CGCCAATGCC CGCACGTTCC AAACCTGGAT CGGCGTGAAC
GGCACGCAGG ACATCTCCTT CGCCTACGAC CCGGACAACC TGCCGGCTGC TCCGGGCAGC
CAGGCGCTCA CCGTCGGCGC GGAGAACAGC GACGGCTCGG GTGGTGGACA GATCAGCGGG
CTGCCCACCG GCGATCTGGT GGTCACCAGC ACCGACCCAC AGCCGGGCGG CAGCGTGAGC
TACACCTTGG AGGTGCAGGG CGCCTCCACC GGGACCGGCA CGGTGACCTC GACCATGACG
TCGCCGAGCA TCCTGGGCAC GACGGTGAAG GAAGCCACGA TCACGGTGAA CCGGCGGTAG
 
Protein sequence
MVTASLLASA LLPATAGAEP ADAGNLTGTA IEVADRVSVP KAATSRLAES DPAVLAATGT 
DTVPVMVKLD YDSVATYTGG VADLAPTSPA ETDKSLQDNA GAVQEYEQHV AGVEDSFVAD
LAAAVPDAEV GTRLRTVYGG VSVRLPADQA KNLLDIPGVV AVQADHLNQP LTDSSPAFIG
APTIYNALGS STTAGSGVIV GVLDSGAWPE HPSFADPGLP APPPLANGQA RVCDFGDNPL
TPANDVFVCN NKLISGQPFL QTYNANNSGE IYPDSARDSN GHGTHTASTA AGSPVADAQT
LGVSRGPIQG IAPGAHVAVY KVCGAQGCYS SDSADAVQQA INDGVDVINF SISGGTNPFT
DPVELAFLDA YGAGVFVSTS AGNDGPGAGT ANHLAPWVTT VAASTQTRAF QSTLTVTGSG
AAPFVATGAS ITAGAGPAPV VLASSAPYGN ALCSAPAPAG LFDGKIVACQ RGGNARVDKG
YNVKQGGAAG MILYNPSLAD VETDNHWLPT VHLADGTAFL AYLGAHPDAT ASFTAGVKAN
GQGDVMAAFS SRGPAGSSIK PDLTAPGVEI LAGQTPTPES TTEGPPGQYF QAIAGTSMSS
PHVAGSAALL RALHPDWTPG QIKSALMTTA ITKVVKEDTT TAADPFDMGA GRIDLTKAGN
PGVTFDESAR GFLKAGQNPV AVADLNQPSI DVPTLPGLVT VKRTATNVTG GTIRYTTSAA
SPDGSTITVR PRSFTLRKGQ SVELSVTIDA TKLAPGTTAA QFGQIQLTEI GGKARDAHLP
VAFTPGSAAV TLTNTCDPST ITLRPVTAST CTVTAANTGA APVTVDLRTS VSNRLRITGV
QGATRDSTTS VALSKVTLAG NRPGTPSIAS GRSPVGYRAL ASLGIAPQAI GDEEMANFTV
PAFQYNGQSF TSIGVSSNGY LVVGGGTSED QAYEPQTLPD PAKPNNVLAP FWTDLDGTGA
PGISVASLTD GKHSWVVVQW QVKVFGTANA RTFQTWIGVN GTQDISFAYD PDNLPAAPGS
QALTVGAENS DGSGGGQISG LPTGDLVVTS TDPQPGGSVS YTLEVQGAST GTGTVTSTMT
SPSILGTTVK EATITVNRR
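
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is an accepted alternative start codon and is read as methionine at the initiation position. The following is a minimal sketch of that translation step, not the database's own pipeline: `translate_cds` is a hypothetical helper, the amino-acid assignments are those of the standard genetic code (which table 11 shares), and the start-codon set is the one listed for table 11 by NCBI.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons recognised by translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A recognised start codon in the first position is read as M.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First six codons of the gene sequence above.
print(translate_cds("GTGGTGACGG CATCGCTG"))
```

Applied to the first 18 bases of the gene sequence, the sketch yields MVTASL, matching the start of the listed protein sequence. Note that the second GTG is read as V because only the initiating codon is treated as methionine.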