Gene Namu_4674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4674 
Symbol 
ID8450304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5193851 
End bp5195122 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content73% 
IMG OID645043715 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003203940 
Protein GI258654784 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACAAG CCGTGCAGCA CCGTGGCCGG AAGCCTTTGA TGGCGGCGGG AGCGTTGGGG 
GTGGCCGCCG CGCTGGGCCT GGCCGCCGCA TTCGGCGGCA CCACGGCGGT GGCCGGCACC
CCGACGGCGG TCCAGGGTCT CGCCGCGACC GCCTCGCTGC CCAACCCGTA CGGCGGCGGG
TCCTCCTCCG GCTCCGGCGG GTACGGCGGG TACGGCGGCT ACGGGGTGGA TCCGTTCGGC
GGATCGGGGG GCTCCACCGG CAACGGCTAC TCCGGCGGTT CGTCGGGCAA CTCGTCCGGC
AGTTCGGGTC AAGCGCCGAC GGCGAACCAG CAGGTCGGGC TGGTCTACAT CGACACCGTG
CTGGGCTACC AGGACGCCGC GGCGGCCGGC ACCGGGCTGG TGCTGACCTC GAACGGGCAG
ATCCTGACCA ACAACCACGT GATCGAGGGC TCGACGTCGA TCACCGTCAC CATCGTGACC
ACCGGCCAGA CCTACCAGGC CTCGGTCGTG GGCACCGACG TGCAGGACGA CATCGCGGTG
CTGCAGCTCG GTGACGCCTC CGGCCTGACC ACCGCCAACT TCGGCACCTC CGCCGACCTG
CAGGTCGGCG AGTCGGTGGT CGGCGTCGGC AACGCCGGCG GCGACGGGGG AACCCCGTCC
GCGGCCGCCG GGGTGGTCTC CGCGTTGAAC CAGACCATCA CCACCCAGGC CGAGGGCTCG
GCCGCCGGTG AGACGCTGAC CGGGCTGATC GAGACCACCG CCGACATCCA GTCCGGCGAC
TCCGGTGGGC CGCTGTTCGA CGCCAACGAC GAGGTCGTCG GCATCGACAC CGCCGCCGAG
GTGGTCGCCG GGCAGAGCGA GAACGGTTAC GCCATCCCGA TCGACACCGC CCTCGGCATC
GCCAAGCAGA TCATCACCGG CGACGAGTCC GACGGGGTCC GGATCGGCTA CCCGGCCTTC
CTCGGCGTGC AGGTGCAGTC CGCGGCCACC GCCGGCACGG GCACCGGCCG GAGCCGGTCC
GGATCGAGCA GCACCGGCAC CGCCGGCGCG CTGATCGGCG GCGTCGTGTC GGGCTCGCCG
GCCGAGCAGG CCGGATTGAC CGCCGGGGAC ACGGTGACTG GCATCGACGA CAGGGCCGTC
AGCAGTGCCG ACGGGCTGAG CACCGCGCTG GCGGCGCACA ACCCGGGCGA CCGGGTGAGC
GTGACCTGGG TGGATTCGTC CGGGGCGAAG CACTCGGCCA CCGTCACCCT GGCCCAGGGT
CCGGCCGCCT GA
 
Protein sequence
MEQAVQHRGR KPLMAAGALG VAAALGLAAA FGGTTAVAGT PTAVQGLAAT ASLPNPYGGG 
SSSGSGGYGG YGGYGVDPFG GSGGSTGNGY SGGSSGNSSG SSGQAPTANQ QVGLVYIDTV
LGYQDAAAAG TGLVLTSNGQ ILTNNHVIEG STSITVTIVT TGQTYQASVV GTDVQDDIAV
LQLGDASGLT TANFGTSADL QVGESVVGVG NAGGDGGTPS AAAGVVSALN QTITTQAEGS
AAGETLTGLI ETTADIQSGD SGGPLFDAND EVVGIDTAAE VVAGQSENGY AIPIDTALGI
AKQIITGDES DGVRIGYPAF LGVQVQSAAT AGTGTGRSRS GSSSTGTAGA LIGGVVSGSP
AEQAGLTAGD TVTGIDDRAV SSADGLSTAL AAHNPGDRVS VTWVDSSGAK HSATVTLAQG
PAA