Gene Noca_0343 details

Gene Information

Locus tag: Noca_0343
Symbol:
ID: 4597980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 364136
End bp: 365311
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 72%
IMG OID: 639774958
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_921574
Protein GI: 119714609
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
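
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(364136, 365311)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both results agree with the reported values of 1176 bp and 391 aa.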


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACCTGC TCGACTGGCT CCTGGTCGTG CTCGTGCTCG CCTACGCGCT CTCCGGCTAC 
TGGCAGGGCT TCATCACCGG TGCCTTCGCG ACCGGGGGGC TGCTGCTCGG CGGACTGTTC
GGCGTGTGGC TGGCCCCGGT CGCACTGGGT GACGCCAACC CCTCCCTGAT GGTCTCCCTG
GGCGCGCTGT TCATCGTGAT CCTGTCCGCG TCGCTGGGGC AGGCCGTGCT CCAGTTCGCC
GGCGCCCGGA TCCGCGAGCG GATCACCTGG CAGCCGGCCC GCGCCCTCGA CGCGGTCGGC
GGCGCCATGC TCAGCGCCGT GGCGGTCCTC GTGGTCGCCT GGGCGCTCGG CGTCGCGATC
TCGGGGTCTC GGATCGGCGG CGTCACCCCG CTGGTGCGGG GCTCGACCGT GCTCTCCCAC
GTCGACGAGG TGATGCCCGC CAGCGCCGAC GGCGCGCTGC AGGCGTTCAA CGACGTCGTC
GGCACCAGCT TCTTTCCCCG CTACCTCGAG CCGTTCGCGC CCGAGCGGAT CGTCGAGGTC
GGACCCGGCC CCAAGCGGCT GCTCAACGAC CCCGACGTCG AGCGCGCCGG GTCGAGCGTC
CTCAAGATCC GCGGCACCAA CGAGTGTGGC CGGGGTGTCG AGGGGTCCGG GTTCCTGTAC
GCCGGCAACC GGCTGATGAC CAACGCCCAC GTCGTCGCCG GGATCGACGA CCCCGAGGTC
ATCGTCGGCG ACGAGTCGGT CCCGGCGGAC GTCGTCTACT ACAACCCCGA CATCGACGTG
GCGGTGCTCT CCTTCGACAG CGGGGACCTG CCGGCTCTGC GCTTCGACCG CGACGCCGGC
GCGCCCGACG GTGTCGCGAT CCTGGGCTAC CCGCAGGACG GGCCCTACCA CGTGGAGCCC
GCCCGGATCC GCTCCGAGCA GCGGCTCCGC TCACCCAACA TCTACGGCGA CGGCGCGGTG
ATCCGTGAGG TCTACTCCCT GCGCGGGCGG ATCCTGCCGG GCAACTCCGG CGGGCCGATC
GTGTCCTCGG CCGGCGACGT CGTCGGCGTG GTGTTCGCCG CCTCGGTCAC CGACCACGAA
ACCGGCTACG CGCTGACCGC CGGACAGGTC TCCGCCGCCG CGGCCGCCGG CCTGACCAGC
TCGAGCCAGG TGTCGACCGG CGGTTGTGCT GGGTGA
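
The record reports a GC content of 72% for this gene. As a minimal sketch of how such a figure could be recomputed from a sequence laid out in 10-base blocks like the one above, the hypothetical helper below strips whitespace before counting. It assumes the input contains only base letters and whitespace.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for block layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence only, as a short illustration.
example = "GTGAACCTGC TCGACTGGCT"
print(round(gc_fraction(example) * 100))  # 60
```

Passing the full gene sequence as a single string gives a value to compare with the reported figure.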
 
Protein sequence
MNLLDWLLVV LVLAYALSGY WQGFITGAFA TGGLLLGGLF GVWLAPVALG DANPSLMVSL 
GALFIVILSA SLGQAVLQFA GARIRERITW QPARALDAVG GAMLSAVAVL VVAWALGVAI
SGSRIGGVTP LVRGSTVLSH VDEVMPASAD GALQAFNDVV GTSFFPRYLE PFAPERIVEV
GPGPKRLLND PDVERAGSSV LKIRGTNECG RGVEGSGFLY AGNRLMTNAH VVAGIDDPEV
IVGDESVPAD VVYYNPDIDV AVLSFDSGDL PALRFDRDAG APDGVAILGY PQDGPYHVEP
ARIRSEQRLR SPNIYGDGAV IREVYSLRGR ILPGNSGGPI VSSAGDVVGV VFAASVTDHE
TGYALTAGQV SAAAAAGLTS SSQVSTGGCA G