Gene Noca_2395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2395 
Symbol 
ID4599495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2551932 
End bp2553602 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content71% 
IMG OID639776998 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_923587 
Protein GI119716622 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGGTC TACGTGGTGC ACGCAAGGCG GCAGCGGCCC TCGCTGCGAG TGCGGTAGCG 
GTCCTCCTCC CGCTGGGAGG TACGACGGCG CTGGCCGCGG ACGAGTCCCC CGGCCCGGCG
GCCGTGGACG TCACCCCGCT GGAGAAGGTC AACGCCTTGG TCCAGCCGAG CGTGGTCTTC
CTGCTCCAGA CCTGGGACGG CTACGTCTAC GACACCTTCA ACAAGCAGTA CCTGAACGAC
GGCAATCCGT TCGAGCTGCA GTTCCAGTGC ACCGGCTTCG TGGTGAATCC CAACGGCTAC
ATCGCGACCG CCGGCCACTG CGTGGACTTC AAGGAGGTCG AGGGCAGTTT CGTCGAGACC
GCCGCCCAGT GGGCGCTCGC CAACGGCTAC TACAGCAGCA CCACCCTCAC CCTGGACGAC
ATCGTCGGGT TCGACGACTA CCGGATCGAG TCGAGCGAGC GCAAGAACAC CGCCGACCTG
GACATCCAGG TGGGCTGGGG TGCCTCCGTC TCCGGCATCG AGACCTCCGA GGTCAAGCGG
GCTCGCGTCA TCGACTTCGA CCGGCAGTCC AAGGGCGACG TCGCGCTGAT CAAGGTCGAG
GCCACCGATC TGAACGCCTT GCCGATGGCG ACCGACGAGG TGGACGTGGG GACCGACGTC
GTGTCGATCG GCTACCCCGC CTCGGTCGAC TCCGTCACCG ACCCCAACCT GACCCCGTCG
TTCAAGGACG GCTCGATCAG CTCGGTCAAG ACGGTCCAGG GCGGCGTACT CCCGGTCTAC
GAGATCTCCG CCGCTGTCTC GGGCGGGATG AGCGGGGGGC CGTCGGTGAA TCTCGACGGC
GAGGTGATCG GCGTCAACAG CTTCGGCATC CTCGGTGAGC CGCAGGCCTT CAACTTCCTC
CGCCCGTCCT CCCAGCTCGC GGAGCTGATG GCCGGTGCGG GCGTGACCAA CGAGCTCAGC
GAGACCACGC AGGCGTACCG AGATGGTCTC CTCGCCTACT GGGCGGGCGA CCGAACCACG
GCCGTGGACA AGCTCGGGAG CGTCGTCGAC GAGCAGCCCA CCAACAAGCT CGCCGCGGAG
TTCCTCGAGA AGGCCCAGGA CCTGCCCGAG CCGCCTCCGG CCGAAGAGTC GGACTCCGGC
CTGCCGGTGG TGCCGATCGT GATCGGCGTT GCCGTGCTGG TCCTGGTCGG CGGGGGTCTC
CTGGCCTTCC TCCTGCTGCG GCGCAAGGGC GGATCGTCGC CCGCCGCGAC CCCGCCGGCG
GCTCCGGTGG CACCCGCGAC CCCGGCGGCA CCGGCCGCTC CGCTCGGCGG ACCGTACGCC
GCGCCGTACG CGGACCCGGT GTCGAGCGCG CCCGCCGCAC CGCTCGGCTT CTCCGGCGGG
GTGACCACGG CCCCACCGCC GACCATCCCG CCGACCATCC CGCCCACCAG CCCGGCCGCG
CCTCCGCCGA CGCCGGTGCC CACGGCGTCC GTCACGCCGC CGCCCGCCGC GCCTGCCCCG
GCGCCCGCCC CGGCGTCCAC GCCGGTGTCG GCGTCGGGCC CGCTGCCGAC ACCCCCGGTG
GCCTCGGAGG AGCCGGCGGA GAAGCACGAG CCGCACTTCT GCGGGAACTG TGGGGAGCCT
GCGGAGCACG GCAAGAAGTT CTGCAGCAAC TGCGGGAGCC CGCTGGCCTG A
 
Protein sequence
MNGLRGARKA AAALAASAVA VLLPLGGTTA LAADESPGPA AVDVTPLEKV NALVQPSVVF 
LLQTWDGYVY DTFNKQYLND GNPFELQFQC TGFVVNPNGY IATAGHCVDF KEVEGSFVET
AAQWALANGY YSSTTLTLDD IVGFDDYRIE SSERKNTADL DIQVGWGASV SGIETSEVKR
ARVIDFDRQS KGDVALIKVE ATDLNALPMA TDEVDVGTDV VSIGYPASVD SVTDPNLTPS
FKDGSISSVK TVQGGVLPVY EISAAVSGGM SGGPSVNLDG EVIGVNSFGI LGEPQAFNFL
RPSSQLAELM AGAGVTNELS ETTQAYRDGL LAYWAGDRTT AVDKLGSVVD EQPTNKLAAE
FLEKAQDLPE PPPAEESDSG LPVVPIVIGV AVLVLVGGGL LAFLLLRRKG GSSPAATPPA
APVAPATPAA PAAPLGGPYA APYADPVSSA PAAPLGFSGG VTTAPPPTIP PTIPPTSPAA
PPPTPVPTAS VTPPPAAPAP APAPASTPVS ASGPLPTPPV ASEEPAEKHE PHFCGNCGEP
AEHGKKFCSN CGSPLA