Gene Noca_0884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0884 
Symbol 
ID4599891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp922326 
End bp923579 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content72% 
IMG OID639775485 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_922094 
Protein GI119715129 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.63894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACC AGCCCACCAG TCCGCCTCCG GGACCGCCGA ACCCGTACTT CTCCACCCCG 
GCGCCGCTGG GTCCGCCTCC CCAGGGCCCG CCTCCCGGGG TGGCGCCGGA CCCGCAGCCG
GGGCAGCGTC GTCCTCGCCG GACCGGCCTC GCCGCCGCGG TCGTCGCGAC CGCGCTGGTG
GCCGGCGGCG CGGCAGGGGT CGGCGGCGCC GCCGCCTGGA GCGCGCTCGA CGATGGAGGC
TCGTCGAGCG CGGGCGGCCC CAGCAGCCGC ACGACGGCGC AGGTCGTCGA CACCCCCGAC
TCCGAGGCAC CGGCCGGCTC CGTCGAGCAG GTCGCTGCCA AGGTGCTGCC TTCGGTGGTC
AAGATCGACG TGGCCGGCGC CCAGGGCGCC GGTTCGGGCT CGGGGATCAT CCTCAGCTCC
GACGGCGAGA TCCTCACCAA CAACCACGTG GTGGAGCTCG CCGGCGACAA CGGGTCGATC
CGGGTCTCCT TCAACGACGG CTCCACCGCG AAGGCCGAGA TCCTCGGCAC CGACCCCCTG
ACGGACACCG CGGTGATCAA GGCCCAGGAC GTCTCCGGGC TGACGCCCGC GACGATCGGG
AAGTCGGGCG ACCTCAAGGT CGGCGAGAGC GTCGTGGCGA TCGGGTCCCC GTTCGGGCTC
GACTCGACGG TGACCAGCGG CATCGTGAGT GCGCTGGACC GGCCGGTGGA CGTCGGCTCC
GACGGCCAGG GCAACAGCAC GACGTACCCC GCGATCCAGA CCGACGCTGC GATCAACCCG
GGCAACAGCG GCGGCGCGCT CGTCGACCTC GACGGCAACG TCGTCGGCAT CAACTCCTCG
ATCCGCACCG CCAGCTCCAT GGAGGGGCAG GCCGGCTCGA TCGGGCTCGG CTTCGCCATC
CCGATGGACG AGGTGATGCC GATCGTCGAC CAGATGGTCA ACGGCGAGAC CCCGACCCAC
GCCCGCCTCG GCATCTCCGT CTCCGACGTC GCGAGCCGGC CCGGAGCCGA GGTGACCGAG
GGCGCCGAGG TCCAAGACGT CAACGCCGGC TCGACCGCGG ACGACGCCGG CCTGGCGAAG
GGCGACATCA TCACCAAGGT CGACGACCAG CTGATCAGCG GCGCCGACTC CCTGGTCGCC
ACCATCAGGT CCTACCGGCC CGGCGACGAG GTCACCGTCA CCTACGAGCA CGGCGGCGAC
ACCAAGACCG TCACTCTCCA GCTGGACTCG GACGCGGACA CGTCCAACTC CTGA
 
Protein sequence
MNDQPTSPPP GPPNPYFSTP APLGPPPQGP PPGVAPDPQP GQRRPRRTGL AAAVVATALV 
AGGAAGVGGA AAWSALDDGG SSSAGGPSSR TTAQVVDTPD SEAPAGSVEQ VAAKVLPSVV
KIDVAGAQGA GSGSGIILSS DGEILTNNHV VELAGDNGSI RVSFNDGSTA KAEILGTDPL
TDTAVIKAQD VSGLTPATIG KSGDLKVGES VVAIGSPFGL DSTVTSGIVS ALDRPVDVGS
DGQGNSTTYP AIQTDAAINP GNSGGALVDL DGNVVGINSS IRTASSMEGQ AGSIGLGFAI
PMDEVMPIVD QMVNGETPTH ARLGISVSDV ASRPGAEVTE GAEVQDVNAG STADDAGLAK
GDIITKVDDQ LISGADSLVA TIRSYRPGDE VTVTYEHGGD TKTVTLQLDS DADTSNS