Gene Noc_0402 details


Gene Information

Locus tag: Noc_0402
Symbol:
ID: 3706573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 445303
End bp: 446892
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 55%
IMG OID: 637736914
Product: peptidase M48, Ste24p
Protein accession: YP_342458
Protein GI: 77163933
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
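
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 445303
END_BP = 446892
GENE_LENGTH_BP = 1590
PROTEIN_LENGTH_AA = 529


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))   # 1590
print(protein_length(GENE_LENGTH_BP))  # 529
```

Both results agree with the Gene Length and Protein Length values listed in the record.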


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATAAGC CTAGGTTTTC AGAGATCATC CTAGGCTTTT ATGAGCGGCG AGTAAGCTTC 
GCTTTATGGC TACTAGCTGC CGCACTGCTT AGCGCCTGTG CGATTAATCC GGTAACAGGC
GAGCGGGAGC TGTCGCTGGT CTCCGAAACC CAGGAAATTC AGATGGGGGA GGAGAATTAT
TTGACGATGC GGCAGATGCA GGGAGGGGAT TATACGGCTG ACCCCGCACT CACAGCCTAT
GTGAGCCAGG TGGGTCAGCG TCTGGCGGCC GTGAGTGATC GTTCGCTGCC CTATGAGTTC
TCGGTTATTA ACGATTCCAC CCCTAATGCG TGGGCCCTGC CGGGTGGTAA AATTGCTCTC
AACCGGGGTT TATTGACGGA GCTAAACAAT GAAGCTGAGC TGGCGGCGGT GCTAGGCCAT
GAGATCGTCC ATGCCGCAGC CGGTCATAGC GCCCAAGGCA TGGAGCGTGA TTTATTATTG
AAGGGGGCGG TGTTAGGTTC AGTGCTGGCA ACGGGAGTGA GTGAATACAC GCCCCTGGTG
TTGGGCGGGG CGCAAGCGGC GGCGCAACTG GTCAATCGGA AATATAGCCG CGATGCGGAG
CGGGAAGCGG ATCTTTATGG CATGCGCTAT ATGTCCCGCG CCGGCTATGA TCCTTGGGCA
GCGGTGAGCT TGCAGGAAAC TTTCGTTCGT CTCTCCGAGG GACAGCAGGA AAATTGGCTT
TCGGGGTTAT TGGCAAGCCA TCCTCCTTCC CTGGAACGGG TGGAGGCCAA TGAGATGACT
GCCCGTACTT TGCCTGCCGG AGGTGAGTTA GGGGCGGAAC GCTACCAAGC CAAGCTTGCT
CCCCTGAGGC AGGTAGAAAC GGCCTATGCT GCTTATGATC AAGGCCGTAA AGCACTTCAA
GAGGGCAACC TGGAGCAGGC GCTGAGTTTA GCGGAGCGGG CCATTACTGA AGAGCCCCGG
GAAGCCCTGT TCTATGGCTT GCGCGGTGAT GTCTATCTGG CAAGGAAACG CTATCAGGAA
GCTCTGGCCG ATTATAATCG GGCTATTAAG CGCAATGATC ATTTTTTTTA TTTTTATAAT
CAGCGGGGTT TGGTAAACAA AGCATTAGGC CATTCAGAAA AGGCGCGTCA AGATCTGCAG
CAGAGTATAG CCCTTTTACC CACGGAAAGC GCTAATAAAG CCTTAGGTGA TTTAGCCTTG
ACGCAAGGAG ATCGGCAAGG CGCCATGACC TACTACCAAA AGGCAGCCGC CTCCCAAACC
CCTTTGGGCC TTGAAGCAAG ACGCGCCTTG GTTTACCTGG ATTTGCCTAA TAATCCCCAA
AAATATCTAG CGGCTCAGGT CAAGTTAAAT CGGCGTGGCT ATCTCGTTGT CCGGGTGACT
AATCAGGCAC CGCTCCCCGT CCGCGATATT GGCATTGAAG TGCGTTATCT GGATTCCCAG
GGGCACGTCC AGTCCCATAA GCAAGCGTTT CAAGGAATTC TTGCCGCAGG TCAGACCGCG
CGCCTCAAGA CGAATCTGGG GCCGCTTTCA GATCCACGTG CGCTCGAACG GATCGAGGCT
AAGGTGATCC AGGCCCGAAT TGCCAATTAA
 
Protein sequence
MNKPRFSEII LGFYERRVSF ALWLLAAALL SACAINPVTG ERELSLVSET QEIQMGEENY 
LTMRQMQGGD YTADPALTAY VSQVGQRLAA VSDRSLPYEF SVINDSTPNA WALPGGKIAL
NRGLLTELNN EAELAAVLGH EIVHAAAGHS AQGMERDLLL KGAVLGSVLA TGVSEYTPLV
LGGAQAAAQL VNRKYSRDAE READLYGMRY MSRAGYDPWA AVSLQETFVR LSEGQQENWL
SGLLASHPPS LERVEANEMT ARTLPAGGEL GAERYQAKLA PLRQVETAYA AYDQGRKALQ
EGNLEQALSL AERAITEEPR EALFYGLRGD VYLARKRYQE ALADYNRAIK RNDHFFYFYN
QRGLVNKALG HSEKARQDLQ QSIALLPTES ANKALGDLAL TQGDRQGAMT YYQKAAASQT
PLGLEARRAL VYLDLPNNPQ KYLAAQVKLN RRGYLVVRVT NQAPLPVRDI GIEVRYLDSQ
GHVQSHKQAF QGILAAGQTA RLKTNLGPLS DPRALERIEA KVIQARIAN
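
The gene sequence starts with GTG while the protein sequence starts with M. This is consistent with translation table 11, which accepts GTG as an alternative start codon. The sketch below shows one way to reproduce the translation without external libraries. It assumes the codon-to-amino-acid assignments of table 11 (the same as the standard code) and its set of alternative start codons. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the amino-acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons accepted by table 11; at position 0 they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAATAAGC CTAGGTTTTC AGAGATCATC"))  # MNKPRFSEII
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence through the same function should give the complete protein if the record is internally consistent.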