Gene Noc_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2139 
Symbol 
ID3705331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2466230 
End bp2467669 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content54% 
IMG OID637738615 
Productpeptidase S1C, Do 
Protein accessionYP_344129 
Protein GI77165604 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0865536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAGAT ATCGGATGAT TTTGTTATTG CTAGCGGCGC TTTTTGCATT GCCAGCACAG 
GCCCTTGAGG AAGGTGGCGT TGAGAGTCTG CGCCAGACGG GTAAGGCCTT TGCCGCCGTG
GCCCGTGAAG TATCACCATC AGTGGTTTTC ATCCAGGTGG AGAGTATCCG TGAGGAACGT
GGTTCGCGGG CGTTTCCTCC ACCCTTTGGC GATGAGTTTT TCAGGCGCTT TTTTGGCGAG
CCCTTTTCGC CCCGGGGCGA GGCGCCAAAA GGCCAGCGCC GTGCCATCGG CCAGGGATCA
GGCTTCATTT TTTCATCGAA AAAGGGCTTG CTGTCGGACA AGACTTATAT TCTTACTAAC
AGCCATGTAG TGGAAGACGC TGACAAGATT CGGGTCCAAT TTCAAGATGA TCGCGAGTTC
GAGGGTGAAA TTGTTGGGAC CGATCCCAAA TCGGATATCG CGGTTATTGA GATTACCGTT
GGCGGTCTTC CAGCCCTGGA ATGGGGAGAT TCTTCCAAGC TTCAAGTAGG GGAATGGGTA
ATAGCCATGG GTAACCCCTT TGGCTTGAGC CATACCCTGA CCGTGGGCGT GGTGAGCGCC
ACGGGCCGCA CCAGCTTAGG CATCAGCGAT TATGAGGATT TTATTCAAAC GGATGCCGCT
ATCAATCCTG GCAATTCCGG TGGTCCCCTG GTAAATCTGA ACGGCGAAGT GGTGGGCGTC
AATACCGCTA TCTTCAGCCG GAGCGGGGGC TATATGGGCA TAGGATTCGC CATTCCCAGC
AAATTAGCCA AGGCTATTGC CAATCAACTC ATTGAAACGG GCGAGGTGAC CCGGGGTTAT
CTGGGAATTG TCATTCAGCC CCTGACGGCG GAGCTGGCGG AATCCTTCAA TATGGAACAG
TCCCAGGGTA TTCTCGTTGC CCAGGTATCG GAGGACTCGC CTGCAAAAAA GGCGGGGCTC
AAGCAAGGGG ATGTGATTGT CGGTTACCAG GATAAGCCGG TTAAAGATAT TGGCGGTTTC
CGTAATCGAG TTGCGCTCAC CGCGCCCGGT AGCCGTGAGA CCCTTACTAT TATCCGCGAT
GGTAAGCGGC AGAAGGTAAA AATTACTATT GGTGAACTCA GCCAAGAGCA GACCCTTGCC
GAAGGGCCTA CCCAGAGCGC GGAAGAATTG GGGTTGGCGG TTCAGACTTT GACTCCGGAG
CTGGCAAGAC AATTTGATGC CAAGGCGGGT GAAGGTGTGG TTGTTACTGG GGTTGAGCGT
GGCTCCATCG CCGCCATGGC GGGGATCAGG GTGGGTAACG TGATTCTCCA GATCAACCGC
AAGCCGATCC ATAGTGCGAA GGAATTCAAT CGTGCCATGC AGGAAAGCCA GGATGATAAG
CGTGCTTTGT TGCTGATTCG TTCAGGTAAT ATGCAACATT ACGTGGTACT GCGCTGGTAG
 
Protein sequence
MHRYRMILLL LAALFALPAQ ALEEGGVESL RQTGKAFAAV AREVSPSVVF IQVESIREER 
GSRAFPPPFG DEFFRRFFGE PFSPRGEAPK GQRRAIGQGS GFIFSSKKGL LSDKTYILTN
SHVVEDADKI RVQFQDDREF EGEIVGTDPK SDIAVIEITV GGLPALEWGD SSKLQVGEWV
IAMGNPFGLS HTLTVGVVSA TGRTSLGISD YEDFIQTDAA INPGNSGGPL VNLNGEVVGV
NTAIFSRSGG YMGIGFAIPS KLAKAIANQL IETGEVTRGY LGIVIQPLTA ELAESFNMEQ
SQGILVAQVS EDSPAKKAGL KQGDVIVGYQ DKPVKDIGGF RNRVALTAPG SRETLTIIRD
GKRQKVKITI GELSQEQTLA EGPTQSAEEL GLAVQTLTPE LARQFDAKAG EGVVVTGVER
GSIAAMAGIR VGNVILQINR KPIHSAKEFN RAMQESQDDK RALLLIRSGN MQHYVVLRW