Gene Noc_2845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2845 
Symbol 
ID3705537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3222196 
End bp3223173 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content47% 
IMG OID637739321 
Productpeptidase S49, SppA 
Protein accessionYP_344821 
Protein GI77166296 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.25989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAACC AAGATCCATC TACGACCGAT CTAAGCAAAA CACAGGCAAT CCAGCGCAAC 
TGGGAGCGAG AGGTCTTGGA GCGTTTGGCT TTTAGCGCTC TTAAAGAACA GCGGCGGGCG
CGTCGTTGGA GTATTTTTTT CAAACTACTC TTTGCTGCCT ATTTACTCGT TCTCTTGCTG
CTTTATCTTC CTAATGGGCT TAGCGCTCCA GGAATAATTA CGCCCCACAC TGCCTTAGTA
AAAATAGAAG GGATCATTGG CAGCGACTCA TTTGCAAATG CGGAAAATAT AAAAAAAGGA
CTAAAAGCCG CCTTTGAGAA CGAACATATC GCTGGGTTAA TTCTGCATAT CAATAGTCCT
GGCGGCAGCC CAGTCCAGGC AAACCAAATC AACGACCAAA TCCATCAATT GCGCAAAGAA
CATCCTAATA TCCCTATCCA CGCCGTTATT ACAGACATCT GCGCCTCGGG AGGATATTAC
ATTGCGGTAG CGGCAGATCA AATTTATGCG GACAAAGCAA GTATCGTGGG TTCCATTGGT
GCTCTGATCA ATAGTTTCGG CTTCGTTGAA GCTATGGAAA AACTGGGAAT TGAGCGGCGC
CTCTTTACTG CAGGCGACTA TAAAGGTTTC CTTGATCCGT TCTCGCCTAT GAAAGAATTC
GAATCCCAAC ATATCCAAAA GATGCTAGAC AACATCCACA AGCAATTCAT TCAGGTTGTA
AAAGACAACC GCGGAGAACG ACTCAAGAAC GATTCTTCCC TATTCAGCGG ATTAGTCTGG
ACAGGCGAGC AAGCCATCGA TCTAGGATTG ATCGATGGCT TGGGAAACAG CAACTACGTG
GCTCGCGAGA TCATAGGAGC GGAAAAAATT GTCGATTACA CCCCCCAGCC CAAGCTCCTA
GATCGCTTCT CTCGCCAGCT TGGAGCCACT TTCGCTAATA TGCTATCCGG CTTCACCCTT
GCGGCTAATA CAAGATAG
 
Protein sequence
MSNQDPSTTD LSKTQAIQRN WEREVLERLA FSALKEQRRA RRWSIFFKLL FAAYLLVLLL 
LYLPNGLSAP GIITPHTALV KIEGIIGSDS FANAENIKKG LKAAFENEHI AGLILHINSP
GGSPVQANQI NDQIHQLRKE HPNIPIHAVI TDICASGGYY IAVAADQIYA DKASIVGSIG
ALINSFGFVE AMEKLGIERR LFTAGDYKGF LDPFSPMKEF ESQHIQKMLD NIHKQFIQVV
KDNRGERLKN DSSLFSGLVW TGEQAIDLGL IDGLGNSNYV AREIIGAEKI VDYTPQPKLL
DRFSRQLGAT FANMLSGFTL AANTR