Gene Noc_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0041 
Symbol 
ID3705974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp39123 
End bp40130 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content53% 
IMG OID637736565 
ProductO-sialoglycoprotein endopeptidase 
Protein accessionYP_342113 
Protein GI77163588 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.676343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGTAC TTGGAATAGA AACTTCCTGC GATGAAACAG GGGTAGCCGT TTTTGACAGT 
AAACAGGGAC TATTGGCCGA TATCCTCCAT AGTCAGGCAA AACTACATGC AGGCTACGGT
GGGGTCGTTC CAGAATTAGC CTCTCGAGAT CACATTCGCA AATTACTGCC TCTGCTTCGC
AAGGTGTTAA ACCAGGCTGG CCTTAACAAG GAGGACATTG ATGGCGTTGC CTATACGGGA
GGTCCAGGGT TAATCGGCGC TTTATTAGTC GGTGCTGCCG TGGGTCGTAG CTTGGCCTGG
GCTTTGGGCA AACCGGCAAT CGCTGTCCAT CACATGGAAG GACATTTACT CTCCCCTCTA
TTAGAACCCA ATCCCCCCGG TTTCCCCTTT TGCGCCCTCC TTATTTCCGG TGGCCATACC
ATGCTGGTTA CGGTCAACCG TATCGGCGCT TATCGGATAC TCGGAGAGTC CCTGGATGAC
GCGGTTGGGG AAGCTTTTGA TAAGACGGCG AAGCTGCTTC AACTCGGGTA TCCAGGGGGA
CCCGCCCTGG CTAAACTTGC TGAACAGGGA AATCCTGACC GTTTTTATTT CCCGCGCCCT
ATGCTCGACC GCCCAGGGTT AAACTTTAGC TTTAGTGGGT TAAAAACCTA TGCTCTCAAT
ACCCTGCATA AAAATGGCTC TGAAGCTGCC GCCGATATTG CCCGGGCCTT TCAAGATGCC
GTCGTAGACA CCCTTACTGT CAAATGTCGG CGGGCCCTGC AACAAACAGG ACTCAACCGC
CTGGTCGTTG CCGGCGGGGT TAGCGCCAAC CAAGCACTTC GACAGCGACT TAAAGCCATG
GGAACCCAAG AAAATGTCCA AGTTTATTAC CCGCGATTAG CTTTTTGTAC TGATAACGGG
GCAATGATCG CTTTTGCAGG CTGTCAGCGC CTGCTAGCAG GAGAACAAAC CAGCTCCGGA
TTTAGCGTTC GGGCAAGATG GCCCTTAGAT ACGCTGCCGG AAATTTAA
 
Protein sequence
MRVLGIETSC DETGVAVFDS KQGLLADILH SQAKLHAGYG GVVPELASRD HIRKLLPLLR 
KVLNQAGLNK EDIDGVAYTG GPGLIGALLV GAAVGRSLAW ALGKPAIAVH HMEGHLLSPL
LEPNPPGFPF CALLISGGHT MLVTVNRIGA YRILGESLDD AVGEAFDKTA KLLQLGYPGG
PALAKLAEQG NPDRFYFPRP MLDRPGLNFS FSGLKTYALN TLHKNGSEAA ADIARAFQDA
VVDTLTVKCR RALQQTGLNR LVVAGGVSAN QALRQRLKAM GTQENVQVYY PRLAFCTDNG
AMIAFAGCQR LLAGEQTSSG FSVRARWPLD TLPEI