Gene Noc_2903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2903 
Symbol 
ID3707420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3282822 
End bp3284546 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content58% 
IMG OID637739380 
ProductType I restriction-modification system M subunit 
Protein accessionYP_344879 
Protein GI77166354 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACA ATATCACCCT ACAACAACTG GAATCCTTCC TGTGGGAGGC CGCCGATATT 
CTCCGGGGCA ACATGGACGC CTCCGAGTAC AAGGATTATA TCTTCGGCAT GATGTTCCTC
AAGCGCCTGT CCGATGCCTT TGAGGAAGCC CAGGAAGGGG TTATCCAGTA CTACCTGGGC
AAGGGCAAAA CTGACGCCGA GGCCCGGGAG CTGGCCAACG ATGAGGACGA ATACGACAAG
ACCTTCTACA TTCCGCCCAT CGCCCGCTGG GGTGCCCTGA AAGACCTGAA ACACGATATT
GGCACCGAGC TGAACAAGGC CACGGAAGCC ATCGAGGAAG TCAACCCTTC CCTGGAAGGG
GTGCTGGTGT CCATCGACTT CAATATCAAG AACAAGCTCT CGGATAAGAA ACTGCGGGAT
CTGCTCCGTC ACTTCAGCCG CCATCGGCTT CGCAATGAAG ACTTCGAGCA CCCCGATCTA
CTGGGCACCG CCTACGAGTA CCTGATAAAA ATGTTTGCCG ATAGCGCCGG CAAGAAGGGC
GGCGAGTTTT ACACCCCTTC CGAGGTGGTG CGGCTGCTGG TGGCCCTGCT CAAGCCCCAG
GCCGGGATGC GCATCTACGA TCCCACCGCC GGGTCCGGCG GGATGCTGGT GCAGACCCGC
AACTATCTGG CCCGTCATGG TGAAAACCCG GCCAATCTGT CCCTGTTCGG TCAGGAGATG
AACCTGAACA CCTGGGCCAT CTGCAAGATG AATATGTTTT TGCACGGAGT CTACAGCGCC
GACATCCGCA AGGGGGATAC CCTGCGGGAA CCCCAGCATA CCCAGGGCGG TGAACTGATG
ACCTTTGACC GGGTGATCGC CAATCCCCCC TTTTCCCTGA AAAAGTGGGG CAAGGATGAA
GCGGACAAGG ACGCCTACGG GCGCTTCCCC TACGGTACAC CCCCCAAGGA CGCCGGGGAT
TTGGCCTTTG TCCAGCACAT GATCGCCAGC CTGAACGCCG AAGGCATGAT GGGGGTGGTC
ATGCCCCACG GGGTGCTGTT CCGGGGCGCC AGCGAAAAAG CCATCCGCCA GGGCATCCTG
AAGGATGATC TACTGGAAGC GGTGATCGGC CTGCCCGCCG CTTTGTTCTA CGGCACCGGC
ATCCCCGCCT GCCTGCTGAT CCTCAACAAA AACAAACCGG CGGAACGCAC AGGCAAGGTA
TTGTTCATCA ACGGCGAGCT GGAATTTCAG GAAGGCAAGA ACCAGAACAA ACTGCGCCCG
CAGGATATGG ACAAGATCGT TCGGACCTTC GATGACTACA GGGAGATCAA GCGGTATTCC
AAGGTGGTCA GTTTGGCGGA CATTGCCGGG AACGATGATA ACCTGAATAT TCGCCGCTAC
GCGGACACCT CGCCGCCCCC GGAAATCTTC GATGTCCGCG CCATTCTCCA CGGCGGTATT
CCCGTGCGGG AAGTGGAAAG CGAGTATATC CGGGAAGAAA TACTGGAAGA CTTTGATGTG
ACCATGGTCT TTGTGAGGCG GGATGAGCGC TACTTTGAGT TCAAGCCCGA GATCGAGTCC
AAGGAAGCCA TCCGGGAAGC GGCTGGGGAA GTTGACGCCA AGGTGATCCA ACAACTGGAA
CGCTGGTGGG ACAAGTACCG GGTGTCCCTG CATGAACTGG ACGCCCAGGT GGCGGCAGCT
GAGGAAGTGA TGAAAGGCTA TCTGAAGGAG CTAGGGTATG AGTGA
 
Protein sequence
MSHNITLQQL ESFLWEAADI LRGNMDASEY KDYIFGMMFL KRLSDAFEEA QEGVIQYYLG 
KGKTDAEARE LANDEDEYDK TFYIPPIARW GALKDLKHDI GTELNKATEA IEEVNPSLEG
VLVSIDFNIK NKLSDKKLRD LLRHFSRHRL RNEDFEHPDL LGTAYEYLIK MFADSAGKKG
GEFYTPSEVV RLLVALLKPQ AGMRIYDPTA GSGGMLVQTR NYLARHGENP ANLSLFGQEM
NLNTWAICKM NMFLHGVYSA DIRKGDTLRE PQHTQGGELM TFDRVIANPP FSLKKWGKDE
ADKDAYGRFP YGTPPKDAGD LAFVQHMIAS LNAEGMMGVV MPHGVLFRGA SEKAIRQGIL
KDDLLEAVIG LPAALFYGTG IPACLLILNK NKPAERTGKV LFINGELEFQ EGKNQNKLRP
QDMDKIVRTF DDYREIKRYS KVVSLADIAG NDDNLNIRRY ADTSPPPEIF DVRAILHGGI
PVREVESEYI REEILEDFDV TMVFVRRDER YFEFKPEIES KEAIREAAGE VDAKVIQQLE
RWWDKYRVSL HELDAQVAAA EEVMKGYLKE LGYE