Gene Noc_3030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3030 
Symbol 
ID3705777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3426811 
End bp3428505 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content46% 
IMG OID637739503 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_345001 
Protein GI77166476 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAAG CAGTGCATGC TTCACTGGAA CAGCATTTTG ATACCGCCTT TTCAGCGCCA 
GATGGTATCG CCAAGCTGCG CGAACTGATT TTGACCTTGG CCATGCAAGG CAAGTTGGTA
CCCCAAGACC CCAATGATCC ACCTGCCAGC GAGTTATTAA AGGAAATCGA AGCCGAAAAG
CGGCGTTTGG TAGAGGAGAA GAAAATTAAA AAGCCGAAGC TATTGCCACC TATTAAGCTA
GAGGAGGTGC CCTACGAGCT GCCAGAGGGG TGGGAGTGGG TCAGGCTAGG TGAAATCGGA
GTCATTAATC CTCGTAACAA TGCTGAAGAT TCAATAAAGG CCGGGTTTGT CCCTATGCCG
ATGATCCCTG AAGGTTATTC AGAAGAGCAC CAATTTGAGG AGCGCACTTG GAGTGATGTT
AAAAAAGGCT ACACCCATCT TGCCGATAGT GATGTGGGCA TGGCGAAAAT CACTCCTTGT
TTTGAGAATG CAAAATCATG CGTGTTTTCA GGGTTACCAA ATGGGCTAGG AGCAGGTACA
ACGGAGCTTC ACATTTTTAG AAACACTTTC AATGCGGTTC TGCCAAGATT CCTGCTGTAT
TACTTGAAAA ACCCACATTA TATTTCAAAA ACAGTGCCAT ACATGACAGG CTCAGCTGGC
CAAAAGCGAG TGCCAACACC GTATTTTACT GAGCAATTAT TTCCTCTTCC TTCCTTGTCC
GAACAACAGC GTATCGTCGC CCGCATCGAC CAGCTAATGG CTCGTTGTGA TGAGCTGGAA
AAACTGCGCA AAGAGCGGGA AGAGGTGCGC CTGAAGGTTC ACGCTGCGGC CATCAAGCAA
TTGTTGGATG CACCCGATGC CGGTTGGCCC TTTATCCAAC AGCATTTTAG TGAACTCTAC
ACAGTTAAAG AAAACGTCGC CGAACTGCGC AAAGCCATTC TACAACTCGC CGTTATAGGC
CGCCTTGTAC CCCAAGACTC CAACGACCCG CCTGCCTGCG AGCTGTTAAA GGAAATTGAA
GCTGAAAAAC AGCGGTTGGT GGATGAGAAG AAAATCAAAA AGCTAAAGCC GTTACCGCCT
ATTAAACCGG AGGAAGTGCC TTATCAATTG CCGCGAGGTT GGGAGTGGGT GAGGTTACAG
GATGTATTAG ATGTTCGAGA CGGAACACAT GATTCTCCAA AAGATGCTGT TGGGTCTGAT
ACCTACCCAC TCATCACCAG CAAGAACTTC TCCAATGGTC GAATTGATTT TTCTGAAGCA
AGGATGATTT CTTCAGAAGA TCATTTTGAA ATTACAAAAA GATCAAAGGT AGACCGCCTC
GACATACTTT TCTCGATGAT AGGTGGCAAT ATTGGTAACC AAGTTATCGT TCAAGAAGAT
CGTGAATTTA GCATAAAAAA TGTTGCTCTT TTCAAGTATT ACGATAGAAA CCTAACGTAC
CCATATTTTA TAAAGAGATT TATGGAGCAT ATTGCAGCTG ATTTGCAGCA AAAAGCCGTG
GGTGGAGCAC AACCTTTTGT ATCTCTCGGA TTTTTAAGGA ATATCGTTTT CGGCCTCCCT
CCGATAAACG AACAATACCA CATCGTTGCC CGTATCGATG AGTTGATGGC ATTGTGTGAC
AAGCTTGATC AGCAGATTGA AGCAGCTTCC TGCAAGCAGA GTGCCCTGCT TAACTCCGTG
ATGGCGCAGG TATAG
 
Protein sequence
MTQAVHASLE QHFDTAFSAP DGIAKLRELI LTLAMQGKLV PQDPNDPPAS ELLKEIEAEK 
RRLVEEKKIK KPKLLPPIKL EEVPYELPEG WEWVRLGEIG VINPRNNAED SIKAGFVPMP
MIPEGYSEEH QFEERTWSDV KKGYTHLADS DVGMAKITPC FENAKSCVFS GLPNGLGAGT
TELHIFRNTF NAVLPRFLLY YLKNPHYISK TVPYMTGSAG QKRVPTPYFT EQLFPLPSLS
EQQRIVARID QLMARCDELE KLRKEREEVR LKVHAAAIKQ LLDAPDAGWP FIQQHFSELY
TVKENVAELR KAILQLAVIG RLVPQDSNDP PACELLKEIE AEKQRLVDEK KIKKLKPLPP
IKPEEVPYQL PRGWEWVRLQ DVLDVRDGTH DSPKDAVGSD TYPLITSKNF SNGRIDFSEA
RMISSEDHFE ITKRSKVDRL DILFSMIGGN IGNQVIVQED REFSIKNVAL FKYYDRNLTY
PYFIKRFMEH IAADLQQKAV GGAQPFVSLG FLRNIVFGLP PINEQYHIVA RIDELMALCD
KLDQQIEAAS CKQSALLNSV MAQV