Gene Noc_2688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2688 
Symbol 
ID3704445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3044928 
End bp3046391 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content53% 
IMG OID637739170 
Productrestriction endonuclease S subunits-like 
Protein accessionYP_344671 
Protein GI77166146 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.165356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGGTT TGGTTGATAC GAATTTGGCT ACGCAAGCGG CGACTGGATC GCCCGATCAG 
GGCGGGGCTA GTGCGGGTGG CATTCCGAAG TATCGGGAGT ACAAGAATTC AGACGTAGTT
TGGATTGGTG AAGTACCAAG CTTCTGGGAG GTCAAACCGT TCAAATGGCT GCTCACCCAT
AACGAAGGAG GCGTGTGGGG CGATGACCCA GCAGGCGAAG GTGACACGAT TGTCCTGCGC
TCCACCGATC AAACCGTTGA TGGCAACTGG AATGTCACCG ATCCTGCCGT CCGCCACCTC
ACCGTCAAAG AAAATGCCTC TGCGGTTCTT GAGGCGGGTG ACTTGGTTGT AACAAAATCC
AGCGGCAGCG CTTTGCACAT CGGCAAAACA ACGTTGGTAA ACGTTGACAT GGCAAAACTA
GGTTATTGCT ATGGAAATTT CATGCAAAGG CTAAGGCTTG GCCAAAAGTA TATTCCCAAG
CTAGCTTGGT ATGTCATGAA TAATGACTTG GTTAGGTTGC AATTGAACTT GCTATCAAAC
TCAACAACTG GGCTTGCAAA TCTGAACGCT ACGTTGATTG GCGAGATTTT GCTGCCGGTT
CCCCCTGTTG AAGAACAAAC CCAAATCGCC CGCTTCCTCG ACCACGAAAC CGCCCGCATC
GACGCACTGA TTGAAGAGCA GCAGCGTCTG ATTGAACTGC TCAAGGAAAA GCGCCAGGCC
ATCATCTCCC ACGCTGTCAC CAAGGGCCTG GACCCCACCG TGCCGATGAA AGACTCCGGC
GTGGAGTGGC TGGGCGAAGT GCCGGCGCAT TGGATTACCA AGCCGCTGAA GCATCTGGCT
GAGCTGAACC CGAAGAAATC AGGCTACCAC GGCGATCGGG ATGAGCTTTG CAGTTTCGTT
CCAATGGAGA AGTTGAAGAC TGGTGTTATT CAACTGGATG AGGAGCGATT CATTGCCGAT
GTAATTTCTG GCTACACCTA CTTTGAAGAT GGCGATGTGC TGCAGGCGAA AGTCACACCA
TGTTTTGAGA ATCGAAACAT CGCTATAGCT GATGGTTTAA CAAATGGTGT GGGTTTTGGG
TCGAGTGAAA TCAACGTATT AAGGCCGTTC CCAGACGTTA ACGCATCATT TCTCTACTAC
CGGCTGCAAG AAGATGGCTA CATGGGAATT TGCACTGCGT CGATGATTGG CGCGGGCGGT
CTAAAACGAG TGCCAGGTGA AGTCATAAAT GGTTTCACGG TAGCCGTTCC CGAACGCCAC
GAGCAAACCC AAATTGCCCA TTTCCTCGAC CACGAAACCG CCCGCGTGGA CAAATTGGTC
GAAGAGGCAA ACGTTGGCAT TGAACTCCTG AAAGAACGCC GCTCCGCCCT GATCTCCGCC
GCCGTCACCG GAAAAATCGA CGTGCGCGGT TGGCAGCCGC CGGCCAGCGC GCCATCTCCC
GAATTGGAAA ACGAGGCCGT GTAA
 
Protein sequence
MTGLVDTNLA TQAATGSPDQ GGASAGGIPK YREYKNSDVV WIGEVPSFWE VKPFKWLLTH 
NEGGVWGDDP AGEGDTIVLR STDQTVDGNW NVTDPAVRHL TVKENASAVL EAGDLVVTKS
SGSALHIGKT TLVNVDMAKL GYCYGNFMQR LRLGQKYIPK LAWYVMNNDL VRLQLNLLSN
STTGLANLNA TLIGEILLPV PPVEEQTQIA RFLDHETARI DALIEEQQRL IELLKEKRQA
IISHAVTKGL DPTVPMKDSG VEWLGEVPAH WITKPLKHLA ELNPKKSGYH GDRDELCSFV
PMEKLKTGVI QLDEERFIAD VISGYTYFED GDVLQAKVTP CFENRNIAIA DGLTNGVGFG
SSEINVLRPF PDVNASFLYY RLQEDGYMGI CTASMIGAGG LKRVPGEVIN GFTVAVPERH
EQTQIAHFLD HETARVDKLV EEANVGIELL KERRSALISA AVTGKIDVRG WQPPASAPSP
ELENEAV