Gene Information |
Locus tag | Noc_0446 |
Symbol | |
ID | 3706617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 480607 |
End bp | 481911 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637736956 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_342500 |
Protein GI | 77163975 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
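The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: one residue per codon,
    minus the terminal stop codon (assumed to be included in the gene length)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(480607, 481911)
print(length)                  # 1305
print(protein_length(length))  # 434
```

Both values agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields listed in the record.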
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000807768 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAGG CAATGAAGAA TGACAAACAA ATGCGGGTGC CCAAGCTGCG CTTTCCTGAG TTTCGGGATG CGGGAGAGTG GGAGAAAGTT GCGCTTTCAA CGCAAGTTGA ACTCCTCTCG GGGCTTCATC TTTCACCGGA CGGATATACA GACACGGGAG ACATCCCGTA TTTCACAGGG CCATCAGATT ACACAAATGA CTTAGCTTTA GTGAGTAAAT GGACAACTCG TAGTGCAAAC GTAGGGCGCG CGGGGGATAC ACTAATAACT GTAAAGGGGA GCGGTGTGGG CGAGCTGCTT AACTTAGAGC TCGATGAAGT GGCTATGGGT CGTCAGTTGA TGGCAGTCAG AGCACGTACA GCACACGGAG AGTTTATTTT CCATTTTCTG ATAACGCAGC GCCTGCGGCT GATTGCTCTG GCCTCTGGGA ACCTTATTCC GGGACTCTCA CGGGGCGACA TCTTAAGCCT CAAAGTGCCA GTGCCAAGCC ATGAAGAACA ACAAAAAATC GCCGATTGTC TCTCCTCCCT CGATGCCCTG ATTGCCGCCC AGACAGAAAA ACTCGACGCC CTCAAAACCC ACAAAAAAGG ACTGATGCAG CAACTCTTCC CCCGGGCCGG CGAAACCGTC CCCCGGCTGC GCTTTCCCAA GTTTCGGGAT GGGGGGCGTT GGACAAGTAA AAAGATGAGT GACGTGTACC GATTCCTCTC AACAAATACG TATTCAAGAG ACAAGTTGAA TTACGAAAAA GGGGAAGTAA AAAATATTCA TTACGGAGAC ATCCATACAA AATTTTCTAC GTTGTTCGAT GTAACACAAG AATACGTTCC ATATATTAAT AGGACTGAAT CGCTAGAACG GATAAAAGAT GACAGCTATT GCTTAGAGGG CGATATCGTA TTCGCAGATG CTTCAGAGGA CGTCGAAGAT GTAGGGAAAA GCATTGAAAT CGTAAACACT GGTAACGAAA AAATACTATC TGGACTGCAT ACATTGCTGG CGCGACAAAA AAATAATGAC TTAGTTATTG GTTTTGGTGG TTATCTATTT AAGTCTGGCT TAATTCGAGA ACAGATCAAA AGAGAATCTC AAGGCGCTAA GGTTTTGGGC ATCTCCTCCG GGCGGTTGTC AAAGATTAAA GTTTGTTTTC CATATGAAAA ACGCGAACAA CAAAAAATCG CCCATTGCCT CTCCTCCCTC GATGCCCTGA TTGCCGCCCA GGCGGAAAAA ATCGACGCCC TCAAAACCCA CAAAAAAGGA CTGATGCAGC AGCTCTTTCC TTCGCTGGAG GAAGTCCATG CATGA
|
Protein sequence | MSKAMKNDKQ MRVPKLRFPE FRDAGEWEKV ALSTQVELLS GLHLSPDGYT DTGDIPYFTG PSDYTNDLAL VSKWTTRSAN VGRAGDTLIT VKGSGVGELL NLELDEVAMG RQLMAVRART AHGEFIFHFL ITQRLRLIAL ASGNLIPGLS RGDILSLKVP VPSHEEQQKI ADCLSSLDAL IAAQTEKLDA LKTHKKGLMQ QLFPRAGETV PRLRFPKFRD GGRWTSKKMS DVYRFLSTNT YSRDKLNYEK GEVKNIHYGD IHTKFSTLFD VTQEYVPYIN RTESLERIKD DSYCLEGDIV FADASEDVED VGKSIEIVNT GNEKILSGLH TLLARQKNND LVIGFGGYLF KSGLIREQIK RESQGAKVLG ISSGRLSKIK VCFPYEKREQ QKIAHCLSSL DALIAAQAEK IDALKTHKKG LMQQLFPSLE EVHA
|
| |
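The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions: the gene sequence as listed starts with ATG and ends with TGA, so it is treated here as already being in coding orientation even though the gene lies on the minus strand; and the codon assignments used are those of the standard genetic code, which translation table 11 shares (the two tables differ in permitted start codons). The names `BASES`, `AMINO_ACIDS`, and `translate` are hypothetical helpers, not part of the record.

```python
# Codon-to-residue lookup in NCBI order: each codon position cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Characters other than A, C, G, T raise ValueError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        residue = AMINO_ACIDS[16 * first + 4 * second + third]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence listed above.
print(translate("ATGAGTAAGG CAATGAAGAA TGACAAACAA"))  # MSKAMKNDKQ
print(translate("GAGGAAGTCC ATGCATGA"))               # EEVHA
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence; passing the full gene sequence to `translate` should allow the same comparison across the whole protein.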