Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2751 |
Symbol | |
ID | 3705289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3122731 |
End bp | 3123618 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637739229 |
Product | hypothetical protein |
Protein accession | YP_344730 |
Protein GI | 77166205 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00117385 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATAC TGCCTTCGCA CCGCCAGGGG CTTTATTATC TGGAGCATTG CCGAGTGATG GCCAAAGACG AGCGGGTGGT GTATGCCTGC CAGGAAGGCG CGTTCACCAA ATTTTTTGCT ATCCCGCCGG CGAATACCAA TGTCATTCTG CTGGGTAGCG GCACTTCCCT GACCCAAGCG GCCGCCCGCC TGCTGGCAAG CGAGCAGGTG ATGGTGGCGT TTGTGGGCGG CGGGGGAAGT CCCTTATTTC TGGCTTCTCA AAACGAATAC CGGCCGACTG AATACTGTCA AGCGTGGATG CGTTTATGGC AGGACAATGA CCAGCGCCTT AAGGTAGCTA AGACATTTCA AAGAAACCGG GCCGAATTTT TAATGCAGCA ATGGCCCAAA CTGGCAGAGC CGAAACCCCA TAAAGCGAGT CTGGAAAAGC TGGCCGAGCG TTATCTGGCG GACATTGAGC TGGCCGGGGA CAACGGAACG ATCCTGGCCC AGGAGGCCAA GTTCGCAAAA AAACTTTATA AATTTTGGGC GAACTGTACC GAGACTGAAA ACTTCACCCG CGATCCTGGC AAGCGGGATT TTAACGACCC CTTTAACAGT TATCTTGATC ATGGCAACTA TCTGGTCTAT GGGATTGCGG CAGCGGTTTT ATGGGTTTTG GGAATTCCCC ATTCCTTGCC GGTGATTCAC GGCACTACCC GGCGCGGGGC TTTGGTATTT GATGTGGCCG ACATCATTAA GGATACATGC GTGATGCCCA TTGCGTTTCA GCACGCTGCG GCAGGCCGCA GTGATCAAGA GATGCGCCAG GCGTGCATTG CCTGGCTTGA CGAAAGCCAC GCTATGACCT TTCTCTTCCA GTCCATCAAG CGCGTGGCCC AGCTGTGA
|
Protein sequence | MPILPSHRQG LYYLEHCRVM AKDERVVYAC QEGAFTKFFA IPPANTNVIL LGSGTSLTQA AARLLASEQV MVAFVGGGGS PLFLASQNEY RPTEYCQAWM RLWQDNDQRL KVAKTFQRNR AEFLMQQWPK LAEPKPHKAS LEKLAERYLA DIELAGDNGT ILAQEAKFAK KLYKFWANCT ETENFTRDPG KRDFNDPFNS YLDHGNYLVY GIAAAVLWVL GIPHSLPVIH GTTRRGALVF DVADIIKDTC VMPIAFQHAA AGRSDQEMRQ ACIAWLDESH AMTFLFQSIK RVAQL
|
| |