Gene Noc_2751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2751 
Symbol 
ID3705289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3122731 
End bp3123618 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content55% 
IMG OID637739229 
Producthypothetical protein 
Protein accessionYP_344730 
Protein GI77166205 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00117385 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATAC TGCCTTCGCA CCGCCAGGGG CTTTATTATC TGGAGCATTG CCGAGTGATG 
GCCAAAGACG AGCGGGTGGT GTATGCCTGC CAGGAAGGCG CGTTCACCAA ATTTTTTGCT
ATCCCGCCGG CGAATACCAA TGTCATTCTG CTGGGTAGCG GCACTTCCCT GACCCAAGCG
GCCGCCCGCC TGCTGGCAAG CGAGCAGGTG ATGGTGGCGT TTGTGGGCGG CGGGGGAAGT
CCCTTATTTC TGGCTTCTCA AAACGAATAC CGGCCGACTG AATACTGTCA AGCGTGGATG
CGTTTATGGC AGGACAATGA CCAGCGCCTT AAGGTAGCTA AGACATTTCA AAGAAACCGG
GCCGAATTTT TAATGCAGCA ATGGCCCAAA CTGGCAGAGC CGAAACCCCA TAAAGCGAGT
CTGGAAAAGC TGGCCGAGCG TTATCTGGCG GACATTGAGC TGGCCGGGGA CAACGGAACG
ATCCTGGCCC AGGAGGCCAA GTTCGCAAAA AAACTTTATA AATTTTGGGC GAACTGTACC
GAGACTGAAA ACTTCACCCG CGATCCTGGC AAGCGGGATT TTAACGACCC CTTTAACAGT
TATCTTGATC ATGGCAACTA TCTGGTCTAT GGGATTGCGG CAGCGGTTTT ATGGGTTTTG
GGAATTCCCC ATTCCTTGCC GGTGATTCAC GGCACTACCC GGCGCGGGGC TTTGGTATTT
GATGTGGCCG ACATCATTAA GGATACATGC GTGATGCCCA TTGCGTTTCA GCACGCTGCG
GCAGGCCGCA GTGATCAAGA GATGCGCCAG GCGTGCATTG CCTGGCTTGA CGAAAGCCAC
GCTATGACCT TTCTCTTCCA GTCCATCAAG CGCGTGGCCC AGCTGTGA
 
Protein sequence
MPILPSHRQG LYYLEHCRVM AKDERVVYAC QEGAFTKFFA IPPANTNVIL LGSGTSLTQA 
AARLLASEQV MVAFVGGGGS PLFLASQNEY RPTEYCQAWM RLWQDNDQRL KVAKTFQRNR
AEFLMQQWPK LAEPKPHKAS LEKLAERYLA DIELAGDNGT ILAQEAKFAK KLYKFWANCT
ETENFTRDPG KRDFNDPFNS YLDHGNYLVY GIAAAVLWVL GIPHSLPVIH GTTRRGALVF
DVADIIKDTC VMPIAFQHAA AGRSDQEMRQ ACIAWLDESH AMTFLFQSIK RVAQL