Gene Noc_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1189 
Symbol 
ID3706763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1296443 
End bp1298167 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content54% 
IMG OID637737692 
ProductRecJ exonuclease 
Protein accessionYP_343221 
Protein GI77164696 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGAGA AAGTTCTCAA ACAGCGGCCT GTTAACGAGC TTGAATGGCC AGAGGCAATC 
CATCCGATAT TACGCCGGGT GTATGGCGCG CGAGGGATCA AGGCTCCAGA TGAGCTCGAC
TATACCCTGG AGCGTTTACC TTCTCCATGG TTGCTGAGCA ATATCAAAAT GGCAGTGACC
TTATTAATGG AAGCCCTCGT ACGGGATTGG CGGATTCTGG TGGTCGCTGA TTTTGATGCG
GATGGCGCCA CTAGCTGCGC GGTGGCTGTG CGGGCGCTTC GCTTAATGGG TGCCCATAAA
GTAGATTATT TGGTCCCCAA CCGTTTTATC CACGGCTACG GCCTTACCCC CGCCATTGTG
GCGGAGGCGA TGGCCCGGGG CCAGCCGGAT CTCATTATTA CCGTGGACAA TGGGATATCC
AGTCTGGCCG GCGTACAAGC GGCGCGAGCG GCCAATATTC GTGTGCTTAT CACGGATCAC
CACTTGCCAG GAATATCCTT GCCAGCCGCG AATGCTATTG TCAACCCTAA CCTTCCCAAT
GATCCTTTTC CCAGCACTTG CCTCGCAGGG GTGGGTGTTA TTTTTTATGT CATGCTAGCT
TTACGGGCCC ACCTAAGGGA GCAAGGCTGG TTTATTCGCA GAGATGAGCA GGAACCAGCG
CTTGCTCCCC TGCTGGACTT GGTTGCGCTT GGCACGGTCG CTGATGTGGT GCCGCTGGAT
CAGATTAATC GGATCTTGGT TGCCCAAGGA CTGGCCCGCA TTCGGCAGAG CCGCTGCTGC
GCTGGTATTC AGGCACTCGT GGCTTGCGCT CGACGCCCTC TTGAAACCTT GACCACCAGT
GATCTGGGTT TTGCGGTAGG GCCACGGCTA AATGCCGCAG GGCGCTTGGA GGATATGAGC
CTTGGTATTG CTTGTTTATT AACCGATTCC CTGGAGCTGG CGCAACAACA GGCTAATCAG
CTTGATGGGC TCAATCGTGA GCGGCGAGAG ATTGAGTCGA CTATGCAAGA GCAAGCAGTA
ACTCATCTGG AAAATTTGGT TTTTCAAGGA GAAGAGAGGG CGCCACTAGG CTATTGTTTA
TTTGATGAAT CGTGGCATCA AGGCGTAATC GGCCTGCTCG CTGCTCGCAT TCGGGAGCGG
GTCTATCGTC CAGTAATTGC TTTTGCTCCC CATGATAGTG AGGAGTTGAA GGGATCGGCC
CGTTCCATTC CAGGGCTTCA CATTCGTGAT GCTTTGGATA GAGTAGCAAC TTGCTATCCC
GATTTGCTGA CTAAATTTGG TGGTCATGCG ATGGCTGCCG GTTTAAGTTT GCGGCGGGGC
CATCTAGAGC CCTTCCGCGT TGCTTTTTTG GAGGTACTAG AAACTCTGCT TGATAAGGAA
GCCTTGGAGG ATGTTATTCT CAGCGATGGC AGCTTAGAAC AGTGGGATCT AGAAATGGCG
GAAACCTTGC GGAACGGTGG TCCTTGGGGG CAGGGATTTC CGGAGCCCTT GTTTGATGGG
GTATTTCGGG TAGCAGGGTT TCGTATTGTG GGCGAAGCGC ACCTTAAGCT GACGCTGACG
ACCCTGGATG GTCGACAGCA ACTGGAAGGA ATTGCCTTTC GCTGCCTCCC ACCCGACGGG
TTTGCGCTTG GCATAAAAAT AAGGCTTGCT TATCGGTTGG ATGTCAATAT ATATCGAGGT
TCGCGGACAG CGCAATTGAT GGTGGAGCAC TTAGAGTTGA TTTAA
 
Protein sequence
MAEKVLKQRP VNELEWPEAI HPILRRVYGA RGIKAPDELD YTLERLPSPW LLSNIKMAVT 
LLMEALVRDW RILVVADFDA DGATSCAVAV RALRLMGAHK VDYLVPNRFI HGYGLTPAIV
AEAMARGQPD LIITVDNGIS SLAGVQAARA ANIRVLITDH HLPGISLPAA NAIVNPNLPN
DPFPSTCLAG VGVIFYVMLA LRAHLREQGW FIRRDEQEPA LAPLLDLVAL GTVADVVPLD
QINRILVAQG LARIRQSRCC AGIQALVACA RRPLETLTTS DLGFAVGPRL NAAGRLEDMS
LGIACLLTDS LELAQQQANQ LDGLNRERRE IESTMQEQAV THLENLVFQG EERAPLGYCL
FDESWHQGVI GLLAARIRER VYRPVIAFAP HDSEELKGSA RSIPGLHIRD ALDRVATCYP
DLLTKFGGHA MAAGLSLRRG HLEPFRVAFL EVLETLLDKE ALEDVILSDG SLEQWDLEMA
ETLRNGGPWG QGFPEPLFDG VFRVAGFRIV GEAHLKLTLT TLDGRQQLEG IAFRCLPPDG
FALGIKIRLA YRLDVNIYRG SRTAQLMVEH LELI