Gene Noc_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2002 
Symbol 
ID3705192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2308232 
End bp2310238 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content53% 
IMG OID637738479 
ProductUvrD/REP helicase 
Protein accessionYP_343994 
Protein GI77165469 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID[TIGR01074] ATP-dependent DNA helicase Rep 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0486961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCAAATC TTAACCCCCA ACAGCGTTTG GCAGTCCGCC ACATCGATGG CCCCCTGTTG 
GTGTTGGCGG GAGCTGGCAG CGGTAAAACC CGAGTGATTA CCCACAAAAT TGTCTATCTC
ATTGAGCAAT GTCATTTGTC GGCGCGATCC ATTGTGGCGG TGACGTTCAC CAATAAAGCC
GCCCGGGAAA TGAAGTCTCG GATAGGACAA TTACTAACTA AGGGAGAAAG CCGGGGATTA
GTTGTTTCTA CCTTTCACGC TTTGGGACTC AATATCTTGC GCCGCGAACA CGAAATTCTC
AGACTGAAAG CGGGTTTTTC TTTGCTGGAT GCCCAGGATA GCCGCGCACT TATCTGCGAT
CTCCATCAGC AAGAGTTTAG TAGCGGCGGA GAGGAAAGCA GCTTCCAGTG GCAAATTTCC
ACCTGGAAAA ACGCATTAGT GACGCCTGAG GAAGCCTTAT GCAGGGCCAG CAATGATCAG
GAAGCCATAG CAGCCCAGCT TTATGCCGCT TATGATCGGC GCCTGCGGGC TTATAATGCT
GTCGATTTCG ATGATTTGAT TGGTTTACCT GTTCATCTTT TAACGACGCG CCCGGAAATT
CTCAGTCGTT GGCAAAATTA TTTCCGTTAT CTGCTAGTAG ACGAGTATCA AGATACCAAC
GCAGCCCAGT ACCAGTTGGT TAAGTACTTA GCTGGAGTGC GGGGTGCCGT TACCGTGGTG
GGAGATGATG ACCAATCGGT ATACGCTTGG CGGGGCGCCC AGCCGGAAAA CTTGCATCAG
CTCAAAGAAG ATTTTCCCCA GCTTACGGTT ATTAAGCTAG AGCAGAACTA CCGCTCCACC
ACTCGCATTT TGCGGGTAGC CAATCAACTG ATTAGCTCTA ATCCCCATGT CTTTGAAAAA
CGACTTTGGA GTGCCTTAGG CGAAGGTGAT TCCATTCGGG TGTTGACCTG CCGGGATGAG
CACCATGAGG CAGATAGAGT CGTTGCCGAA CTGATGTACC ATCGTTTTAA GTACCGCACG
GCATGCCGTG ATTACGCCAT CCTATACCGG GGCAACTATC AATCTAGGCC CTTTGAGCGG
GCTTTGCGGG CCCACGGCAT CCCTTATGTT CTGAGTGGAG GGACTTCCTT CTTCGAGCGA
GGCGAAGTCA AGGATATCAT GGCTTACCTG CGCCTGTTGG CTAATGAGGA TGACGATAAT
GCTTTTTTAC GGGTGGCTAA TACGCCGCGC CGGGGAATCG GAGCGGTCAC CCTGGAAAAA
CTGGCGGGAT ACGCGGCTTT GCGGGGGCAG AGCTTGTTAG TCTCAGGCTT TGAATTAGGC
TTAGGAGAGC ATCTTTCCGG TGAAGCCTTG CCTAGGCTGC GCCGATTTTG CGAGTGGGTT
GTGGATTTGG CTGATCGGGG CCGCCGTGGC GATCCCATTG CGGTGATAAA AGACCTCATT
GCTGACATTG ATTATCGTGC CTGGCTTGAT GAAAATTGTA ATGATCGGCG TACCGCAGAG
CGGCGGATGG CTAATGTGGA GGAGCTGGTA GGGTGGCTGG AACGCCTCTA TCAGCGGGGA
GATGAACGCC GGGCCCTCGG CGATTTGGTA GCGGAAATAA GCTTGCAAGA TATCCTGGAG
CGAACCCAGG AGAAAAAGGA CCGGGATGCG GTTAACTTAT TGACGCTCCA CGCCGCCAAG
GGCCTAGAAT TTCCCTACGT TTTTATGGTG GGGATGGAGG AGGAATTGCT ACCTCACCGA
ACCAGCGTGG AGCAGGGCAC TTTAGAGGAG GAACGGCGCT TGGCTTATGT GGGAATCACC
CGGGCCCAAA GGAGTCTTTG TTTTACTATG GCGGAAAAGC GTCAGCAGTA TGGGGAAACC
ATCTTGTGCG AACCCAGTCG GTTTTTGTCA GAACTACCGG CTGCGGATCT TCAATGGGAG
CGGGAGGGAA TTCCCCGTGA CCCCGCAGAA CGGATGGAGA GGGGTCAAGT GCATCTGGCT
AATTTGCGGG AGATGCTCCG TCAGTAG
 
Protein sequence
MPNLNPQQRL AVRHIDGPLL VLAGAGSGKT RVITHKIVYL IEQCHLSARS IVAVTFTNKA 
AREMKSRIGQ LLTKGESRGL VVSTFHALGL NILRREHEIL RLKAGFSLLD AQDSRALICD
LHQQEFSSGG EESSFQWQIS TWKNALVTPE EALCRASNDQ EAIAAQLYAA YDRRLRAYNA
VDFDDLIGLP VHLLTTRPEI LSRWQNYFRY LLVDEYQDTN AAQYQLVKYL AGVRGAVTVV
GDDDQSVYAW RGAQPENLHQ LKEDFPQLTV IKLEQNYRST TRILRVANQL ISSNPHVFEK
RLWSALGEGD SIRVLTCRDE HHEADRVVAE LMYHRFKYRT ACRDYAILYR GNYQSRPFER
ALRAHGIPYV LSGGTSFFER GEVKDIMAYL RLLANEDDDN AFLRVANTPR RGIGAVTLEK
LAGYAALRGQ SLLVSGFELG LGEHLSGEAL PRLRRFCEWV VDLADRGRRG DPIAVIKDLI
ADIDYRAWLD ENCNDRRTAE RRMANVEELV GWLERLYQRG DERRALGDLV AEISLQDILE
RTQEKKDRDA VNLLTLHAAK GLEFPYVFMV GMEEELLPHR TSVEQGTLEE ERRLAYVGIT
RAQRSLCFTM AEKRQQYGET ILCEPSRFLS ELPAADLQWE REGIPRDPAE RMERGQVHLA
NLREMLRQ