Gene Noc_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0044 
Symbol 
ID3705919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp41378 
End bp43102 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content50% 
IMG OID637736568 
ProductDNA primase 
Protein accessionYP_342116 
Protein GI77163591 
COG category[L] Replication, recombination and repair 
COG ID[COG0358] DNA primase (bacterial type) 
TIGRFAM ID[TIGR01391] DNA primase, catalytic core 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.723286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATGG GAGAGCGGAT CCCGCAAGAA TTTATTGATG AACTGGTGGC TCGTACGGAT 
ATCGTGGAAT TAATTGATTC CCGTGTTCCT TTGCGTAAAG CAAGCCACAA TTATGTGGCT
TGCTGCCCTT TTCACAATGA AAAGACCCCG TCTTTTACGG TGAGTCCCCA GAAACAGTTC
TATTATTGCT TTGGTTGCAG TGTTCATGGG ACGGCCATTG GTTTTTTGAT GGCGTTCGAC
CGCCTTAGTT TTATCGAGGC GGTAGAAGAA CTGGCGCAAC GAGCGGGAAT GATGGTTCCC
CAAAGCTCGA AGCAACAAGA CTACTACAAT CGGCACCAAG GCTTATATGA AGTACTGGCC
TGTGCCGCCG AGTTTTATCA GCAGCAGCTG GAGGCTAGTG CCTATCAAGG GCAGGTTAAG
GCTTATTTGC GGGAACGAGG TCTAAGTGGC CCTATTATTG CCGAATTTGG ACTAGGTTTT
GCTCCGCCCC GCTGGAATGC TTTATTACAT TATACCCGGC CCTCCCTTAA ATCTTACCTT
CAGGCAGCGG GTTTAACGAT CAGCAAAGGC GAGGATCGTT ACTATGATCG GTTTCGAGAT
CGTTTAATAT TCCCCATTCA TGATTATCGG GGCCGGGTAA TTGGTTTCGG CGGCCGTCTT
TTGGGTGATG GCTCTCCAAA ATATCTTAAT TCACCTGAAA CAGCGTTGTT TCATAAAGGA
CGAGAGCTTT ATGGCTTGTA TCAGGTACGA AAGTCCCTCC ACCGCTGCGA TAGGTTGTTG
GTGGTGGAAG GATACATGGA TGTTTTGGCC CTAGCAGAGC ATAAAATTCG CTATGCAGTC
GCGACTTTAG GGACAGCAAC GACTTCGGAT CATTTGACTC GCTTGTTCCG GATAACGCCA
GCGGTCATAT TTTGCTTTGA TGGCGATCGG GCAGGTTACC AAGCAGCCTG GCGGGCGCTA
GAAACGGCAC TGCCTTTGCT TAGTCAAGGG CGACAGGTAC AGTTTATGTT TTTACCCCAA
GGGGAAGATC CCGATACTAT GGTACGTGCT GAGGGGCAGA CGGCTTTTGA AGCGCGCTTG
GCAGAGGCCG TGCCGCTATC TGATTTTTTG TTAAATAATT TGCGGCAGAA GGTTAATCTT
AGCAGTGTGG ATGGATGCGC GCGTTTGGTG GAACTTGCCC GACCCCTCCT AGCTCGTATC
CCACCGGGAG TTTATCAAGA TATGCTGCTT GCGCGTCTAG CGGAACTCGC CCAGATAGAG
CAGACAACGC TTATTCGCCA CTTGAGTCCA GGAAAGAAAC CTACGGCGGT GCCTCTTCGG
CGATTGGAGC AAGGTGCCGC GTCTCCAACA CGACGGGCAG TAGCCATTTT GCTTCAACGG
CCTAAGATGA TTCAATGGGT AGATAAGAAT TTATCTCTTA GGGGACTAGA AGGTGCAGGG
GCAGAATTAC TTCAGAAACT AGTTGATTTA TTGCAAAACA ATCCACATCT AAATACGGCT
GCCCTCCTTG AACGCTGGCG GGATTCAGAA ATGGGCCGGT ATCTGGAGCA ATTAGCCGGC
TGGGAGTTGC TTTTGACTGA CGAGGATATG GTATTGGAGT TACAAGCTGC TTTAGAGCGT
TTGCAGGTAC AGGGGGCAGA GCAGCGGATA ATAACTCTTT CCAATCAGCC TTCCTTGACG
GGAGCAGAGC AACGAGAGTT GCTTGCTCTG CTGGCGGAAA AATAA
 
Protein sequence
MPMGERIPQE FIDELVARTD IVELIDSRVP LRKASHNYVA CCPFHNEKTP SFTVSPQKQF 
YYCFGCSVHG TAIGFLMAFD RLSFIEAVEE LAQRAGMMVP QSSKQQDYYN RHQGLYEVLA
CAAEFYQQQL EASAYQGQVK AYLRERGLSG PIIAEFGLGF APPRWNALLH YTRPSLKSYL
QAAGLTISKG EDRYYDRFRD RLIFPIHDYR GRVIGFGGRL LGDGSPKYLN SPETALFHKG
RELYGLYQVR KSLHRCDRLL VVEGYMDVLA LAEHKIRYAV ATLGTATTSD HLTRLFRITP
AVIFCFDGDR AGYQAAWRAL ETALPLLSQG RQVQFMFLPQ GEDPDTMVRA EGQTAFEARL
AEAVPLSDFL LNNLRQKVNL SSVDGCARLV ELARPLLARI PPGVYQDMLL ARLAELAQIE
QTTLIRHLSP GKKPTAVPLR RLEQGAASPT RRAVAILLQR PKMIQWVDKN LSLRGLEGAG
AELLQKLVDL LQNNPHLNTA ALLERWRDSE MGRYLEQLAG WELLLTDEDM VLELQAALER
LQVQGAEQRI ITLSNQPSLT GAEQRELLAL LAEK