Gene Noc_1301 details

Gene Information

Locus tag: Noc_1301
Symbol:
ID: 3706316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1441033
End bp: 1444878
Gene Length: 3846 bp
Protein Length: 1281 aa
Translation table: 11
GC content: 52%
IMG OID: 637737801
Product: ATP-dependent helicase HrpA
Protein accession: YP_343330
Protein GI: 77164805
COG category: [L] Replication, recombination and repair
COG ID: [COG1643] HrpA-like helicases
TIGRFAM ID: [TIGR01967] ATP-dependent helicase HrpA
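
The Start bp, End bp, Gene Length and Protein Length values above are consistent with one another. The sketch below shows the arithmetic, assuming 1-based inclusive coordinates and a CDS that includes its stop codon. Both are assumptions, not statements from this record, and the function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the final codon
    is a stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(feature_lengths(1441033, 1444878))  # (3846, 1281)
```

Under these assumptions the coordinates reproduce the reported 3846 bp and 1281 aa.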


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.144172
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGCGCG ATCAGCATCG ACTAAAACGG CGCCTGCAAC GGCTAACTAA AGGAAATTCA 
AGTTACAATT TAGGCCACCT AACCCAGGCC ATTGAAGACT CTCGCCTATG GCGGGAACAG
CGCCAAAGCC AACTGCCCAG ACCAGCTTTT GAGCAATCCT TGCCAGTCAT TGAACGGCGA
GAGGAAATTG GCGCAGCTAT CCGTAATCAT CAAGTGGTGA TTTTATGTGG CGAGACGGGT
TCTGGAAAAA CCACCCAATT GCCTAAAATA TGCTTGGAAC TGGGCCGCGG CGTTGCGGGT
ATGATTGGCC ATACCCAACC ACGCCGAATT GCGGCCCGCA CTGTAGCCAA CCGAATTGCC
AAGGAACTCA ATAGTGATCT GGGGCAAATC GTGGGCTATA AAGTCCGTTT CCATGATCAG
GTTAGCCCTA GCACCTATAT TAAGCTCATG ACCGACGGTA TTCTCTTGGC TGAAACTCAG
GGGGACCGCT TTCTAGACCA ATACGATACC CTCATTATCG ATGAAGCCCA TGAGCGCAGT
CTCAATATTG ATTTTCTGCT GGGCTATCTC AAGCAACTGC TGCCAAAACG GCCTGATCTC
AAGGTTATTA TTACCTCTGC CACGATTGAT ACCGAACGCT TTTCCCAGCA TTTTGGCCAG
GCCCCCATTA TCGAGGTTTC CGGACGCACT TATCCGGTAG AGATCCGCTA TCGCCCCCTT
TGTGGCGAAC AAGAAACCCA GGAGCGGAAC TTATCTGAGG GTATTCTAGA TGCCGTGGAT
GAACTGTCGC GTCTGGGTCC AGGGGACATT TTGGTTTTTC TCCCTGGGGA GCGGGAAATA
CGCGAGACTG CCGAGGCATT GCGCAAGCAT CATCCCCCCC ATACCGAAAT CCTGCCCCTT
TATGCCCGGC TTTCCTCCAC CGAGCAGAAC CGGGTTTTCA AACCCCATTC GGGAAGACGC
ATCGTGCTAG CAACAAATGT TGCGGAAACG TCTTTAACGG TGCCGGGTAT TCATTACGTT
GTAGATCCGG GCTTGGCCCG CCTGAGCCGC TACAGCGTAC GCAGCAAGGT GCAGCGGTTG
CCTATCGAAA AAATTTCCCA ATCCAGTGCT AATCAGCGGG CAGGCCGTTG TGGCCGCGTT
GCGACCGGCG TTTGCATCCG GCTCTATAGC GAGGAAGATT TTCTCGGCCG GCCTGAATTT
ACCGATCCGG AAGTCCTCCG AACTAATCTG GCATCGGTTA TTCTGCAAAT GAAGTCTTTG
CAATTGGGAG CGGTAGAAGA TTTTCCCTTT CTCGATCCCC CCCTCCCCAA AATGATTAAT
GATGGCCTGC GGCTGCTAGC TGAACTGGGG GCCGTAGACA AGGCCCAGAA TTTAACCCCC
CTAGGTCAAC GACTGGCGCG ATTGCCCATT GACCCTCGTA TTGGCCGAAT GGTATTGGCA
GGTGACGAAT TCCATTGCCT TAGCGAAATG CTTATCATCG CCAGCGCCCT CAGTATTCAA
GACCCACGGG AGCGTCCGCT TGAGGCCCAG CAAGCCGCCG ATGAGGCCCA TTCCAGGTTT
CAAGATGAAC GCTCTGATTT TCTCTCTTAT CTGAAATTGT GGGAGGACCT TCATCGCCAA
CGAGCCCGTC TCTCCCAAAA TAAGCTCCGA GCCTACTGCC GGGAACATTT TCTTTCCTAT
TTGCGCCTCC GAGAATGGCG CGATATCCAT CAGCAGCTCA AACTACTGGC CACCAATATT
GGCTTTCGCC CCAATCAGGT GGCAGCGGAA TACGGAGCTA TTCATCGGGC GCTGCTCACG
GGTTTGCTGG GCAATATTGC TGTTAAATCG GAAAAAGATC ACTATCTGGG TGCGAGAAAT
ATTAAGCTCC AGATCTTCCC TGGATCTGCT CTCTTTAAAA AGAGTCCCAA GTGGATCATG
GCGGCGGAAC TGGTGGAAAC CTCCCGGCTT TATGCTCGCT GCGCCGGTAA AATTGAGCCT
GAATGGCTTG AAGCTCTTGC CCTCCATCTC GTTAAACGCA GTTATTTTGA CCCCCATTGG
GAGAAACGCC CCGCTCAGGT AATAGCCTAT GAACGGATCA CCCTCTACGG TCTTACGGTG
ATTCCTAAGC GCCGGATTCA TTATGGTCCT GTTAACCCCG AAGAGGCGCG GGAAATCTTT
ATTCGTGAAG CCCTGGTTAA TGGCGACTAT GATACCCAGG CCCCCTTCTT CCGCCATAAC
CAAAAACTCA TCGCAGAAAT AGAGGAACTA GAGCACAAGA GCCGGCGGCG GGATGTGCTT
ATTGATGAGC AGAGCCTCTA TCAATTCTAT GAGGAGCGGC TTCCAGCGGG AGTCTACAAT
GGCGCTGGCT TTAAGAAATG GCGCCAGCAG GCGGAGAAAA AAAATCCCCA ACTATTATTC
CTCAGCCGGG AAGAGTTGAT GCGCCATGAT GCAAAAGAGA TAACAGGAGT GCGTTTTCCC
GATCAAATGA CCGTAAAAGG TCTCCCCCTT GCCTTGTCCT ATCACTTTGA ACCGGGCCAT
CCCGCGGATG GGGTAACGCT AACCGTCCCC CTCGCCGTGC TCAATCAATT GGAAGCAAGC
CATTTTCAAT GGCTAGTCCC AGGACTGCTT AAAGAAAAAA TCATTTGCCT AATTAAAGCC
TTACCTAAGG GCCTGCGCCG TAATTTCGTG CCCGTGCCTG ATTTTGCAGA GGCTTGTATT
CGCGCCTTAT CTCCGGCGCA AGGTCCGTTG TTGGACAGGC TAGCTCGCCA CCTCCAAAGC
ATGACGGGAG TCCCCCTCTC TGCCACCTGC TGGCAAGAAG TAGATCTGCC ACTCCATCTG
CAAATGAATT TCCGGCTAGT GGATGAAAAA GATAAAGAGT TGGCTACAGG CAGGGATTTA
GCCATCCTGC AACGACAGTG GGCCAGCAAA GCCCAGCGCA GCTTTCGAGG CTGGGATAAT
AGCGAGCTAA CCCGTGAAGG AATCACCCAA TGGGATTTTG GCGAATTGCC AGAACGGATT
GAATTAGAAC GCCAGGGGCT TAAACTCAAG GGTTATCCCG CGCTGCAGGA TACAGAAACC
GCAGTTTCCC TGGTCATTAT GGATTCAGCC GAGGCGGCAC AGGAGATTAC CCATCTGGGA
TTGCGGCGCT TATTTATGCT AGCATTAACT CAGCAGATTA AATATTTAAG AAAAAATCTG
CCGGGCATTC AAAAAATGTG CTTGCACTAC ACTAGCCTCC CGGCCATGCC GTGGGGAGAC
AGTGCCCCCT CCCAATCCTC TTGCGAAAGC CTCAAAGACG CCTTGATTCA GGGAATCATA
GACCGCACTT TTATCCTCGA CCACCCCCCC GTTCGCAACG GGGAAAAATT CATGGCCCGT
AAGGAAAAAG GCTGCGGCGA ACTGATGAGC ACCGCCAATG AATTTTGCCG TCTCATAGAG
GAAATTCTGA CCGAATATCA CGAGGTCGTC AGGCAGCTAA AGGGCAATCT TCCTTTTGCA
TGGCTAAACT CCATTCGTGA TATGAAGGAA CAACTAACCC ATTTGGTCTA TCATGGCTTT
ATCAACCAGA CGTCACCAAT ATGGCTCATT CATCTTCCCC GTTATCTCAA AGGGATAAAA
CTGCGGCTTG CAAAATTGCA GGAGAACCCC CGCCGAGATC AGCAACGGCA GGCAGAAATC
ACCCCTTTAT GGCAAGCTTA TCAAAAAAGA ATGGAAATAC AGCACCAGGA AGACGGCGTA
GTACCCGCCC TGGAAACTTA TCGCTGGATG TTGGAAGAAT ACCGGATCTC CCTTTTTGCC
CAGGAACTGG GGACCAAGCG CCCAGTCTCC CCTAAGCGGT TAGCCGCTCA ATGGAAAGAG
ATTTAA
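
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The following is a minimal sketch of one way to strip the layout and compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first row of the sequence, not the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGCAGCGCG ATCAGCATCG ACTAAAACGG CGCCTGCAAC GGCTAACTAA AGGAAATTCA"
print(len(clean_sequence(first_row)))  # 60
print(gc_percent(first_row))           # 50.0
```

Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content reported in the Gene Information section.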
 
Protein sequence
MQRDQHRLKR RLQRLTKGNS SYNLGHLTQA IEDSRLWREQ RQSQLPRPAF EQSLPVIERR 
EEIGAAIRNH QVVILCGETG SGKTTQLPKI CLELGRGVAG MIGHTQPRRI AARTVANRIA
KELNSDLGQI VGYKVRFHDQ VSPSTYIKLM TDGILLAETQ GDRFLDQYDT LIIDEAHERS
LNIDFLLGYL KQLLPKRPDL KVIITSATID TERFSQHFGQ APIIEVSGRT YPVEIRYRPL
CGEQETQERN LSEGILDAVD ELSRLGPGDI LVFLPGEREI RETAEALRKH HPPHTEILPL
YARLSSTEQN RVFKPHSGRR IVLATNVAET SLTVPGIHYV VDPGLARLSR YSVRSKVQRL
PIEKISQSSA NQRAGRCGRV ATGVCIRLYS EEDFLGRPEF TDPEVLRTNL ASVILQMKSL
QLGAVEDFPF LDPPLPKMIN DGLRLLAELG AVDKAQNLTP LGQRLARLPI DPRIGRMVLA
GDEFHCLSEM LIIASALSIQ DPRERPLEAQ QAADEAHSRF QDERSDFLSY LKLWEDLHRQ
RARLSQNKLR AYCREHFLSY LRLREWRDIH QQLKLLATNI GFRPNQVAAE YGAIHRALLT
GLLGNIAVKS EKDHYLGARN IKLQIFPGSA LFKKSPKWIM AAELVETSRL YARCAGKIEP
EWLEALALHL VKRSYFDPHW EKRPAQVIAY ERITLYGLTV IPKRRIHYGP VNPEEAREIF
IREALVNGDY DTQAPFFRHN QKLIAEIEEL EHKSRRRDVL IDEQSLYQFY EERLPAGVYN
GAGFKKWRQQ AEKKNPQLLF LSREELMRHD AKEITGVRFP DQMTVKGLPL ALSYHFEPGH
PADGVTLTVP LAVLNQLEAS HFQWLVPGLL KEKIICLIKA LPKGLRRNFV PVPDFAEACI
RALSPAQGPL LDRLARHLQS MTGVPLSATC WQEVDLPLHL QMNFRLVDEK DKELATGRDL
AILQRQWASK AQRSFRGWDN SELTREGITQ WDFGELPERI ELERQGLKLK GYPALQDTET
AVSLVIMDSA EAAQEITHLG LRRLFMLALT QQIKYLRKNL PGIQKMCLHY TSLPAMPWGD
SAPSQSSCES LKDALIQGII DRTFILDHPP VRNGEKFMAR KEKGCGELMS TANEFCRLIE
EILTEYHEVV RQLKGNLPFA WLNSIRDMKE QLTHLVYHGF INQTSPIWLI HLPRYLKGIK
LRLAKLQENP RRDQQRQAEI TPLWQAYQKR MEIQHQEDGV VPALETYRWM LEEYRISLFA
QELGTKRPVS PKRLAAQWKE I
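
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the standard codon assignments, which translation table 11 shares for elongation codons. It does not handle the alternative start codons that table 11 permits, which is enough here because this gene starts with ATG. The function name `translate` is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 uses the same
# assignments as the standard code for non-start positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed in this record.
first_row = "ATGCAGCGCG ATCAGCATCG ACTAAAACGG CGCCTGCAAC GGCTAACTAA AGGAAATTCA"
print(translate(first_row))  # MQRDQHRLKRRLQRLTKGNS
```

The output matches the first 20 residues of the protein sequence listed above (MQRDQHRLKR RLQRLTKGNS).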