Gene Noc_0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0654 
Symbol 
ID3706886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp703535 
End bp706384 
Gene Length2850 bp 
Protein Length949 aa 
Translation table11 
GC content54% 
IMG OID637737162 
Producthypothetical protein 
Protein accessionYP_342703 
Protein GI77164178 
COG category[R] General function prediction only 
COG ID[COG1483] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTGA AACCCTGGCG AGAAATTGCC ACTCCGCACA AGGATGTTCT GGAAGGCACC 
TTCAAACAGT CCGAGTTTGC GGCTGACATC ACCCAGGTAG CCAATGGCAC AGCCACTGCA
GAGTATCAGG ATGCGGAGAT GTTCTTCTCC CGCACTTATA TCACCGAGGG CATGCGCCTC
TTACTCATCT CTGTCGCGCA ACGGTTGGCT GGGCTCGGTG GCGACCCGGT CATTCAATTG
CAGACCGCTT TCGGTGGCGG CAAGACCCAC ACGCTGTTGG CCGTGTTTCA CCTGGCTTCG
CGCAAGGTGG GCACCGACAA ACTCACGGGT ATTCCTCCGG TATTAGATGA GGCAGGGATT
CAGAGCCTTC CCTCGGCCAG AGTGGCGGTG ATTGACGGCA TCAAATTGTC GCCTAGCCAG
CCCAGGAAGT ACGGGAGCAT AACTGCCAAC ACGTTGTGGG GCGAGCTGGC CTGGCAACTT
CTGGGGGATG AGGGTTATCA GATGGTCGCC GACAGCGACG CTGACGGCAC CTCTCCCGGA
AAAGAGGTTC TCACTGAACT GATCAGCAAG GCAGCCCCTT GTGTGATTCT GGTGGATGAG
CTGGTGGCGT TTATTCGACA ACTGGAGCTT GGCAAGCAAT ATAAGGCTGG CACATTCGAT
AGCAATGTCA GTTTTATTCA GGCGCTCACC GAAGCGCTGA AGGCAGTACC CAATGCCATT
CTATTGGCAT CGCTGCCCGA ATCAGAACTG GAAGTTGGAG GCACGCAAGG GCAGCGCGCC
CTTAACTCGC TGGAGAAGTA CTTCGCACGA GTTGAGTCCG TCTGGAAGCC GGTAGGCACC
GAGGAAGCGT TCGAGATTGT GCGTCGCAGA TTATTTGAGA ACCCGGGTGA ACGCGCCGAG
GTGGAGGGTA TCAGCCGTCA GTTCTCTGAC TTTTATCGTC AGAACGCCGA AAAGTTTCCG
GTCGAAACCC AATCCAACGA ATATTTCGAG CGTCTTTGCC GGTCTTACCC GATCCACCCG
GAAATTTTCG ACCGCTTGTA CGAGGACTGG TCCACGCTTG AAAAATTTCA GCGCACCCGT
GGCGTTCTTC AATATATGGC CATTGTTATC CACCGCCTGT GGAACTCGGA TAACAAAGAT
GCGCTGATCA TGCCGGGTTC ATTACCGTTA GAAGATGGTA ACGTGCGCAA CAAGAGCATT
CACTACCTGC CTCAGGGGTG GGAACCCGTG ATTGAACGAG AAGTGGACGG TACCCGCTCG
GCCCCCTATG ACATCGATGG CCACCACACC CTGTTCGGCA GCGTGCAAGC CGCGCGCCGC
ACCGCCCGGA CCATTTTTCT CGGCAGCGCA CCATCAACCA CTGAGCAGAT GATTCGCGGT
GTTCAGGTCG AACGCATCCT GCTGGGCGCG GCACAACCCG GTCAAACGCT CGGTGTATTT
GAAGACGTGC TCAAGCGCTT ACGTGATCGA CTGCATTATC TCTATTCCGA CAAAGACCGA
TTCTGGTTAG ATACCAAACC CAACCTGCGC CGAGAAATGG AGAGCCGCAA GCAGAACATC
AACGAACGAG ATGAACTCCT GCCATTGTTA AAAACCCGAG TAACCCAGGT ATTCGGCAGA
AATCACCAGT TTGGTGGCGT GCACGTCTTT ACACCTTCCG TCGATGTCCC GGACGACTAT
GGTACGGGCC CTCGACTGGT CGTGTTGCCG ACAAACACAG CCTACAGTCG CAGTGAGACC
AACCAGGCAT TTTCTGCCGC AGAAGAGATT TTGCGCAACC GCGGCGACCA ACCACGGCAA
AAACAAAATC GCCTGATCTT CCTGGCTCCC GACTACGATG TGGTTGGCCG ACTCAAAGAG
CAGGCCCGCA TTTTCCTGGC GTGGCAATCC ATTGCCACAG ATATCGAAAA TGGCCACTTG
AATCAGGATT TGTCTCATTT GAACCAGGCC AAGCGTAATC GTGATGGCGC AGATCAGTCG
CTTGCCCAAC TGGTGCGCGA AACTTACAAA TGGTTAATCG CCCCAGTTGA AGAATTCGTA
AAGGGTAAGC CAACTCTCAA TTGGGAGGTG GTTCCCGTCT CACCCGCAGC GCCCAACTTG
ATTCAGGCGA TCGAGGATAA ATTGCGCGAA GAGGAGTGGA TGATTTACGA GTGGTCTCCT
ATTCATCTTC GCAATGTACT TAAACAGTGG TACCTGAAAG AAGGCGTAAA CGACGTCAGT
GCGTTGAAGG TCTGGCAGGA CTGCTGTCAC TACCTGTACC TGCCTCGGCT TGTTAACGAC
AGTGTTTTCC GCAATGCCAT CACCCAGGGG ATCGAGGTTG AAGATTACTT CGCCTTCGCC
TCTGGCAAGG AAGGCGACCG TTATCTCGGC TTCACATTCG GGCGTAATTC TATTGCCACG
GTGGATGAAT CCTCTCTCCT GATAGATCGC GAGGCAGCAG TTGCTTACCG CGAAAATACA
CAGCAGCCCA CTCCACCAAC GGCGGAGCCC GGGACTGCAG GCGGCGAACC TGGAGGCACT
ACAATACCGG TTGGTGGCGC GAGCGGCACG GGGACACCAA CCCCAACATC CGGCGGGTTA
GGAGGCGCTG CCACAACAAC ACCAGCAGCA ACCAAGAAGC AGTTCTACGG CACTATTTCA
CTCGACCCGG TCAAAGCCAA AATGGACTTT GCCACCATAA TGGATGAAGT CGTACAGCAG
TTCACTGCCA AGCTCGGCGT GAATGTCAGG ATATCCGTGG AGATTGAGGC CAATAGCCAG
GATGGATTCA ATGAGTCCAT GCAGCGAACA GTCAAAGAGA ATTGTAATGT CTTGAAATTT
AGCTCAGCAG AATTTGAGGA AGAGTCGTGA
 
Protein sequence
MSLKPWREIA TPHKDVLEGT FKQSEFAADI TQVANGTATA EYQDAEMFFS RTYITEGMRL 
LLISVAQRLA GLGGDPVIQL QTAFGGGKTH TLLAVFHLAS RKVGTDKLTG IPPVLDEAGI
QSLPSARVAV IDGIKLSPSQ PRKYGSITAN TLWGELAWQL LGDEGYQMVA DSDADGTSPG
KEVLTELISK AAPCVILVDE LVAFIRQLEL GKQYKAGTFD SNVSFIQALT EALKAVPNAI
LLASLPESEL EVGGTQGQRA LNSLEKYFAR VESVWKPVGT EEAFEIVRRR LFENPGERAE
VEGISRQFSD FYRQNAEKFP VETQSNEYFE RLCRSYPIHP EIFDRLYEDW STLEKFQRTR
GVLQYMAIVI HRLWNSDNKD ALIMPGSLPL EDGNVRNKSI HYLPQGWEPV IEREVDGTRS
APYDIDGHHT LFGSVQAARR TARTIFLGSA PSTTEQMIRG VQVERILLGA AQPGQTLGVF
EDVLKRLRDR LHYLYSDKDR FWLDTKPNLR REMESRKQNI NERDELLPLL KTRVTQVFGR
NHQFGGVHVF TPSVDVPDDY GTGPRLVVLP TNTAYSRSET NQAFSAAEEI LRNRGDQPRQ
KQNRLIFLAP DYDVVGRLKE QARIFLAWQS IATDIENGHL NQDLSHLNQA KRNRDGADQS
LAQLVRETYK WLIAPVEEFV KGKPTLNWEV VPVSPAAPNL IQAIEDKLRE EEWMIYEWSP
IHLRNVLKQW YLKEGVNDVS ALKVWQDCCH YLYLPRLVND SVFRNAITQG IEVEDYFAFA
SGKEGDRYLG FTFGRNSIAT VDESSLLIDR EAAVAYRENT QQPTPPTAEP GTAGGEPGGT
TIPVGGASGT GTPTPTSGGL GGAATTTPAA TKKQFYGTIS LDPVKAKMDF ATIMDEVVQQ
FTAKLGVNVR ISVEIEANSQ DGFNESMQRT VKENCNVLKF SSAEFEEES