Gene Noc_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2229 
Symbol 
ID3705109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2574115 
End bp2577720 
Gene Length3606 bp 
Protein Length1201 aa 
Translation table11 
GC content49% 
IMG OID637738705 
Producthypothetical protein 
Protein accessionYP_344219 
Protein GI77165694 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.186545 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATACAC CGCAGGAGCG TAAGCTAGAT CGCCGCGCCT TTCCGCGCCA ACCCGTACAG 
ATGGATGCTT ATATCAATAG CGCACGATTA GGGCGGCGCG AGGTGGAAAT TCGAGATTTT
TGCCTGGGCG GATTATTTTT GGTCCCTAGG GATTCCCAGG CGGCTTATAT TCAGTTTCGA
GGGCATACGG GCGAGACGAT CACCGTCCAT CTCTCTTGTC CAACACCTGC CGGTCATAAG
GATTTTTCGC TTAATGTCCG TATGGCGCGA GTGTCTGAAG GAGGAATAGG CGTAGCATTT
TCCAAGCCTG GGCCCGAGGT ATTGCAAGTT CTCCACTATC TTACCGCCCA GTCACGGCGA
GTTCCTCCCA AGCCGACACC GGCGCTGAAA CGAACTGTGC ACCCCGATGA GCGAAAAGGT
AAGCAAGGAG ACAATGCGCT CATAAAAGCT TGCCAGCAGC AAATAATTGA TTTTTCTCCT
ATAATTTTAA GGGATTTTTT TGAGCGGCTG GATGAAGCCT TATTTCTCTG TGCCAGGGAT
GCGGAAAATA ATCTGGAGCA AACCTTCTAT CTTGATGCCA TTGAAGAAAT AAAAGGTCAT
CGCCCGCAGG TTGAGGAACA CTGGCAAAAA AAGATTCTGG CGCGACTAAG CGCCTTGGGC
ACCACAGGTA ATGCTCCTCT TCCTCCGCGA GAGGATAAGA CGGAGTCTTC GAAGCTTTCC
ATGGTAGACA AGGACGAATT TGAAGATTGG TTAGCGATTG CCGAAGCCGT ATCGAAAATG
GAGGCCCGCT ATGCAGATCC ATTATTTGAG CTTGAGAAGC GCTTTAGCCG CCTGGCGGGG
GAGAGCCTTG ATAAGGAAAA TAACCCTGTG GGGCCAAGGG CTATTTGCCG GGCGCTGCAA
GAAGCAATAC GGGATCTAGA GATAAGCCAT TATATCAAGC AACGTATCTA CAAGGTATTC
AAGGAGACGG CTGAAGACAA ATTAAGTTTA TTGTACAATA ACTTAAATGA ACTACTAAGA
GGGGAAGGCA TTTTGCCATC CCTAGAACGG AAATTCGCCG CCCCCCAGAG GCAAAGTACT
CTAAAGAAAG CTCTACCGTT TAGCGAGGAG TCGTCAGTTC AAGCCAATGG AGCCGCTTCT
TCTTCAATGC CCGATAAACA AGAGGAAGCC CCGTTAAGCA ATCCCGTCTC CCCTGCCTCC
ATATCCCCTT CTTCTGTCCT GAAACCGGAG ACCCATGCTT TTGATCGCCC GCAACAAGCT
GATTCTACTA TAGCTACGGC TGAAGCTTAC CGGGTGACTC GTGAATTACT GGGTTTATAT
AAACTAGGTG GACGGCGCCC AAGAGGAGGA GAAGAACCCT CGGGAAAGGC TGCGGCTAGC
TCATTAAAGG CGAATAAGGC ATTCGCGAAT TCCAGCATGA CCAGTGTTAG CGAGGATAGA
ATTGCTAAGC GGATGGCCCA TCTTCTACCA GAGATCATGG AGCAGGAAGG CACTGTTCAC
AACGATCGCA GCAGTGCCGA AGCCTGCGAG CTTATGGAGA TTACGGGTAA CCTGCTGGTA
TCTATCTTAG AAGATAGTGT TCTGTCGGAG AATACGAAAA GCTGGATCGA GCAGTTGGAA
GCCCCCTTAC TGCGGCTCGC GGTCCTGGAT AAAGCTTTTC TGCATTTGGA AGATCATCCC
GCACGACAGC TTCTTAATCT ATTGGCGCAG CTTGAAATTC CTCCAAGCGA GGAGCGCGAT
GACATCGATG CGCAACTTGA GGACAACATT GATTATTTGG TGGAATATAT TGCTCGCGAT
TTTGATCAGG ATGTTGCTAT CTTTAACCGG GTTCTAGATG AGCTTAAGCA TCTGCTTGAA
CAGCGGACTC AGTCTATTTC TAGCAATATT GCTCAGGTAG TTGAGAGCTA TAAAGCACAA
CAAGAATTAA AAAAAAAGAG GCGGAGGATC GCCGAGAAGC AGCTAGAAGC AGGGCTGGGT
GGTGCCGGCG ATAGTCGTTG GGGCGATCCA ATGCCGAGCG TTAAAAATAG AGAGGAGGAT
CCGGAAGAGT GGGTAAATCG AGCTAAACAG CTACCTCTCC ATTGCTGGGT ACAATTTGCT
GATGGGCAAA ATCACCTGCG GCGTTTGCAG CTGGTGTGGG TTGCCGAAGA TTGTACTACC
TTTGTCTTTG TGGACTCTAA AGGTAGAAAA GCCGCCACTT TGAGTCTTAA TGAAGTTGCA
ATGCAGCTCC GCCGGGGAAC AGCCACGGTG CTAGAAGATG CGAATGTCCC TCTTTTGGAT
CGGGCGCAAT ACGCGGTTCT GCAAAAATTC CACAGCCATA TTGCTTATGA GGCGACCCAC
GATCCCCTGA CCGGATTGGT TAACCGCAAG GAATTCGAAC AGCAGGCAAA CCGGGCTCTA
GCCAAGGCAG ATCATGAAGC TCAATCCTAT GTCTTGCTCT ACTTGGATCT AGATCAGTTC
AAAATTATCA ACAGCACTTG TGGTTATGAG GCGGGAGATG CGCTATTAAA AGAAATTGCC
AGCTTGCTAA CGGAATTCTT GCCTGATGGA GGAATTTTGG CCCGTTTAGG GGATAATCAA
TTTGGAGTAT TGCTAAATCA ATGTTCCCAT GATGAGGGCT GTGAGATTGC GGAGCAGCAA
CGAATTGCCA TTGATAATCA TAGGCTAGTT TGGAATAACA AGCGTTTATC TGTGGGTGCC
AGTATTGGCT TGGTTTCCGT ATCCGAGCAA AAATGCGACG CAGCTGTATT GTTGCAACGA
GCAGAGGCAG CCTGCATGGA AGCAAAGGAG GCGGGAAGAA ACCGCATTCA AGCTTGCGAG
TTTGATGATG AGGAGTTTAG GCGCCGGCAT GGCATGATAG AGTGGGTTGC GAGAATCGAT
GAAGTATTGG AAGATGGTCG GTTGCAGCTA TGGTGTCAGC GAATTACCCC CATTGCGGAT
CATTTAGAAA TGGAACCCCA TTATGAGATC TTGCTACGTC TCCGGGACCA GGATGGGCGG
TGGATCGCCG CTGGGGAGTT TATTCAGACG GCGGAGTTCT ATCATCGGAT GGCGGCTATC
GATCAATGGG TGATTCAATC TGCTTTTCGG TGGATGGCTG ATCATAAGGA GAGGCTAGAG
CAGCTTGGAG GGATAGCGAT CAACCTTTCA GGGCAATCCT TGAATGATAG GCGGCTAGTG
ACATTTATAA AACGTGAATT TGCCAGAACC GGGGTGCCTC CCCAGCGAGT TTGCTTTGAA
ATTACGGAAA CCGCTGGCGT TGCTAATCTC TCCCATTCCG CCCAGTTGAT TGAGGCAGTG
AAGGATTTAG GCTGTCATTT TTCATTAGAT GATTTTGGCA GTGGCTTGTC CTCCTATTCA
TATCTAAAAA ACCTGCCGGT AGATTATCTT AAGATTGATG GAGCATTTGT GAAAGACATT
GCTACCAGTC CCAGCGACTA TGCAGTGGTC AAATCTATTA ATGAAATCGG CCATTTTATG
GGCAAGAAAG TGATTGCTGA ATTTGTAGAA AGCAAGGCTA TTTTGGCGAA ACTTCAAGAA
ATTGGCGTTG ATTTCGCGCA AGGATATGGT ATAGAGCCAC CCCAAATCTT GAGCGCAATA
GAATAA
 
Protein sequence
MYTPQERKLD RRAFPRQPVQ MDAYINSARL GRREVEIRDF CLGGLFLVPR DSQAAYIQFR 
GHTGETITVH LSCPTPAGHK DFSLNVRMAR VSEGGIGVAF SKPGPEVLQV LHYLTAQSRR
VPPKPTPALK RTVHPDERKG KQGDNALIKA CQQQIIDFSP IILRDFFERL DEALFLCARD
AENNLEQTFY LDAIEEIKGH RPQVEEHWQK KILARLSALG TTGNAPLPPR EDKTESSKLS
MVDKDEFEDW LAIAEAVSKM EARYADPLFE LEKRFSRLAG ESLDKENNPV GPRAICRALQ
EAIRDLEISH YIKQRIYKVF KETAEDKLSL LYNNLNELLR GEGILPSLER KFAAPQRQST
LKKALPFSEE SSVQANGAAS SSMPDKQEEA PLSNPVSPAS ISPSSVLKPE THAFDRPQQA
DSTIATAEAY RVTRELLGLY KLGGRRPRGG EEPSGKAAAS SLKANKAFAN SSMTSVSEDR
IAKRMAHLLP EIMEQEGTVH NDRSSAEACE LMEITGNLLV SILEDSVLSE NTKSWIEQLE
APLLRLAVLD KAFLHLEDHP ARQLLNLLAQ LEIPPSEERD DIDAQLEDNI DYLVEYIARD
FDQDVAIFNR VLDELKHLLE QRTQSISSNI AQVVESYKAQ QELKKKRRRI AEKQLEAGLG
GAGDSRWGDP MPSVKNREED PEEWVNRAKQ LPLHCWVQFA DGQNHLRRLQ LVWVAEDCTT
FVFVDSKGRK AATLSLNEVA MQLRRGTATV LEDANVPLLD RAQYAVLQKF HSHIAYEATH
DPLTGLVNRK EFEQQANRAL AKADHEAQSY VLLYLDLDQF KIINSTCGYE AGDALLKEIA
SLLTEFLPDG GILARLGDNQ FGVLLNQCSH DEGCEIAEQQ RIAIDNHRLV WNNKRLSVGA
SIGLVSVSEQ KCDAAVLLQR AEAACMEAKE AGRNRIQACE FDDEEFRRRH GMIEWVARID
EVLEDGRLQL WCQRITPIAD HLEMEPHYEI LLRLRDQDGR WIAAGEFIQT AEFYHRMAAI
DQWVIQSAFR WMADHKERLE QLGGIAINLS GQSLNDRRLV TFIKREFART GVPPQRVCFE
ITETAGVANL SHSAQLIEAV KDLGCHFSLD DFGSGLSSYS YLKNLPVDYL KIDGAFVKDI
ATSPSDYAVV KSINEIGHFM GKKVIAEFVE SKAILAKLQE IGVDFAQGYG IEPPQILSAI
E