Gene Noc_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0335 
Symbol 
ID3706506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp362415 
End bp365354 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content51% 
IMG OID637736847 
ProductPAS sensor diguanylate cyclase/phosphodiesterase 
Protein accessionYP_342391 
Protein GI77163866 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000621491 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAAA GCGTAGCATT TTCTGCAAAA AATATACTAA TGCCAGACCC TGAACAAACT 
CCTGCCCCCC GTTTTTTAAG CCTTAAGTGG AAGGCTGTTT TGGTTTTTAG TCTGGTTCTG
GTGACGATTA ATATTTCCTT GGCAGGGCTT GCCTATTTAG GACTTCAGCG CCAATTTGCC
CAGCAGCGCG AGCAAATTTA CCTTGGGCAT ATTAAAGAAA TTCGAGGGCT TATTAAGACT
TCTTACCAGC GTATGGAACA GCTAGCTGAT ATCGTACCCC TCTTACGCGG GGATAGGGCT
GATAAGAACT CTTTGGCTGG AAAAATTGAC TCCATATTTG AGCGGCATGG GCACTTGTTG
CAGATAGTAT GGGGGGTGGA AATAGCGAGT TTTTACTCTT CCGAGAATAA ATTATTGGTC
TCGTGGGGGC AGCAGGTCGG CACAGATCGG ATTTTGGAAT GGGTGCGTGC AGCGAATAAG
AGAGAACGGC CGATTACCGG ATTAGACTGT GGAGAGCGGT GCCTCCAGTA CGTAGCAGTC
CCTTTACTGG CTAATGATGA GCGCGCCGGG GTTTTGCTGC TTGGCCGCTC CCTTGCCGAG
GTAGTGCTTT CTTTCCGCCA GATTTCTGGC GACGACATCG GTATTATGAC TGCTGTTAAG
TTACCTTCTG AAAATGCGAC TGAACTGCGC CGACTAATGC CTGAATGGGG TATGCGGATG
GCGGCCTTGA CAAATGCCGA ACGTAATTTA TTGGTGTTAC GGTCGCTTTC TGAAGGCCAT
TCGCTAGCGG AGGTAGCTAG CCAGCATGTG TGGTACCACC ATGCGGGTCG AGAGTATGAA
GTTCGCCTTA TACCGATAAA TAAAGCAGGG GTGGAGAATC GGGAGGCTCA GCTGGTGGTC
ATTAGTGATG TGACCCGTGC TCTTGCTGAT ATTCGGCTAG CAACCCGGCG GAGCTTGCTA
GGGGGTTTGG TTGGCCTGGT GGTTTCCGAA ACTCTGCTGT TACTTTTGTT GTGGAAGCCA
ATGGCTCGTT TGCAGCGAGT CGTTTTAAGC CTGCCTTATT TGGCGGAGCA TGCTTTCGAG
AAAGTCCGGG CGCGCCTCAG TCATTCAGCC CAGCCTGCTT GGGGCCGGGA CGAAATTGAT
GTCCTTAACG ATACCGCCGT GACTCTTTCC TATCAGCTAG AGGCGTTGCA GGCAGAAATA
CAAGATCGAA CCCGCCATTT AGCCGAGCGG GGTGATGATT TAGCGCGGGA GCGAGACTTT
GTTACTGGGT TGTTGAATAC TGCTCAGGTT ATCATTCTAA CCCAGGATAG CGCCGGGCGA
GTGACGATGT TAAATCGGCA GGGGCAGAAA ATAACAGGTT ATGGCGCTGA TCAAATCACA
GGCCGGCCCT TCTATGAACT ATTGGCGGGC GACAAGGTTT CACCGGAACT ATTCCAACAG
TTGGAAGAAC TCCGGACTGG CCGCCGAGGC CAGGTACGCG TGGATACGGG ACTCCAGTGC
CAGAATGGCA GCCAGCGTAC TATTTCCTGG TTTCATTCTC GGTTGGCGGT TCACCCCTCC
AGTGATGTCG TAGTTCTTAG TGTGGGACAT GATGTAACTG AGAGGGAGCA GGCGGAGCAA
CGGTTGGCCT GGCTAGCTGA TCATGATCCC TTGACCGAAC TGTTTAACCG GCGCCGTTTT
CAGCATGAAT TCGAACAAAT TCTCGGGGCT TCCATTCGTT ATGGAACCCA GGGAGGCTTG
CTTTATTTTG ACCTTGATCA ATTCAAATAT ATTAACGATA CCAGTGGCCA TCAAGCCGGA
GATGCCCTGC TGCGGATGGT CGCCGACAAG TTACGCCAAG TCGTGCGGGG GAGCGATATT
GTTGCCCGGC TTGGGGGTGA TGAATTTGCA GTGGTCATTC GTGAGTGTGA CGTTGAAAGT
GCGGTCCGGG TCGCGCGAAA AGTTTGTACC CAACTGAGTA CCTTGGAATT TCCAGCCCGG
GGTGGTAATC ACTCTATCTC TCTCAGCATC GGGATTGCGC TTTTTCCCCT CCATGGCGCT
ACTGTCCGCG ATCTCATGGC CAATGCTGAT GTGGCCATGT ATCAGGCCAA AGAGGAAGGA
AGAGGGCGTT GGCATTTATT TTCGAGCGAT GAACAAGTCC GCGAACGGAT GCAGCAGCGA
GTATATTGGA AGGAGCAAAT TGAACAAGCT CTGCGGGAAG ATCGATTTCT GCTTTATTTC
CAGCCCGTGT TGGATATCCG CACTCATACG ATAGGCCATT ATGAAGTTTT GCTCCGTATG
TATGACCATA GGGGCAGGAT TATTTCTCCA GCCCAGTTTA TCCCGGTGGC CGAGCAGTCA
GGCTTGATCC ACGCTATCGA TCATCTGGTT TTGCGAAAAG CTATTGCCCA GCAAGCGAAG
TTATGGTCTC AAGGGTATCA TTTGACGCTT TCCATTAACC TTTCCGGCCG GGTAGTGGAT
GATCCCGAAT TAGTACCTAT CTTAGAAGAT TTATTAAGGA CGACCGGCGT TAATCCGTCC
TCATTGATGT TTGAGGTGAC CGAGACAGCA GCGGTTGCCG ATCTGGCCGC TGCTGAGGGC
TTTATGCACA GAATAAAAGC CCATGGCTGC CGTTTTGCCG TGGACGATTT TGGGGTGGGT
TTTTCCTCCT TCTTTTATCT CAAGCGGTTG CCTGTCGATT ACGTCAAAAT CGATGGCATG
TTTGTGCGCG AGTTAGCCAA AAGCCATCAG GACCAGGTTT TTGTCAAAGC TCTAAGCGAG
GTTGCCAAGG GCCTTGGCAA AAAAGCGGTG GCCGAATTTG TAGAAGATGC TGAGGCCTTG
GCATTACTCC ATGAATACGG AGTGGATTAT GCCCAGGGCC ATTATATTGG CCGGCCAACT
CCCCATATTG TTGAAACCCC ATGTGCTGAG GGCAAGGTAG CTTGGTCCCA CGCCCAATAG
 
Protein sequence
MFKSVAFSAK NILMPDPEQT PAPRFLSLKW KAVLVFSLVL VTINISLAGL AYLGLQRQFA 
QQREQIYLGH IKEIRGLIKT SYQRMEQLAD IVPLLRGDRA DKNSLAGKID SIFERHGHLL
QIVWGVEIAS FYSSENKLLV SWGQQVGTDR ILEWVRAANK RERPITGLDC GERCLQYVAV
PLLANDERAG VLLLGRSLAE VVLSFRQISG DDIGIMTAVK LPSENATELR RLMPEWGMRM
AALTNAERNL LVLRSLSEGH SLAEVASQHV WYHHAGREYE VRLIPINKAG VENREAQLVV
ISDVTRALAD IRLATRRSLL GGLVGLVVSE TLLLLLLWKP MARLQRVVLS LPYLAEHAFE
KVRARLSHSA QPAWGRDEID VLNDTAVTLS YQLEALQAEI QDRTRHLAER GDDLARERDF
VTGLLNTAQV IILTQDSAGR VTMLNRQGQK ITGYGADQIT GRPFYELLAG DKVSPELFQQ
LEELRTGRRG QVRVDTGLQC QNGSQRTISW FHSRLAVHPS SDVVVLSVGH DVTEREQAEQ
RLAWLADHDP LTELFNRRRF QHEFEQILGA SIRYGTQGGL LYFDLDQFKY INDTSGHQAG
DALLRMVADK LRQVVRGSDI VARLGGDEFA VVIRECDVES AVRVARKVCT QLSTLEFPAR
GGNHSISLSI GIALFPLHGA TVRDLMANAD VAMYQAKEEG RGRWHLFSSD EQVRERMQQR
VYWKEQIEQA LREDRFLLYF QPVLDIRTHT IGHYEVLLRM YDHRGRIISP AQFIPVAEQS
GLIHAIDHLV LRKAIAQQAK LWSQGYHLTL SINLSGRVVD DPELVPILED LLRTTGVNPS
SLMFEVTETA AVADLAAAEG FMHRIKAHGC RFAVDDFGVG FSSFFYLKRL PVDYVKIDGM
FVRELAKSHQ DQVFVKALSE VAKGLGKKAV AEFVEDAEAL ALLHEYGVDY AQGHYIGRPT
PHIVETPCAE GKVAWSHAQ