Gene Information |
Locus tag | Noc_0335 |
Symbol | |
ID | 3706506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 362415 |
End bp | 365354 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637736847 |
Product | PAS sensor diguanylate cyclase/phosphodiesterase |
Protein accession | YP_342391 |
Protein GI | 77163866 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain |
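The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 362415
end_bp = 365354

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (as the record states) and ends in one
# stop codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # matches the "Gene Length" field
print(protein_length, "aa")  # matches the "Protein Length" field
```

Under these assumptions the computed values agree with the listed gene length of 2940 bp and protein length of 979 aa.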
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000621491 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAAAA GCGTAGCATT TTCTGCAAAA AATATACTAA TGCCAGACCC TGAACAAACT CCTGCCCCCC GTTTTTTAAG CCTTAAGTGG AAGGCTGTTT TGGTTTTTAG TCTGGTTCTG GTGACGATTA ATATTTCCTT GGCAGGGCTT GCCTATTTAG GACTTCAGCG CCAATTTGCC CAGCAGCGCG AGCAAATTTA CCTTGGGCAT ATTAAAGAAA TTCGAGGGCT TATTAAGACT TCTTACCAGC GTATGGAACA GCTAGCTGAT ATCGTACCCC TCTTACGCGG GGATAGGGCT GATAAGAACT CTTTGGCTGG AAAAATTGAC TCCATATTTG AGCGGCATGG GCACTTGTTG CAGATAGTAT GGGGGGTGGA AATAGCGAGT TTTTACTCTT CCGAGAATAA ATTATTGGTC TCGTGGGGGC AGCAGGTCGG CACAGATCGG ATTTTGGAAT GGGTGCGTGC AGCGAATAAG AGAGAACGGC CGATTACCGG ATTAGACTGT GGAGAGCGGT GCCTCCAGTA CGTAGCAGTC CCTTTACTGG CTAATGATGA GCGCGCCGGG GTTTTGCTGC TTGGCCGCTC CCTTGCCGAG GTAGTGCTTT CTTTCCGCCA GATTTCTGGC GACGACATCG GTATTATGAC TGCTGTTAAG TTACCTTCTG AAAATGCGAC TGAACTGCGC CGACTAATGC CTGAATGGGG TATGCGGATG GCGGCCTTGA CAAATGCCGA ACGTAATTTA TTGGTGTTAC GGTCGCTTTC TGAAGGCCAT TCGCTAGCGG AGGTAGCTAG CCAGCATGTG TGGTACCACC ATGCGGGTCG AGAGTATGAA GTTCGCCTTA TACCGATAAA TAAAGCAGGG GTGGAGAATC GGGAGGCTCA GCTGGTGGTC ATTAGTGATG TGACCCGTGC TCTTGCTGAT ATTCGGCTAG CAACCCGGCG GAGCTTGCTA GGGGGTTTGG TTGGCCTGGT GGTTTCCGAA ACTCTGCTGT TACTTTTGTT GTGGAAGCCA ATGGCTCGTT TGCAGCGAGT CGTTTTAAGC CTGCCTTATT TGGCGGAGCA TGCTTTCGAG AAAGTCCGGG CGCGCCTCAG TCATTCAGCC CAGCCTGCTT GGGGCCGGGA CGAAATTGAT GTCCTTAACG ATACCGCCGT GACTCTTTCC TATCAGCTAG AGGCGTTGCA GGCAGAAATA CAAGATCGAA CCCGCCATTT AGCCGAGCGG GGTGATGATT TAGCGCGGGA GCGAGACTTT GTTACTGGGT TGTTGAATAC TGCTCAGGTT ATCATTCTAA CCCAGGATAG CGCCGGGCGA GTGACGATGT TAAATCGGCA GGGGCAGAAA ATAACAGGTT ATGGCGCTGA TCAAATCACA GGCCGGCCCT TCTATGAACT ATTGGCGGGC GACAAGGTTT CACCGGAACT ATTCCAACAG TTGGAAGAAC TCCGGACTGG CCGCCGAGGC CAGGTACGCG TGGATACGGG ACTCCAGTGC CAGAATGGCA GCCAGCGTAC TATTTCCTGG TTTCATTCTC GGTTGGCGGT TCACCCCTCC AGTGATGTCG TAGTTCTTAG TGTGGGACAT GATGTAACTG AGAGGGAGCA GGCGGAGCAA CGGTTGGCCT GGCTAGCTGA TCATGATCCC TTGACCGAAC TGTTTAACCG GCGCCGTTTT CAGCATGAAT TCGAACAAAT TCTCGGGGCT TCCATTCGTT ATGGAACCCA GGGAGGCTTG CTTTATTTTG ACCTTGATCA ATTCAAATAT ATTAACGATA CCAGTGGCCA TCAAGCCGGA 
GATGCCCTGC TGCGGATGGT CGCCGACAAG TTACGCCAAG TCGTGCGGGG GAGCGATATT GTTGCCCGGC TTGGGGGTGA TGAATTTGCA GTGGTCATTC GTGAGTGTGA CGTTGAAAGT GCGGTCCGGG TCGCGCGAAA AGTTTGTACC CAACTGAGTA CCTTGGAATT TCCAGCCCGG GGTGGTAATC ACTCTATCTC TCTCAGCATC GGGATTGCGC TTTTTCCCCT CCATGGCGCT ACTGTCCGCG ATCTCATGGC CAATGCTGAT GTGGCCATGT ATCAGGCCAA AGAGGAAGGA AGAGGGCGTT GGCATTTATT TTCGAGCGAT GAACAAGTCC GCGAACGGAT GCAGCAGCGA GTATATTGGA AGGAGCAAAT TGAACAAGCT CTGCGGGAAG ATCGATTTCT GCTTTATTTC CAGCCCGTGT TGGATATCCG CACTCATACG ATAGGCCATT ATGAAGTTTT GCTCCGTATG TATGACCATA GGGGCAGGAT TATTTCTCCA GCCCAGTTTA TCCCGGTGGC CGAGCAGTCA GGCTTGATCC ACGCTATCGA TCATCTGGTT TTGCGAAAAG CTATTGCCCA GCAAGCGAAG TTATGGTCTC AAGGGTATCA TTTGACGCTT TCCATTAACC TTTCCGGCCG GGTAGTGGAT GATCCCGAAT TAGTACCTAT CTTAGAAGAT TTATTAAGGA CGACCGGCGT TAATCCGTCC TCATTGATGT TTGAGGTGAC CGAGACAGCA GCGGTTGCCG ATCTGGCCGC TGCTGAGGGC TTTATGCACA GAATAAAAGC CCATGGCTGC CGTTTTGCCG TGGACGATTT TGGGGTGGGT TTTTCCTCCT TCTTTTATCT CAAGCGGTTG CCTGTCGATT ACGTCAAAAT CGATGGCATG TTTGTGCGCG AGTTAGCCAA AAGCCATCAG GACCAGGTTT TTGTCAAAGC TCTAAGCGAG GTTGCCAAGG GCCTTGGCAA AAAAGCGGTG GCCGAATTTG TAGAAGATGC TGAGGCCTTG GCATTACTCC ATGAATACGG AGTGGATTAT GCCCAGGGCC ATTATATTGG CCGGCCAACT CCCCATATTG TTGAAACCCC ATGTGCTGAG GGCAAGGTAG CTTGGTCCCA CGCCCAATAG
|
Protein sequence | MFKSVAFSAK NILMPDPEQT PAPRFLSLKW KAVLVFSLVL VTINISLAGL AYLGLQRQFA QQREQIYLGH IKEIRGLIKT SYQRMEQLAD IVPLLRGDRA DKNSLAGKID SIFERHGHLL QIVWGVEIAS FYSSENKLLV SWGQQVGTDR ILEWVRAANK RERPITGLDC GERCLQYVAV PLLANDERAG VLLLGRSLAE VVLSFRQISG DDIGIMTAVK LPSENATELR RLMPEWGMRM AALTNAERNL LVLRSLSEGH SLAEVASQHV WYHHAGREYE VRLIPINKAG VENREAQLVV ISDVTRALAD IRLATRRSLL GGLVGLVVSE TLLLLLLWKP MARLQRVVLS LPYLAEHAFE KVRARLSHSA QPAWGRDEID VLNDTAVTLS YQLEALQAEI QDRTRHLAER GDDLARERDF VTGLLNTAQV IILTQDSAGR VTMLNRQGQK ITGYGADQIT GRPFYELLAG DKVSPELFQQ LEELRTGRRG QVRVDTGLQC QNGSQRTISW FHSRLAVHPS SDVVVLSVGH DVTEREQAEQ RLAWLADHDP LTELFNRRRF QHEFEQILGA SIRYGTQGGL LYFDLDQFKY INDTSGHQAG DALLRMVADK LRQVVRGSDI VARLGGDEFA VVIRECDVES AVRVARKVCT QLSTLEFPAR GGNHSISLSI GIALFPLHGA TVRDLMANAD VAMYQAKEEG RGRWHLFSSD EQVRERMQQR VYWKEQIEQA LREDRFLLYF QPVLDIRTHT IGHYEVLLRM YDHRGRIISP AQFIPVAEQS GLIHAIDHLV LRKAIAQQAK LWSQGYHLTL SINLSGRVVD DPELVPILED LLRTTGVNPS SLMFEVTETA AVADLAAAEG FMHRIKAHGC RFAVDDFGVG FSSFFYLKRL PVDYVKIDGM FVRELAKSHQ DQVFVKALSE VAKGLGKKAV AEFVEDAEAL ALLHEYGVDY AQGHYIGRPT PHIVETPCAE GKVAWSHAQ
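The relationship between the gene sequence and the protein sequence can be illustrated by translating the first codons. This is a minimal sketch, not the database's own method: `CODON_TABLE` and `translate` are hypothetical names, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model; the gene here begins with ATG, so that does not matter for this example).

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; stop codons become '*'.

    Whitespace is ignored so the 10-base groups used in the record
    can be pasted in directly. Trailing bases that do not fill a
    codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listed above.
first_bases = "ATGTTCAAAA GCGTAGCATT TTCTGCAAAA"
print(translate(first_bases))
```

The result can be compared with the first ten residues of the listed protein sequence, MFKSVAFSAK. Applying the same function to the full gene sequence would yield the protein followed by `*` for the terminal TAG stop codon.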