Gene Noc_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0226 
Symbol 
ID3706281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp249384 
End bp251159 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content53% 
IMG OID637736742 
ProductPAS sensor diguanylate cyclase and phophodiesterase 
Protein accessionYP_342286 
Protein GI77163761 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGATG TCACCGCCCG TAACCGTGCC GAGGCCGCGC TGGTCGCCGA AAAAGAGCGT 
ATCCAGGTAA CCCTGGAATC CATCGGCGAC GGGGTGATTA CTACTGATGC TAATAGCCGT
ATCAACTACC TTAATCCAAC CGCCGAGGCT ATGACCGGTT GGCTATTGGC GGCTGCCCGG
GGCAAAGCAT TACCGGACGT ATTGCAAATT ATCAATGAGT CCACCCGTGA GCCGGTAGCC
GATCCTCTCG CCCCTTGCTT GACTGGCAGC TCCGCTGTGA GATCCACCAA CCCTACGGTA
CTCATCAGTC GCCACGGCGC CGAATATTCA ATTGAAATCT CCGCCGCCCC TATCCGCGAC
GCCAACAACC AGGTGCTAGG CGCGGTACTG GTATTTCACG ATGTTAGTGA ACAGCGCCGG
TTGCAGTGCG AAATTGCCCA TCAGGCGCAA CACGATGCCC TTACGGGTCT AGTCAATCGC
CGTGAGTTTG AGCGGCGCTT GCAAAGGGTA ATCGAGACCG TTCAGACGCA AAACAGCGAG
CACGCCCTAT GCTATCTCGA TCTCGATCAA TTTAAGCTTG TCAACGACAC TTGCGGGCAC
GCTTCAGGCG ATGCGTTGTT GCAACAGCTG GCGGTGCTGT TCGAAAAAAA TATCCGTCGG
CGCGATACGC TGGCGCGGCT GGGAGGCGAT GAATTTGGGC TGCTATTAGA GCACTGCTCG
TTGGACAGAG CGCTGCAAAT AGCCAATACC TTACGTCAAA CGGTTGAGGG TTTCCGTTTT
TGCTGGAATG GGCAACACTT TCGGATCGGC GTCAGTATCG GCTTGGTGCC CATCACCATT
GCTAATTCAA GCGCCGCCAG CGTCTTACAA ACAGCCGACA GCGCCTGTTA CGTTGCTAAA
GACGGCGGCC GTAATCGAAT TCACATTTAT CGTGAGCACG ATGTGGAGTT GGCCCGGCGC
CATGGTGAAA TGCAATGGGT GGCCCGTATT CAGCAGGCAT TGGAAGAGAA TCGCTTTCAA
CTGTATGCAC AACCGATAGT GCCGCTTAAG GCCACGCTAT CCGGTGGCAT ACACTGCGAA
TTGTTACTAC GGTTGGTGGA AAACGATGGT AAGATATCGC CGCCAGGCGC ATTTATGCCA
GCCGCCGAGC GCTATAATCT GGCCGTTGCG ATTGATCGCT GGGTCGTTAC CCAGGCATTG
CGCTGGCTGG CCGCCCATCC GGCGCTGCTT GATCGAATCA CGTTATGCAC TATTAATTTA
TCGGGCCACT CTATCGGTGA TCGCTTTTTC CACGCTTATG TACTGCGGCA ATTTGACGAT
ACCGGCCTGC CGGCTAAAAA AATCTGCTTC GAGATTACGG AAACAGCCGC CGTGGCTAAT
CTTGCTGATG CCACCCGGTT CATGGAGGCG CTGAAAACAC GCGGCTGTTG TTTCTCTCTT
GATGACTTTG GCAGCGGCTT GTCCTCGTTT GCTTACCTCA AGGCCCTGCC CGTTAATTTT
CTCAAAATTG ATGGTCTATT TGTTAAAGAC ATTGTCGATG ACCCCATCGA TCTGGCTATG
GTTCGTTCAA TCAACGAAAT CGGCCATCTG CTGGGGAAAA AAACCATTGC TGAATACATC
GAAAACGACG CTATCCTGGA TAAACTACGC GGCCTCGGTA TAGATTACGG GCAAGGCTAT
GGCCTTGGTC AGCCGCAACC GTTGTCCGCG CTACTTGCAA CAGTGTCTAG GCCCGCTAAT
TCCATAAAAA CGGGAGCGGA TACCCATTTC ATATAA
 
Protein sequence
MHDVTARNRA EAALVAEKER IQVTLESIGD GVITTDANSR INYLNPTAEA MTGWLLAAAR 
GKALPDVLQI INESTREPVA DPLAPCLTGS SAVRSTNPTV LISRHGAEYS IEISAAPIRD
ANNQVLGAVL VFHDVSEQRR LQCEIAHQAQ HDALTGLVNR REFERRLQRV IETVQTQNSE
HALCYLDLDQ FKLVNDTCGH ASGDALLQQL AVLFEKNIRR RDTLARLGGD EFGLLLEHCS
LDRALQIANT LRQTVEGFRF CWNGQHFRIG VSIGLVPITI ANSSAASVLQ TADSACYVAK
DGGRNRIHIY REHDVELARR HGEMQWVARI QQALEENRFQ LYAQPIVPLK ATLSGGIHCE
LLLRLVENDG KISPPGAFMP AAERYNLAVA IDRWVVTQAL RWLAAHPALL DRITLCTINL
SGHSIGDRFF HAYVLRQFDD TGLPAKKICF EITETAAVAN LADATRFMEA LKTRGCCFSL
DDFGSGLSSF AYLKALPVNF LKIDGLFVKD IVDDPIDLAM VRSINEIGHL LGKKTIAEYI
ENDAILDKLR GLGIDYGQGY GLGQPQPLSA LLATVSRPAN SIKTGADTHF I