Gene Noc_1756 details


Gene Information

Locus tag: Noc_1756
Symbol:
ID: 3704773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1975322
End bp: 1977478
Gene Length: 2157 bp
Protein Length: 718 aa
Translation table: 11
GC content: 50%
IMG OID: 637738239
Product: PAS sensor signal transduction histidine kinase
Protein accession: YP_343758
Protein GI: 77165233
COG category: [T] Signal transduction mechanisms
COG ID: [COG2202] FOG: PAS/PAC domain
        [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID: [TIGR00229] PAS domain S-box
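
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes a single stop codon; the record does not state either convention. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(1975322, 1977478)
print(length, protein_length(length))
```

Under these assumptions the coordinates give 2157 bp and 718 aa, which agrees with the values listed in the record.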


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.977279
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACCC ACACCCTATG GAAACGGTTT GCCCTCGTTT TCTTCCCACT TGCGGGGCTG 
ACCACCTTGT CGTTCATCTC CCTTTATCAA GCGGAATTCA AAACCCTACG GGCCTTGACC
GAGCTTAGCG AGAAAGATTC TCTGCAATTA CTCAGGGAGG AAATCCGTGA TGAATTTGAC
GCCATTCAGG CCACCCTCCT ATTTTTGTCG CGGCACCAAG CCCTCAAGGA TCTTTTGGCA
GGGAGTGAAT CGGAGCAACG ACTGGCCCAA GATTACGTGG TCTTATTGGG AAGCAGAGAG
CGCTATGACC AGATCCGTCT GCTTGATTTA CAGGGCAAGG AATGGGTTCG GGTAAACTTT
AATGGAGGCC AGCCAAGCAT CGTTCCCCCA GAGGGATTAC AGAACAAGAA AGCCCGCGGC
TACTTCCAAA AAGCCTTGCA GCTTCAACCA GGAGAAATTT ACCTTTCTCC GCTTAACCTT
AATACCGAGA GGGGCATCAT CGAGCAGCCC CTAAAGCCAA CCATTCAGGC TATCACGCCG
ATCTTCGATG AGCAACGACG TAAGCAGGGA ATACTGGTGC TGAATTACCG AGGCATCCAC
CTTTTGGATA AACTTCACGC AGCCGCACAA AATACTAAAA AAGCACTTTG GTTAGTCAGG
GGAAGTGGCC TTTGGCTTTG GGAGTCCGGC GCCGGGCAGA AGCAAGGAGT GATCTATCTC
GAAAAAGGCA AGCAAGCATT CGCGGTTGCT TATCCTGATC TCTGGGCGCT AATCCAGAAC
AATGACGAAG GTCAATTCTA TGCCCATAAG GATCTCTTTA CTTATACCAC CATCACCCCC
ACGGCAAAAA CTTTCCCCCC CATAGCAAAA CTTCATTCCT CTATCCACTG GAAGCTGATC
TCCCAGCAGT CCTTTCCTGT CCTATCTACG ATGATATCAA CGAGCCGCAT TCAGCGACTG
ATACTTCTCT ATGGTATCTT TCTTACTGTC TTTATAATCG TAGCCTGGCG GCTGGCCTAC
GTACATCAAC GCCGCGAAAG AGATACCCAG GCGGTGCGTC TCAGCGAGCT TCGCTTCCGG
GGTCTTTTTG AGGCGGTGCC CGATGGTATC GTGATGACTG ATTCCAACGG CCAAATTCTC
CTCGTGAACA AACAAGCAGA AAAAATATTT GGCTATTCCC GGGAGGAATT AGCGGGGCAA
AAAATAGAAA TCCTACTGCC AGAGCAACAT CGCAAAGCTC ACGTTATCTA TCGCCAAGGT
TATACTCGGG ATGCCATTAC GCGGCCAATG GGAGTTGGAT TGGATCTTTA CGGACGGCGC
AAGGACGGTA CCGAATTTCC AGCGGAAATT AGTTTAAGCC CCCTTAATAA AAAAGAAGTG
GATTTCTCTA TTATCAGCAC CATACGCGAT ATCAGTGAGC GAAGAAAAAC TGAAAAAAAA
GTTAAAGAAT TGAACCGGAA ACTGCAACAA AATGTGACCG AGCTGAGCAC GTTAAACGGA
GAATTGGAAG CCTTCAGCTA TTCAGTTTCC CATGATCTAC GGGCGCCCTT GCGCAGTATC
GATGGTTTCA GCCAAGCCCT GCTAGAGGAT TATGAAGATA AACTTGATAG CGAAGGGCAA
GACTATTTAC AGCGGGTACG CGCAGCAACT CAGCGAATGG GAAAGCTTAT CGATGATTTA
CTGCAACTCT CCCGTATTAG CCGGGTGGAG ATGATACCCC AGAAGGTAGA CCTTAGCTGC
TTGGCGCAAG CAATTGTCAC GACTCTACGG GCCGAGGAAC CTCAACGGCA GGTTGAGTTT
TGTATCGAGC AGGGTCTTAC CGCCTCAGGC GATTCGCACT TGCTTCAAAT TCTGCTTGAC
AACTTGCTCG GCAATGCCTG GAAATTCACT GCTCATCAGC CCCAGGCCCA GATCACGCTG
GGAATGCTAG CGAAAGAAGG CAAGCCTGTT TTCTTTGTCC AGGACAATGG CGCTGGCTTT
GATATGCGTT ATATTGACAA ACTATTCGAT CCCTTCCAAC GACTACACGG CGCCAGCGAA
TATCCTGGAA CCGGGATAGG ACTGGCTACC GCGCGGCGCA TTGTCCACCG CCATGGAGGG
CGAATTTGGG CGGAAGGACA AATTAACCAA GGGGCCACTT TTTATTTTAC GCTTTAA
 
Protein sequence
MSTHTLWKRF ALVFFPLAGL TTLSFISLYQ AEFKTLRALT ELSEKDSLQL LREEIRDEFD 
AIQATLLFLS RHQALKDLLA GSESEQRLAQ DYVVLLGSRE RYDQIRLLDL QGKEWVRVNF
NGGQPSIVPP EGLQNKKARG YFQKALQLQP GEIYLSPLNL NTERGIIEQP LKPTIQAITP
IFDEQRRKQG ILVLNYRGIH LLDKLHAAAQ NTKKALWLVR GSGLWLWESG AGQKQGVIYL
EKGKQAFAVA YPDLWALIQN NDEGQFYAHK DLFTYTTITP TAKTFPPIAK LHSSIHWKLI
SQQSFPVLST MISTSRIQRL ILLYGIFLTV FIIVAWRLAY VHQRRERDTQ AVRLSELRFR
GLFEAVPDGI VMTDSNGQIL LVNKQAEKIF GYSREELAGQ KIEILLPEQH RKAHVIYRQG
YTRDAITRPM GVGLDLYGRR KDGTEFPAEI SLSPLNKKEV DFSIISTIRD ISERRKTEKK
VKELNRKLQQ NVTELSTLNG ELEAFSYSVS HDLRAPLRSI DGFSQALLED YEDKLDSEGQ
DYLQRVRAAT QRMGKLIDDL LQLSRISRVE MIPQKVDLSC LAQAIVTTLR AEEPQRQVEF
CIEQGLTASG DSHLLQILLD NLLGNAWKFT AHQPQAQITL GMLAKEGKPV FFVQDNGAGF
DMRYIDKLFD PFQRLHGASE YPGTGIGLAT ARRIVHRHGG RIWAEGQINQ GATFYFTL
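
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to join such blocks, compute the GC fraction, and translate a coding sequence. It is an illustration only: the helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in which codons may act as start codons; alternative starts are not handled here).

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
dna = join_blocks("ATGAGTACCC ACACCCTATG GAAACGGTTT")
print(translate(dna), round(gc_fraction(dna), 2))
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence and `gc_fraction` against the GC content reported in the record.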