Gene Noc_0722 details

Gene Information

Locus tag: Noc_0722
Symbol:
ID: 3706988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 780027
End bp: 781685
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 52%
IMG OID: 637737225
Product: signal transduction histidine kinase
Protein accession: YP_342766
Protein GI: 77164241
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
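
The length fields above follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(780027, 781685)
print(length)                  # 1659
print(protein_length(length))  # 552
```

Both values agree with the Gene Length and Protein Length fields of this record.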


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00200947
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATTGGC AAAAAGCGAT AAAGGATAAA AGAATCATCG AGATTGATGG CCAGGATATA 
GCGGAAGGAC CTCTTCCTCC TTGGCGGCCG CTTGCCGTTT TTTGTCTCTA TCGCTTGTTA
ATTGTCTCCT TGCTTTTGGT GGCCGTCATT ACTGGTGCTG GCCCCGGTTT TCTGGGCGAA
TCTCACCCTA ATCTATTCTT GATTACTAGT CTGGTTTATG CCGCCGCCGC CATTGCCCTA
GGTGTTGCCA CGATAGCTCG GATGGGCGGA TTTTGCTTTC AAGTGTGGTT CCAGTTAACC
TTGGATATTG GCGCTATTAC GCTTTTGATG CATGCCAGTG GCGGTGTGCT CAGCGGCCTG
GGAATGCTGC TGGTGGTAGT GATTGCCGCC GGGGGTATTT TAACGGTGGG GCGGACCGCT
AGCGCCTTTG CTGCCCTGGC CACATTGGCG GTGTTGCTTG AACAGTCTCA TGCTCTCGTA
TTCCGTGATT TCGATACCGT TCACTATACT CAGGCGGGTT TATTAGGCGC CACCCTATTT
GCCACGGCTC TACTAGCCCA GGTGCTGACA GCGCGGATTC GAGAGAGTGA GGCCCTCGCG
GCCCAGCGGA GCTTGGATCT GGCAAATATC AGCCAGCTCA ATGGGTACAT TATCCAGCAC
CTACAGTCTG GAGTCCTTGT GATAGATAGG GAGGATACCC TGCGGCTCAT TAATCAGGCT
GGGCGAGCAC TATTAGGGTT GAAGCCGGGA GGAGAAAAAA AATCGTTGGA CCGGATAGCC
CCATGCCTGG CTAAACAGCT TAATTGCTGG CGAGAAGGAC TACGCTCTCA TCAGCCTGAA
GCTTTCCGGT CACGTTGGGG ACAATCCGAG ATATTGTCTA AATTCATCAG CCTGGGGCCC
CGCTCAGGCA CACTGATTTT TTTAGAGGAC GCTTCGGCTT CGGCTCGCCA AGCTCAGCAA
TTGAAGCTAG CTTCCTTAGG ACGGCTTACA GCGAGCATTG CCCATGAAAT CCGTAATCCT
CTGGGCGCCA TCAGTCATGC CAGTCAACTT CTAAGAGAAT CTTCCACCCT AAGTCAAAGT
GAGCAGCGGC TGTTGGAAAT TATTCTTAAT CACTGCACCG GGGTAAACGG GATCGTTAAG
AATGTCTTGC AGTTGAGCCG CCGGCAGCAG CATAGCCGTC TAAAAGTGTT AGCCCTCAAG
CCATGGTTGC TGGATTTTCT GGATGAATTT TGCCGTACCC AAGGAATTGA CCGGACAGAG
GTGGCGCTCC AAATTCGCTC CGGCACAGTT CAAGTTCATA TGGACCCTTC TCAGTTTCAC
CAGATATTGT GGAATCTTTG CGATAACGCT CGGCGTCATT CCCGAAGCTT GGGCCGTATT
CCTTGTTTTC AAATTTCAGT AGAGAGCGCA GTGGATATGG GCCAAGTCTT TTTGGAGGTC
CTGGATAGGG GATCCGGTAT TCCCAACGAT ATAGCCGATA AAATATTCGA GCCCTTTTTC
ACAACCCAAG GCACAGGTAC CGGGCTAGGT CTTTATTTGG CCCGCGAACT ATCGGAATGT
AATGGTGCCA GTCTGGAGTA TCGTCCAGCG CCTGGCGGAG GAAGCTGTTT TCGGCTTTGC
TTTGCCCCCC TGGGCATGAT GGGGGCGAAT GTGGCATGA
 
Protein sequence
MNWQKAIKDK RIIEIDGQDI AEGPLPPWRP LAVFCLYRLL IVSLLLVAVI TGAGPGFLGE 
SHPNLFLITS LVYAAAAIAL GVATIARMGG FCFQVWFQLT LDIGAITLLM HASGGVLSGL
GMLLVVVIAA GGILTVGRTA SAFAALATLA VLLEQSHALV FRDFDTVHYT QAGLLGATLF
ATALLAQVLT ARIRESEALA AQRSLDLANI SQLNGYIIQH LQSGVLVIDR EDTLRLINQA
GRALLGLKPG GEKKSLDRIA PCLAKQLNCW REGLRSHQPE AFRSRWGQSE ILSKFISLGP
RSGTLIFLED ASASARQAQQ LKLASLGRLT ASIAHEIRNP LGAISHASQL LRESSTLSQS
EQRLLEIILN HCTGVNGIVK NVLQLSRRQQ HSRLKVLALK PWLLDFLDEF CRTQGIDRTE
VALQIRSGTV QVHMDPSQFH QILWNLCDNA RRHSRSLGRI PCFQISVESA VDMGQVFLEV
LDRGSGIPND IADKIFEPFF TTQGTGTGLG LYLARELSEC NGASLEYRPA PGGGSCFRLC
FAPLGMMGAN VA
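
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned and sanity-checked before further use. The helper names are hypothetical. The start and stop codon sets are those of NCBI translation table 11, the table listed for this gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spaces and line breaks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, the first codon is a table 11
    start codon, and the last codon is a stop codon."""
    starts = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
    stops = {"TAA", "TAG", "TGA"}
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# First displayed row of the gene sequence above.
first_row = "GTGAATTGGC AAAAAGCGAT AAAGGATAAA AGAATCATCG AGATTGATGG CCAGGATATA"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field. The gene starts with GTG, an alternative start codon in table 11 that is translated as methionine at the initiation position, which is why the protein sequence begins with M.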