Gene Noc_1700 details


Gene Information

Locus tag: Noc_1700
Symbol:
ID: 3705613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1902839
End bp: 1904500
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 47%
IMG OID: 637738181
Product: PAS sensor, signal transduction histidine kinase
Protein accession: YP_343702
Protein GI: 77165177
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
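
The coordinate and length fields above agree with each other. The following is a minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1902839
end_bp = 1904500

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1662
print(protein_length)  # 553
```

Both results match the Gene Length (1662 bp) and Protein Length (553 aa) fields.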


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.012241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGGCTA ACTCAAACAT CGTCCCAAAA ATCAATCTAC TCCTTATCGA AGATAATCCA 
GGTGATGTCC GGCTGGTACA ACTTGCCCTA CAAGAGGTAA CCGGGGTTAA GTTCGAGACT
ACGGCGGTAG AGCGACTCAG CCAAGCGCTG TCCTTGCTCG AGCATCAGCA ATTTGATGCC
ATTTTACTAG ATTTAACCCT ACCTGACTGT CACGGTCTCC ATACTTTTAG CCAGGTAAAA
GAGGCTGCAC CTCAGCTACC CATCGTGGTG CTAAGCGGTT TATCTAATGA AGAACTAGCC
ATCGAGGCAG TAAAACTCGG AGCCCAGGAT TATTTAGTCA AAGGTCAAAG TAATAATCAG
CTGGCGGGGC GAGCATTACG TTATGCGGTG GAAAGAAAGC AAACTGAAAC TGTATTGCGC
CAAGCTCGAG ATGAATTGGA GCAACGCATC GCTGAGCGAA CAGCCCATCT GAAGAAGGCC
AATTGCCAGC TTCAGCAAGA AATCCTCCAG CGGAAACGCA CTGAAGCATT GCTCCGAAAA
GAACGGGACT TTAGCTCTAT GATCTTGGAT ACTGCCGATG TCCTGGTGGT AATTCTCAAC
AGTCAGGGCC AGATTGTTCG CCTTAACCGA GCCTTCCAGA AAATTAGCGG ATATCCTTCC
GAGGAAGCGC AAGGACGGTA TCTTTGGGAG CTTACCTATT TTCCAGACCA AATAAAAAAA
GACACCAGGG AAAAACTCAA ACAATGGCAA ACACCAAACA CTCCCAAGAA GCATGAAAGC
TATTGGCAAG CGAAAACGGG GGAGCGATAC CTGATCGCTT GGTCAAGCAC AGCACTATTC
GCCCCCAGTG GGGCTCTAGA TTATGTTATT TATACTGGTA TCGATATTAC CGAGCAAAGA
CAGGCTGAAG ATCTTGCTCG GCAGCGTTTG CTTGAGCTAG CTCATATCTC CCGTTTAAGC
ACCCTCGGGG AAATGGCTGC CCAGATTGCC CACGAACTCA ACCAACCCCT AGGAGCCATC
ACTACGTATA GCGATATTTG CTTGCGCACT CTTGAGCCAC AAACCTCAAA ACATCAACCA
CTTCGCGATA TATTGGAGGA AATCGCAACC CAAGCCGAGC GAGGAGGAAA AATTATTCGT
CATCTTCGTA ACCTTATCCA CAAAAAAGAG CAACGATGGG CTTCTCTGGC AATCAACGAA
CTCATTCGTG AAACCGTTGG TATCATGCAG GCTGAAGCAC GGTGGCAAAA CATTACGATT
AAACTCGACT TGCAAACGTC ACTTCCTTCT ATTACCGCTG ATAGCCTTCT ACTCCAACAG
GTATTTCTCA ATCTGATGCG TAATGCCTTC GATGCCATGA TAGCGAACCC CTGCAACGGT
GATCGGCAAA TCAGAATTAA AACGTCATGG ATAAAAAAAA CCGCTATTGA AATTCAAATC
CAAGATACAG GGCCGGGGCT ACCAGATAAC CTCAAGCAGA AAATATTTGA GCCTTTTTTC
ACGACTAAAA CGGAAGGTAT GGGAATGGGA TTGCCGATTT GTCAATCCAT TATCGAAGCC
CATGGAGGTT GGCTTTTAGC GACTGATAAT AAGCACGGTG GTGCTGTATT TCAGCTTAGG
CTGCCAATTA TCTCCCCAAA GAATACTCTT CATGGCAGCT AA
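
The GC content field in the Gene Information section is presumably the percentage of G and C bases in the gene sequence. The sketch below shows one way to compute such a percentage from a sequence laid out in space-separated blocks like the one above. The function name `gc_content` is hypothetical, and the example input is only the first 10-base block of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to lay the sequence out in blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above (illustration only).
first_block = "ATGTTGGCTA"
print(gc_content(first_block))  # 40.0
```

Passing the full gene sequence as a single string, blocks and line breaks included, would work the same way because all whitespace is removed before counting.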
 
Protein sequence
MLANSNIVPK INLLLIEDNP GDVRLVQLAL QEVTGVKFET TAVERLSQAL SLLEHQQFDA 
ILLDLTLPDC HGLHTFSQVK EAAPQLPIVV LSGLSNEELA IEAVKLGAQD YLVKGQSNNQ
LAGRALRYAV ERKQTETVLR QARDELEQRI AERTAHLKKA NCQLQQEILQ RKRTEALLRK
ERDFSSMILD TADVLVVILN SQGQIVRLNR AFQKISGYPS EEAQGRYLWE LTYFPDQIKK
DTREKLKQWQ TPNTPKKHES YWQAKTGERY LIAWSSTALF APSGALDYVI YTGIDITEQR
QAEDLARQRL LELAHISRLS TLGEMAAQIA HELNQPLGAI TTYSDICLRT LEPQTSKHQP
LRDILEEIAT QAERGGKIIR HLRNLIHKKE QRWASLAINE LIRETVGIMQ AEARWQNITI
KLDLQTSLPS ITADSLLLQQ VFLNLMRNAF DAMIANPCNG DRQIRIKTSW IKKTAIEIQI
QDTGPGLPDN LKQKIFEPFF TTKTEGMGMG LPICQSIIEA HGGWLLATDN KHGGAVFQLR
LPIISPKNTL HGS