Gene Information |
Locus tag | Noc_1700 |
Symbol | |
ID | 3705613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1902839 |
End bp | 1904500 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637738181 |
Product | PAS sensor, signal transduction histidine kinase |
Protein accession | YP_343702 |
Protein GI | 77165177 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
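The coordinate, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check that assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length // 3 - 1


# Values taken from the record: Start bp 1902839, End bp 1904500.
length = gene_length_bp(1902839, 1904500)
print(length, protein_length_aa(length))  # 1662 553
```

Both results agree with the Gene Length (1662 bp) and Protein Length (553 aa) rows of the record.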
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.012241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGGCTA ACTCAAACAT CGTCCCAAAA ATCAATCTAC TCCTTATCGA AGATAATCCA GGTGATGTCC GGCTGGTACA ACTTGCCCTA CAAGAGGTAA CCGGGGTTAA GTTCGAGACT ACGGCGGTAG AGCGACTCAG CCAAGCGCTG TCCTTGCTCG AGCATCAGCA ATTTGATGCC ATTTTACTAG ATTTAACCCT ACCTGACTGT CACGGTCTCC ATACTTTTAG CCAGGTAAAA GAGGCTGCAC CTCAGCTACC CATCGTGGTG CTAAGCGGTT TATCTAATGA AGAACTAGCC ATCGAGGCAG TAAAACTCGG AGCCCAGGAT TATTTAGTCA AAGGTCAAAG TAATAATCAG CTGGCGGGGC GAGCATTACG TTATGCGGTG GAAAGAAAGC AAACTGAAAC TGTATTGCGC CAAGCTCGAG ATGAATTGGA GCAACGCATC GCTGAGCGAA CAGCCCATCT GAAGAAGGCC AATTGCCAGC TTCAGCAAGA AATCCTCCAG CGGAAACGCA CTGAAGCATT GCTCCGAAAA GAACGGGACT TTAGCTCTAT GATCTTGGAT ACTGCCGATG TCCTGGTGGT AATTCTCAAC AGTCAGGGCC AGATTGTTCG CCTTAACCGA GCCTTCCAGA AAATTAGCGG ATATCCTTCC GAGGAAGCGC AAGGACGGTA TCTTTGGGAG CTTACCTATT TTCCAGACCA AATAAAAAAA GACACCAGGG AAAAACTCAA ACAATGGCAA ACACCAAACA CTCCCAAGAA GCATGAAAGC TATTGGCAAG CGAAAACGGG GGAGCGATAC CTGATCGCTT GGTCAAGCAC AGCACTATTC GCCCCCAGTG GGGCTCTAGA TTATGTTATT TATACTGGTA TCGATATTAC CGAGCAAAGA CAGGCTGAAG ATCTTGCTCG GCAGCGTTTG CTTGAGCTAG CTCATATCTC CCGTTTAAGC ACCCTCGGGG AAATGGCTGC CCAGATTGCC CACGAACTCA ACCAACCCCT AGGAGCCATC ACTACGTATA GCGATATTTG CTTGCGCACT CTTGAGCCAC AAACCTCAAA ACATCAACCA CTTCGCGATA TATTGGAGGA AATCGCAACC CAAGCCGAGC GAGGAGGAAA AATTATTCGT CATCTTCGTA ACCTTATCCA CAAAAAAGAG CAACGATGGG CTTCTCTGGC AATCAACGAA CTCATTCGTG AAACCGTTGG TATCATGCAG GCTGAAGCAC GGTGGCAAAA CATTACGATT AAACTCGACT TGCAAACGTC ACTTCCTTCT ATTACCGCTG ATAGCCTTCT ACTCCAACAG GTATTTCTCA ATCTGATGCG TAATGCCTTC GATGCCATGA TAGCGAACCC CTGCAACGGT GATCGGCAAA TCAGAATTAA AACGTCATGG ATAAAAAAAA CCGCTATTGA AATTCAAATC CAAGATACAG GGCCGGGGCT ACCAGATAAC CTCAAGCAGA AAATATTTGA GCCTTTTTTC ACGACTAAAA CGGAAGGTAT GGGAATGGGA TTGCCGATTT GTCAATCCAT TATCGAAGCC CATGGAGGTT GGCTTTTAGC GACTGATAAT AAGCACGGTG GTGCTGTATT TCAGCTTAGG CTGCCAATTA TCTCCCCAAA GAATACTCTT CATGGCAGCT AA
|
Protein sequence | MLANSNIVPK INLLLIEDNP GDVRLVQLAL QEVTGVKFET TAVERLSQAL SLLEHQQFDA ILLDLTLPDC HGLHTFSQVK EAAPQLPIVV LSGLSNEELA IEAVKLGAQD YLVKGQSNNQ LAGRALRYAV ERKQTETVLR QARDELEQRI AERTAHLKKA NCQLQQEILQ RKRTEALLRK ERDFSSMILD TADVLVVILN SQGQIVRLNR AFQKISGYPS EEAQGRYLWE LTYFPDQIKK DTREKLKQWQ TPNTPKKHES YWQAKTGERY LIAWSSTALF APSGALDYVI YTGIDITEQR QAEDLARQRL LELAHISRLS TLGEMAAQIA HELNQPLGAI TTYSDICLRT LEPQTSKHQP LRDILEEIAT QAERGGKIIR HLRNLIHKKE QRWASLAINE LIRETVGIMQ AEARWQNITI KLDLQTSLPS ITADSLLLQQ VFLNLMRNAF DAMIANPCNG DRQIRIKTSW IKKTAIEIQI QDTGPGLPDN LKQKIFEPFF TTKTEGMGMG LPICQSIIEA HGGWLLATDN KHGGAVFQLR LPIISPKNTL HGS
|
| |
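The sequence fields above are printed in space-separated groups of ten characters. The sketch below shows one way to ungroup such a field, compute a GC fraction, and translate a coding sequence. It assumes the amino acid assignments of translation table 11 match the standard code (the two tables differ only in their permitted start codons), and it does not handle alternative start codons or ambiguous bases. All names are hypothetical.

```python
# Codon table built from the NCBI-style layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq_field: str) -> str:
    """Remove the whitespace between ten-character groups and upper-case."""
    return "".join(seq_field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three groups of the gene sequence field in this record.
first_30 = ungroup("ATGTTGGCTA ACTCAAACAT CGTCCCAAAA")
print(translate(first_30))  # MLANSNIVPK
```

The translated prefix matches the first group of the protein sequence field. Applying `gc_fraction` to the full ungrouped gene sequence gives a value that can be compared with the GC content row.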