Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0891 |
Symbol | |
ID | 3707218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 971206 |
End bp | 972720 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637737394 |
Product | transcriptional regulator |
Protein accession | YP_342933 |
Protein GI | 77164408 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00132013 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTAC CGCTAAAGCT AGAACGCCAG AGCAAACAAA CCTTACAAAA CCAGCTTTTC GAACAAATTC GCAGCTTGAT TTTGAGCGGT AAACTGAAGC CTGGTACACC CATGCCTGCC ACCCGCTCTT TAAGCGAGCA GCTGGGAGTT TCCCGTAATA CGGTACTCTT GGCATATGAT CGCCTCATCG CCGAGGATTA TCTTCAAACT CAAGAAGCAG TAGGCACCTA TGTCAACTCC TACTTACCAC CAGACTCTCT TGTCCTCAAA GCTCCCACAC AACCACTTGT ACTACCTGAA AAGCCTCAAT CAAGGCGGCA TCCCATATTA TTTCGAGGCC GGGCTCAAAA GGTAGCTAAT TCCCAGCGAA GCCGCCTCGC AGTAGATTTT CGAGTGGGGC GTTTGGATCC TCACTCTTTC CCTATTAAAA TCTGGCGTCG GCTAATTTTA CGCCATCTGG GCGCAGGCGG AGCTAATCTA ACAGAATATC GCAATCCCAT TGGGATTCTA GCCCTACGAG AAGCAATAGC CAACCACCTA GGACCTGCTA GAGGTATTGC TGTTACTCCA GAACAAATTA TTGTGGTCAG CGGCAGCCAA CAAGCCCTAA ACATCGTCGC CCGTTTATTG ATAGCTCAAG GAACCCGGGT GGTCACAGAA TGCCCCTGCT ACCAGGGAGC TGCTTATGTA TTTGAAAGCT ATGGCGCCCA ACTCCATCCT GTGCCAGTGG ATCAATATGG ACTACAGGTC TCAAAACTTC CTCTTGCGCC CGTGAGTTTG GCTTATGTTA CCCCCTCCCA CCAATATCCT ATGGGGTCAA CCCTTTCCCT AAAACGCCGC GTTCAATTAT TAGACTGGGC TGGACAAGTC GGTGCGTACT TGATTGAGGA TGATTACGAT AGCGACTTCC GGCATAATGG CTCCCCATTG ACAGCTTTAG CAGGATTGGA TCCCTATGAC TGTGTGATTT ATATGGGAAC GGTATCAAAA TCGATTGGTG CCGGACTTCG TCTTGGCTAT GTCTTAGTTC CAGAGGAGTT AATGGAACCT GCAAAGACAG TCAAGGCCTT ACTAGACAAC GGTAATCCTT GGCTTGATCA AGCAATTCTA GCCGATTTCA TCTCTGGCGG TGGCTACGCC AAGCACCTGC GGCAAATACG GCGAATGTAT CTCCGTCGCC GTGACTGCCT AATAGCCGCC TTAAAATACC ATTTTGGAGA GGTTAAACTA TCGGGATTAG AGGGAGGAAT GCATATTGTT TGGCATCTGC CCCCCGATTT CCCTACAGCC ATCGAAATGC AAGCAATTGC TCGGGAAACA GGAGTTGGGA TGTATGCCTT AGAGAGCGGA GGCGCCTATG ATTACGGTTA TAAGGAATAC AGTGAACGCA CCCTCCTCCT TGGTTATTCC TCTCTTCCCG AAACCCAAAT TCGTGCAGGA ATTGCCAAAG TAGCAGCGGC GTTTTTAAAG GTGCTAGGCA ACCCCCCAGT AAAATCCAAA CTGGCTTCTA GCTAA
|
Protein sequence | MQLPLKLERQ SKQTLQNQLF EQIRSLILSG KLKPGTPMPA TRSLSEQLGV SRNTVLLAYD RLIAEDYLQT QEAVGTYVNS YLPPDSLVLK APTQPLVLPE KPQSRRHPIL FRGRAQKVAN SQRSRLAVDF RVGRLDPHSF PIKIWRRLIL RHLGAGGANL TEYRNPIGIL ALREAIANHL GPARGIAVTP EQIIVVSGSQ QALNIVARLL IAQGTRVVTE CPCYQGAAYV FESYGAQLHP VPVDQYGLQV SKLPLAPVSL AYVTPSHQYP MGSTLSLKRR VQLLDWAGQV GAYLIEDDYD SDFRHNGSPL TALAGLDPYD CVIYMGTVSK SIGAGLRLGY VLVPEELMEP AKTVKALLDN GNPWLDQAIL ADFISGGGYA KHLRQIRRMY LRRRDCLIAA LKYHFGEVKL SGLEGGMHIV WHLPPDFPTA IEMQAIARET GVGMYALESG GAYDYGYKEY SERTLLLGYS SLPETQIRAG IAKVAAAFLK VLGNPPVKSK LASS
|
| |