Gene Noc_0891 details

Gene Information

Locus tag: Noc_0891
Symbol:
ID: 3707218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 971206
End bp: 972720
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 49%
IMG OID: 637737394
Product: transcriptional regulator
Protein accession: YP_342933
Protein GI: 77164408
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
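
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 971206
end_bp = 972720

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1515
print(protein_length_aa)  # 504
```

Under these assumptions the computed values agree with the listed Gene Length (1515 bp) and Protein Length (504 aa).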


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00132013
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTAC CGCTAAAGCT AGAACGCCAG AGCAAACAAA CCTTACAAAA CCAGCTTTTC 
GAACAAATTC GCAGCTTGAT TTTGAGCGGT AAACTGAAGC CTGGTACACC CATGCCTGCC
ACCCGCTCTT TAAGCGAGCA GCTGGGAGTT TCCCGTAATA CGGTACTCTT GGCATATGAT
CGCCTCATCG CCGAGGATTA TCTTCAAACT CAAGAAGCAG TAGGCACCTA TGTCAACTCC
TACTTACCAC CAGACTCTCT TGTCCTCAAA GCTCCCACAC AACCACTTGT ACTACCTGAA
AAGCCTCAAT CAAGGCGGCA TCCCATATTA TTTCGAGGCC GGGCTCAAAA GGTAGCTAAT
TCCCAGCGAA GCCGCCTCGC AGTAGATTTT CGAGTGGGGC GTTTGGATCC TCACTCTTTC
CCTATTAAAA TCTGGCGTCG GCTAATTTTA CGCCATCTGG GCGCAGGCGG AGCTAATCTA
ACAGAATATC GCAATCCCAT TGGGATTCTA GCCCTACGAG AAGCAATAGC CAACCACCTA
GGACCTGCTA GAGGTATTGC TGTTACTCCA GAACAAATTA TTGTGGTCAG CGGCAGCCAA
CAAGCCCTAA ACATCGTCGC CCGTTTATTG ATAGCTCAAG GAACCCGGGT GGTCACAGAA
TGCCCCTGCT ACCAGGGAGC TGCTTATGTA TTTGAAAGCT ATGGCGCCCA ACTCCATCCT
GTGCCAGTGG ATCAATATGG ACTACAGGTC TCAAAACTTC CTCTTGCGCC CGTGAGTTTG
GCTTATGTTA CCCCCTCCCA CCAATATCCT ATGGGGTCAA CCCTTTCCCT AAAACGCCGC
GTTCAATTAT TAGACTGGGC TGGACAAGTC GGTGCGTACT TGATTGAGGA TGATTACGAT
AGCGACTTCC GGCATAATGG CTCCCCATTG ACAGCTTTAG CAGGATTGGA TCCCTATGAC
TGTGTGATTT ATATGGGAAC GGTATCAAAA TCGATTGGTG CCGGACTTCG TCTTGGCTAT
GTCTTAGTTC CAGAGGAGTT AATGGAACCT GCAAAGACAG TCAAGGCCTT ACTAGACAAC
GGTAATCCTT GGCTTGATCA AGCAATTCTA GCCGATTTCA TCTCTGGCGG TGGCTACGCC
AAGCACCTGC GGCAAATACG GCGAATGTAT CTCCGTCGCC GTGACTGCCT AATAGCCGCC
TTAAAATACC ATTTTGGAGA GGTTAAACTA TCGGGATTAG AGGGAGGAAT GCATATTGTT
TGGCATCTGC CCCCCGATTT CCCTACAGCC ATCGAAATGC AAGCAATTGC TCGGGAAACA
GGAGTTGGGA TGTATGCCTT AGAGAGCGGA GGCGCCTATG ATTACGGTTA TAAGGAATAC
AGTGAACGCA CCCTCCTCCT TGGTTATTCC TCTCTTCCCG AAACCCAAAT TCGTGCAGGA
ATTGCCAAAG TAGCAGCGGC GTTTTTAAAG GTGCTAGGCA ACCCCCCAGT AAAATCCAAA
CTGGCTTCTA GCTAA
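
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last lines of the sequence are used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAATTAC CGCTAAAGCT AGAACGCCAG AGCAAACAAA CCTTACAAAA CCAGCTTTTC"
last_line = "CTGGCTTCTA GCTAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 15 TAA
```

Passing the full gene sequence through the same two helpers would let a reader compare the results with the Gene Length and GC content fields of the record.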
 
Protein sequence
MQLPLKLERQ SKQTLQNQLF EQIRSLILSG KLKPGTPMPA TRSLSEQLGV SRNTVLLAYD 
RLIAEDYLQT QEAVGTYVNS YLPPDSLVLK APTQPLVLPE KPQSRRHPIL FRGRAQKVAN
SQRSRLAVDF RVGRLDPHSF PIKIWRRLIL RHLGAGGANL TEYRNPIGIL ALREAIANHL
GPARGIAVTP EQIIVVSGSQ QALNIVARLL IAQGTRVVTE CPCYQGAAYV FESYGAQLHP
VPVDQYGLQV SKLPLAPVSL AYVTPSHQYP MGSTLSLKRR VQLLDWAGQV GAYLIEDDYD
SDFRHNGSPL TALAGLDPYD CVIYMGTVSK SIGAGLRLGY VLVPEELMEP AKTVKALLDN
GNPWLDQAIL ADFISGGGYA KHLRQIRRMY LRRRDCLIAA LKYHFGEVKL SGLEGGMHIV
WHLPPDFPTA IEMQAIARET GVGMYALESG GAYDYGYKEY SERTLLLGYS SLPETQIRAG
IAKVAAAFLK VLGNPPVKSK LASS
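
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the tool used to produce this record: `translate` is a hypothetical helper, and the codon table is built by hand on the understanding that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). Only the first and last lines of the gene sequence are translated.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, trailing partial codons are dropped."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    # A codon containing a character other than A, C, G or T raises KeyError.
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCAATTAC CGCTAAAGCT AGAACGCCAG AGCAAACAAA CCTTACAAAA CCAGCTTTTC"
last_line = "CTGGCTTCTA GCTAA"

print(translate(first_line))  # MQLPLKLERQSKQTLQNQLF
print(translate(last_line))   # LASS*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final four residues followed by the stop codon.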