Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2066 |
Symbol | |
ID | 3705237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2376951 |
End bp | 2378690 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637738541 |
Product | hypothetical protein |
Protein accession | YP_344056 |
Protein GI | 77165531 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTTG AAATGAGACA AGAGGAGCTG AAAAGGCTTA TTCTCAAAGG CAAGGCGCAA GGTTTGCTGA CTGAGGAAGA GTTACAAGAC TACATTATAC AGGAAGTGGA GGACAGTGAT GAGGCTGAAA CCATGGCCCA TCTCTTTCAT CATCACCAGG GTCTCGATGA ATTTAACGAA GCCCTAGATT CCACACTATT GCTTAAAAAT AGCTCGGAGG AGGAAATAGC TGAAACACTT CCGACTTTAG AAGGAGAAAG CACGGGAGCA GTTAGTGATA TGGTCTATGC CTACATGCGG GAGATGGGCG CCCATGCACT CCTTACCCGC GAGGAAGAAA TCACCTTAGC CAAGGCGATT GAGACAGGAT TAAGCCAAAG CACCGAAGCC TTAGGTAACT GTCCCGCAGC CGTGGCTGAG CTGGTACGTT GGGCCGAAGA AGTGGCCACA GGCAAGGGCC GTTTGAAAGA CTGGCTAACT GGCTTTACTG AACCAGAAAT AGAAGAAACT GAGATAAAGG ACGGGAAAGA ATCAAGTTTG GAAGCAACGA GCAAGCATCT ATCCCGAGTC CGCACTCTTT ATGGCAACCT AGAGGCAACT TTAGCCCAAG AAGGGGTAGC AAGCCCCCGA GCACGAGAGC TGCGAAGGAA ACTTGCTCGG GAATTTCAGT TCCTGCGCCT AATTCCGTCC AGAATTAATC AGCTGACTCA ATTGGTGCAA GGCTGGATAA GCAAAGCCCA AACCCAGGAA CAAATCATAA GGTCCTGCTG TATCCATCAA GGTGAGATGT CCCAAGAGGA CTTTTGGCAC AATTTTTCCA GCGGTACCAC TCATCCCCAG TGGTTGGATG ACCTTCTAGC CCACAGTCGG ATAGACCGGC AAGGGCTGCA GACCCAAGCC AAAACGCTTC GAGCAGCCCA AGCAGTACTG CTTCAGGTGG AAGTCGAGGC GGGACTCCCT CTGGACGAGC TCAAGGCAAT TCACCAACGC CTGGTCCGAA GCCAATTTCA AGCGCAGCGG GCTAAAGCAC AAATGGTGGA GGCCAATCTG CGCCTGGTGG TCTCGGTAGC GAAGAAGTAT CGCAATCGGG GATTGACCTT TCTGGATTTG ATCCAAGAGG GGAATATCGG CTTGATGAAA GCGGTGGACA AATTCGATTA CCACCGGGGC TATAAATTCT CTACCTATGC CCATTGGTGG ATCCGGCAGG CAATTACGCG AGCTATTGAC GATCAGTCCC GAACTATCCG TATTCCGGTC CATGTGATGG AGAAACTCAG CAAACTCAAC CGGGCCTCCT ACCAGCTCCG ACAGGAAAAA GGCCGCGAAG GCCGTCCTGA GGAATTGGCC GAACGTCTCG CCCTGTCCGA GCAACAGATT CATCGTATGC ACGAGATCGC TAAACAACCC ATCTCCTTGG AAACCCCCCT AGGTAAAGAT GAGGACTCGC AATTAGGTGA ACTCATGGAA GACGAGCAAG TCCCAAATCC CATGGAGGTT GCTATCACAG CCGGACTGCA GACCGGGGCC CAGCAGTTGC TCGCAGCACT TTCTCCCCGA GAGGCCCAGG TGGTTGCCAT GCGCTTTGGG ATCGGTATGG ACACTGACCA TACCTTGGGA GAAGTGGCCC AGCAATTTGA TTTGAGCCGG GAACGAATCC GGCAAATCGA GGCTCAAGCC TTGGGTAAAC TGCGTCGCTT GGGCCACTCC AAGGCCCTGC GCAACTTTCT CGAAGACTAG
|
Protein sequence | MDFEMRQEEL KRLILKGKAQ GLLTEEELQD YIIQEVEDSD EAETMAHLFH HHQGLDEFNE ALDSTLLLKN SSEEEIAETL PTLEGESTGA VSDMVYAYMR EMGAHALLTR EEEITLAKAI ETGLSQSTEA LGNCPAAVAE LVRWAEEVAT GKGRLKDWLT GFTEPEIEET EIKDGKESSL EATSKHLSRV RTLYGNLEAT LAQEGVASPR ARELRRKLAR EFQFLRLIPS RINQLTQLVQ GWISKAQTQE QIIRSCCIHQ GEMSQEDFWH NFSSGTTHPQ WLDDLLAHSR IDRQGLQTQA KTLRAAQAVL LQVEVEAGLP LDELKAIHQR LVRSQFQAQR AKAQMVEANL RLVVSVAKKY RNRGLTFLDL IQEGNIGLMK AVDKFDYHRG YKFSTYAHWW IRQAITRAID DQSRTIRIPV HVMEKLSKLN RASYQLRQEK GREGRPEELA ERLALSEQQI HRMHEIAKQP ISLETPLGKD EDSQLGELME DEQVPNPMEV AITAGLQTGA QQLLAALSPR EAQVVAMRFG IGMDTDHTLG EVAQQFDLSR ERIRQIEAQA LGKLRRLGHS KALRNFLED
|
| |