Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0464 |
Symbol | |
ID | 3706635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 496664 |
End bp | 499618 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637736973 |
Product | hypothetical protein |
Protein accession | YP_342517 |
Protein GI | 77163992 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCCC GAACGAATAA TTTCCCCCCG GCAGCCGCGG AAACACTGGC AGCCGACCTG CGCCAGCGAA TTCAGGGCGA GGTGCGATTT GATCATGGCA GCCGCGCTTT ATACGCTACC GATGGCTCCA ACTACCGCCA AGTTCCCATC GGTGTGGTGG TTCCTCAAAA TCGGGAAGAC ATTATCGAAA CCATGGCGGT TTGCCGGGAG CACCAGGCGC CGGTCCTCGC GCGAGGGGGC GGGACCAGTT TGGCGGGGCA ATGCTGTAAC ACGGCGGTCA TCATGGATAT GTCCAAATAT TTGCGGCGGG TACTTGAACT CGATCCCGAA CGCCGCCGGG CACGGGTGGA GCCCGGCTGC GTGCTGGACG ACCTGCGCGA CGAAGCGGAA CAACACCACC TTACTTTCGG CCCCGATCCT TCAACCCACG ATCATAACAG CCTAGGGGGT ATGATCGGCA ACAACTCCTG CGGCGTTCAC TCCATCATGG CTGGCCGCAC CGCCGATAAT GTCAACGCCT TGGAAATCCT CACCTATGAT GGTCTGCGCC TGTGGGTCGG TCCTACTTCG GAAGAGAAAC TGGAACAGAT TATTCGGACG GGCGGAAGAC GAGGTGAAAT CTATTCGGGC CTGAAAGCGA TCCGGGATAA GTACGCCGAC TTGATCCGGC AACGCTACCC CAAAATACCA CGTCGAGTCT CTGGCTACAA TTTGGACGAA TTGCTTCCCG AGAACGGTTT TAACGTAGCC CGGGCTTTAG TAGTAGGCAC TGAAGGCACC TGTGTAACCG TGCTCCAAGC TGATCTGTGC CTCATTCCAA GTCCCCCTAG TCGTACCGTG GTGGTGCTCG GTTACCCCGA TGTTTACACC GCCGGCGATC ATATACCTCA AATTCTCGAG TATGCTCCCA TAGGGCTGGA AGGCATGGAC AATCTGCTGC TCAAGTACAT GAAAAAAGAG CATATGTATC CCAAGGGCCG GGCGTTGCTA CCGGAGGGAG GCGGCTGGTT GCTCGTGGAG TTTGGCGGTG AAACAAAGGC CGAGGCAGAC GAAAAAGCAA AGCGCCTGAT GCAGGCTCTT AAGCAATCGG ACAATCCACC GAATATGAAA CTCTTCGATG ATCCCGAGGA AGAAGAACGT ATTTGGAAAA TACGGAAAGC GGGCTTAGGA GCCACGGCGC ATCTACGGGG CGAGGAAGAT ACCTGGCCCG GCTGGGAGGA TGCCGCTGTC TCTCCGGAGA AAGTGGGGCC GTATCTACGG GACTTCCGTC AGTTGCTGAA ACGCTACCAT TATGACTGTT CCCTCTATGG CCATTTTGGC GATGGTTGTA TTCACGTCCG GATCGATTTC GATCTTATTA CCAAGGAAGG CATTAAAAAT TTTAAAGCCT TCACCCATGA TGCCGCGGAT CTGGTCTTGA GCTACGGCGG CTCCCTGTCG GGAGAACATG GCGATGGCCA AGCCCGCGCC GATCTGCTGC CAAAAATGTA TGGGGAGGAA TTGATCCAAG CCTTCCGAGA GTTTAAGACC CTCTGGGACC CGCTAAATCA CATGAATCCA GGCAAGGTGG TTGATCCTTA TCCACGGGAT TCCAATTTAC GGCTGGGCGC TGATTTCCGC CCCCCTACGC TCAAGACCGT CTTTGCCTTT TCCGAAGACG ACGGCAGCTT TTCCAAAGCT TCCTTACGTT GCGTGGGGGT AGGCGAATGC CGTCGTAACC ACCAGGGCGT GATGTGCCCG AGCTACATGG CCACGAAAGA GGAAATGCAT TCCACCCGTG GCCGGGCTCG CTTGCTCTAT GAAATGATCT ACGGCATGAC CCATGAGGAT GCGCCCCTAA CGGCAGGCTG GAAAAGCAAA GCGGTCTATG ATTCCTTGGA TCTTTGCCTA TCCTGCAAGG GGTGCCTTAG TGACTGCCCA GTAGATGTGG ATATGGCCAC CTACAAGGCC GAGTTTAACT ATCATCATTT TCAAGGCCGG CTGCGGCCGC GGGTGGCTTA TTCCATGGGC GGGATTTATG AGGCGAGCCG GTTAGCTGCT CTAGCGCCTT GGGCGGTTAA TTTTTTCTCT CAAACCCCTA TTTTCTCCCA TGTAGTCAAA TTCGCCGCCG GTATTGCCCA GGAACGGCGG CTACCCCGCT TTGCGCCTCA GACCTTTAAA CGCTGGTACC AACAACGGCC ACATCGCCCC GGCAACCCAG GGGGCCATAA GGTGCTCCTG TGGCCCGATA CGTTTAACAA CCATTTGCAC CCGGAAATTC TGGTGGCTGC CGTAGAAGTG CTGGAAGCTG GCGGCTTCCA GGTGATCGTG CCCACGCCTG CCCTTTGCTG TGGCCGCCCT CTCTATGCCT GGGGCATGTT GGATAAAGCC AAAAAGCGGC TCACCCACTT GCTCGATGCC CTGACCCCGG AAGTCTCTCA AGGAGTTCCC ATCGTGGGTC TAGAACCGTC CTGCGTCGCC GCCTTTAGAG ATGAATTAAT CAAGCTGTTT CCCAAGGATG ATCGAGCACG ACGCATCAGC CAACAGACCT TTATGCTGGG CGAGTTTTTA GCCCAACAGA AAAATTATGA ACCGCCACTA CTCCACCGCA AGGCCGTGGT TCATGCCCAT TGCCACCACC ACGCGGTGAT CGGCCTGGAA GGGGAAAAGC AAATTCTGGA ACGGATGGGA CTTCAGTATC ATTGGCTGGA TTCTGGTTGC TGCGGCATGG CGGGTCCTTT TGGCTTCGAG GCGGATCATT ATGAGCTATC GCTTAAAATC GGAGAGCGGG TACTGCTGCC AGCCGTGCGC GCGGCTGAAA AGGATACCTT GATTATCACC GATGGTTTTG CCTGTCAGGA ACAAATCACT CAAACGACCG ATCGTTCTGC GCTGCATTTA TCGCAAATTT TGCTGATGGC CCTCCGGGAA GGCCAACGAG GAACCCGTGG CGCTTATCCG GAACGAAAAT GGTGA
|
Protein sequence | MASRTNNFPP AAAETLAADL RQRIQGEVRF DHGSRALYAT DGSNYRQVPI GVVVPQNRED IIETMAVCRE HQAPVLARGG GTSLAGQCCN TAVIMDMSKY LRRVLELDPE RRRARVEPGC VLDDLRDEAE QHHLTFGPDP STHDHNSLGG MIGNNSCGVH SIMAGRTADN VNALEILTYD GLRLWVGPTS EEKLEQIIRT GGRRGEIYSG LKAIRDKYAD LIRQRYPKIP RRVSGYNLDE LLPENGFNVA RALVVGTEGT CVTVLQADLC LIPSPPSRTV VVLGYPDVYT AGDHIPQILE YAPIGLEGMD NLLLKYMKKE HMYPKGRALL PEGGGWLLVE FGGETKAEAD EKAKRLMQAL KQSDNPPNMK LFDDPEEEER IWKIRKAGLG ATAHLRGEED TWPGWEDAAV SPEKVGPYLR DFRQLLKRYH YDCSLYGHFG DGCIHVRIDF DLITKEGIKN FKAFTHDAAD LVLSYGGSLS GEHGDGQARA DLLPKMYGEE LIQAFREFKT LWDPLNHMNP GKVVDPYPRD SNLRLGADFR PPTLKTVFAF SEDDGSFSKA SLRCVGVGEC RRNHQGVMCP SYMATKEEMH STRGRARLLY EMIYGMTHED APLTAGWKSK AVYDSLDLCL SCKGCLSDCP VDVDMATYKA EFNYHHFQGR LRPRVAYSMG GIYEASRLAA LAPWAVNFFS QTPIFSHVVK FAAGIAQERR LPRFAPQTFK RWYQQRPHRP GNPGGHKVLL WPDTFNNHLH PEILVAAVEV LEAGGFQVIV PTPALCCGRP LYAWGMLDKA KKRLTHLLDA LTPEVSQGVP IVGLEPSCVA AFRDELIKLF PKDDRARRIS QQTFMLGEFL AQQKNYEPPL LHRKAVVHAH CHHHAVIGLE GEKQILERMG LQYHWLDSGC CGMAGPFGFE ADHYELSLKI GERVLLPAVR AAEKDTLIIT DGFACQEQIT QTTDRSALHL SQILLMALRE GQRGTRGAYP ERKW
|
| |