Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0504 |
Symbol | |
ID | 3706675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 544078 |
End bp | 545826 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637737013 |
Product | hypothetical protein |
Protein accession | YP_342557 |
Protein GI | 77164032 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA AAAAAGAGAA CCTGCCGGGA CAGCAAGGCG CGAAAGCAGC ACCCTACTAT AGTAAAAGGA AAGAAAAGAA AGCCAGATTA TCGGAAAAAG CAGGTTTTAA GGGGGCCAGT GGGGAAGCTG ACTCCAAAGC CCCAGAGCCG GAAAAACCAG AGTCTGAGAC GCCTGTTGCT GCATCCAGAG TAGTAACAAC GGAGCAAGAC TCCAAGGAGA GCAGCATTCC TTTTTCGAAA TCGGAAAGTA ATACACCAGC TTCCGAAGCA CCAGCAGAAT CCCCTCCTCC TATAGGGAGT TCCTCCGAAA AATCTTCTGA AGATGCTACC TTGTCAAAAA CAAAGGAAGC CGAGCTGACT TCGAAAGCCC AGGAGCCGGA AAATCCGGAG GCTGAGAAAA GAACCGGAGC TTCAGAATCG GTGCCACCAG CTCAGGGCTC CAAAGAGAAA AGTGCCTCTT TCTCGGAGCC CCCCCAAGCA GCCGGAAGTG CGGCTAAGTC GCCCAAAAAA GAGGGGCTTC CTGCAAGAGA TAGTAAACGC TCTATCCGTC GTACTCATCC CTATCAGGTA GAGAAGGCAA GTGGTGGGCC TGGTCGAGGC AAAACCTTGG TAGGGTTTGT GATACTGGCA ATTGCCCTAA TTTTAGTAGC GCTGGGTAGC GTTTATTACA CCCGCTTCTT AGTTGAGAAA GAGCGACAAG CGCGGGAGTC TTTGTCCAAG CAAGTGGCTG AGCAATTAGA GACTCAATCT CCCCAGATAG ATGCCCGCAT AGCTGCGCAA GTAGATGAGC AACTTGCCGG AAGAGTCGCC TCTGAGATGG ATGAACGGCT TGCTGGACAA GAAGAATCGC TTCAGCAGGA AGTTACTAAT CTAAAGAAAG AAATAGCGGA GAATAAAGGC GATCTCAAGC AGATTCAGCA AGGAATGGCG ACCTTAGAGT CTACTCTGGG GCTTCTTCGT TCCGAAGTAG AAAAGGGGCC GCAGCCGGGC AACTGGGATA TCGCGGAAGC GGCTTATCTC ATGCGCATTG CCAATGAGCG ATTACAACTA GGACAAAATG TAAGTGTAGC CCTGGTTGCC TTGCAAGCGG CTGACCGGAT CCTTCGCGAT ATGGCTAATC CTGCGTTTAT GCCGGTGCGT GCTAAATTAG CGGAAGAAAT TAATTCCCTA AAAGCAGTTC CTGATCCTGA TATTGACGGT ATGGCCCTGT CTCTCACTAA TATTATCAAT CGAGTAAAAA CCCTAGAGCT CAAAGAAACG GTTCTAGCAG AGTCAGTGCC CGCTTCAAAG GCTGAAGGAG AAGCCCCCGA GAAGACTCCG GAGCCGGAAG GCAATACTTA TATTGCGAAA ATCAAGGAAT TTTTGCGGGT TATTTGGGAC GACCTCAAGA GTTTGGTTGT GGTCAAGCAC CGGCATGAAG TAGAAGGTGG CGGTATTCCT ACCTTACTGC CGGAAGAGCG TTATTTTCTC TATCAAAATC TGCGGCTTGA ATTAAAGACG GCCCGGCTGA ATCTTTTACT AAAAAATGAA GCCGCTTTTC AGCAAAGCCT TGAGTTAGCG CAAAGCTGGT TGCAGACCTA TTTCCAAGGT TCTGAGGCTA AAGTGATAAA GGATACGCTC GCTAAGCTAG AACAGGCGAC TATAGAATCT TCTTTGCCAG ATATTTCTGG ATCGCTGAAA ACTTTAAGTC AGGTATTAAG GCACATAAAG CCCCAAGCCG CCAGAGGCAG TGAAGGAGGC AGGGCATGA
|
Protein sequence | MSDKKENLPG QQGAKAAPYY SKRKEKKARL SEKAGFKGAS GEADSKAPEP EKPESETPVA ASRVVTTEQD SKESSIPFSK SESNTPASEA PAESPPPIGS SSEKSSEDAT LSKTKEAELT SKAQEPENPE AEKRTGASES VPPAQGSKEK SASFSEPPQA AGSAAKSPKK EGLPARDSKR SIRRTHPYQV EKASGGPGRG KTLVGFVILA IALILVALGS VYYTRFLVEK ERQARESLSK QVAEQLETQS PQIDARIAAQ VDEQLAGRVA SEMDERLAGQ EESLQQEVTN LKKEIAENKG DLKQIQQGMA TLESTLGLLR SEVEKGPQPG NWDIAEAAYL MRIANERLQL GQNVSVALVA LQAADRILRD MANPAFMPVR AKLAEEINSL KAVPDPDIDG MALSLTNIIN RVKTLELKET VLAESVPASK AEGEAPEKTP EPEGNTYIAK IKEFLRVIWD DLKSLVVVKH RHEVEGGGIP TLLPEERYFL YQNLRLELKT ARLNLLLKNE AAFQQSLELA QSWLQTYFQG SEAKVIKDTL AKLEQATIES SLPDISGSLK TLSQVLRHIK PQAARGSEGG RA
|
| |