Gene Noc_0504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0504 
Symbol 
ID3706675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp544078 
End bp545826 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content49% 
IMG OID637737013 
Producthypothetical protein 
Protein accessionYP_342557 
Protein GI77164032 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2959] Uncharacterized enzyme of heme biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATA AAAAAGAGAA CCTGCCGGGA CAGCAAGGCG CGAAAGCAGC ACCCTACTAT 
AGTAAAAGGA AAGAAAAGAA AGCCAGATTA TCGGAAAAAG CAGGTTTTAA GGGGGCCAGT
GGGGAAGCTG ACTCCAAAGC CCCAGAGCCG GAAAAACCAG AGTCTGAGAC GCCTGTTGCT
GCATCCAGAG TAGTAACAAC GGAGCAAGAC TCCAAGGAGA GCAGCATTCC TTTTTCGAAA
TCGGAAAGTA ATACACCAGC TTCCGAAGCA CCAGCAGAAT CCCCTCCTCC TATAGGGAGT
TCCTCCGAAA AATCTTCTGA AGATGCTACC TTGTCAAAAA CAAAGGAAGC CGAGCTGACT
TCGAAAGCCC AGGAGCCGGA AAATCCGGAG GCTGAGAAAA GAACCGGAGC TTCAGAATCG
GTGCCACCAG CTCAGGGCTC CAAAGAGAAA AGTGCCTCTT TCTCGGAGCC CCCCCAAGCA
GCCGGAAGTG CGGCTAAGTC GCCCAAAAAA GAGGGGCTTC CTGCAAGAGA TAGTAAACGC
TCTATCCGTC GTACTCATCC CTATCAGGTA GAGAAGGCAA GTGGTGGGCC TGGTCGAGGC
AAAACCTTGG TAGGGTTTGT GATACTGGCA ATTGCCCTAA TTTTAGTAGC GCTGGGTAGC
GTTTATTACA CCCGCTTCTT AGTTGAGAAA GAGCGACAAG CGCGGGAGTC TTTGTCCAAG
CAAGTGGCTG AGCAATTAGA GACTCAATCT CCCCAGATAG ATGCCCGCAT AGCTGCGCAA
GTAGATGAGC AACTTGCCGG AAGAGTCGCC TCTGAGATGG ATGAACGGCT TGCTGGACAA
GAAGAATCGC TTCAGCAGGA AGTTACTAAT CTAAAGAAAG AAATAGCGGA GAATAAAGGC
GATCTCAAGC AGATTCAGCA AGGAATGGCG ACCTTAGAGT CTACTCTGGG GCTTCTTCGT
TCCGAAGTAG AAAAGGGGCC GCAGCCGGGC AACTGGGATA TCGCGGAAGC GGCTTATCTC
ATGCGCATTG CCAATGAGCG ATTACAACTA GGACAAAATG TAAGTGTAGC CCTGGTTGCC
TTGCAAGCGG CTGACCGGAT CCTTCGCGAT ATGGCTAATC CTGCGTTTAT GCCGGTGCGT
GCTAAATTAG CGGAAGAAAT TAATTCCCTA AAAGCAGTTC CTGATCCTGA TATTGACGGT
ATGGCCCTGT CTCTCACTAA TATTATCAAT CGAGTAAAAA CCCTAGAGCT CAAAGAAACG
GTTCTAGCAG AGTCAGTGCC CGCTTCAAAG GCTGAAGGAG AAGCCCCCGA GAAGACTCCG
GAGCCGGAAG GCAATACTTA TATTGCGAAA ATCAAGGAAT TTTTGCGGGT TATTTGGGAC
GACCTCAAGA GTTTGGTTGT GGTCAAGCAC CGGCATGAAG TAGAAGGTGG CGGTATTCCT
ACCTTACTGC CGGAAGAGCG TTATTTTCTC TATCAAAATC TGCGGCTTGA ATTAAAGACG
GCCCGGCTGA ATCTTTTACT AAAAAATGAA GCCGCTTTTC AGCAAAGCCT TGAGTTAGCG
CAAAGCTGGT TGCAGACCTA TTTCCAAGGT TCTGAGGCTA AAGTGATAAA GGATACGCTC
GCTAAGCTAG AACAGGCGAC TATAGAATCT TCTTTGCCAG ATATTTCTGG ATCGCTGAAA
ACTTTAAGTC AGGTATTAAG GCACATAAAG CCCCAAGCCG CCAGAGGCAG TGAAGGAGGC
AGGGCATGA
 
Protein sequence
MSDKKENLPG QQGAKAAPYY SKRKEKKARL SEKAGFKGAS GEADSKAPEP EKPESETPVA 
ASRVVTTEQD SKESSIPFSK SESNTPASEA PAESPPPIGS SSEKSSEDAT LSKTKEAELT
SKAQEPENPE AEKRTGASES VPPAQGSKEK SASFSEPPQA AGSAAKSPKK EGLPARDSKR
SIRRTHPYQV EKASGGPGRG KTLVGFVILA IALILVALGS VYYTRFLVEK ERQARESLSK
QVAEQLETQS PQIDARIAAQ VDEQLAGRVA SEMDERLAGQ EESLQQEVTN LKKEIAENKG
DLKQIQQGMA TLESTLGLLR SEVEKGPQPG NWDIAEAAYL MRIANERLQL GQNVSVALVA
LQAADRILRD MANPAFMPVR AKLAEEINSL KAVPDPDIDG MALSLTNIIN RVKTLELKET
VLAESVPASK AEGEAPEKTP EPEGNTYIAK IKEFLRVIWD DLKSLVVVKH RHEVEGGGIP
TLLPEERYFL YQNLRLELKT ARLNLLLKNE AAFQQSLELA QSWLQTYFQG SEAKVIKDTL
AKLEQATIES SLPDISGSLK TLSQVLRHIK PQAARGSEGG RA