Gene Noc_0464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0464 
Symbol 
ID3706635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp496664 
End bp499618 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content55% 
IMG OID637736973 
Producthypothetical protein 
Protein accessionYP_342517 
Protein GI77163992 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase
[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCCC GAACGAATAA TTTCCCCCCG GCAGCCGCGG AAACACTGGC AGCCGACCTG 
CGCCAGCGAA TTCAGGGCGA GGTGCGATTT GATCATGGCA GCCGCGCTTT ATACGCTACC
GATGGCTCCA ACTACCGCCA AGTTCCCATC GGTGTGGTGG TTCCTCAAAA TCGGGAAGAC
ATTATCGAAA CCATGGCGGT TTGCCGGGAG CACCAGGCGC CGGTCCTCGC GCGAGGGGGC
GGGACCAGTT TGGCGGGGCA ATGCTGTAAC ACGGCGGTCA TCATGGATAT GTCCAAATAT
TTGCGGCGGG TACTTGAACT CGATCCCGAA CGCCGCCGGG CACGGGTGGA GCCCGGCTGC
GTGCTGGACG ACCTGCGCGA CGAAGCGGAA CAACACCACC TTACTTTCGG CCCCGATCCT
TCAACCCACG ATCATAACAG CCTAGGGGGT ATGATCGGCA ACAACTCCTG CGGCGTTCAC
TCCATCATGG CTGGCCGCAC CGCCGATAAT GTCAACGCCT TGGAAATCCT CACCTATGAT
GGTCTGCGCC TGTGGGTCGG TCCTACTTCG GAAGAGAAAC TGGAACAGAT TATTCGGACG
GGCGGAAGAC GAGGTGAAAT CTATTCGGGC CTGAAAGCGA TCCGGGATAA GTACGCCGAC
TTGATCCGGC AACGCTACCC CAAAATACCA CGTCGAGTCT CTGGCTACAA TTTGGACGAA
TTGCTTCCCG AGAACGGTTT TAACGTAGCC CGGGCTTTAG TAGTAGGCAC TGAAGGCACC
TGTGTAACCG TGCTCCAAGC TGATCTGTGC CTCATTCCAA GTCCCCCTAG TCGTACCGTG
GTGGTGCTCG GTTACCCCGA TGTTTACACC GCCGGCGATC ATATACCTCA AATTCTCGAG
TATGCTCCCA TAGGGCTGGA AGGCATGGAC AATCTGCTGC TCAAGTACAT GAAAAAAGAG
CATATGTATC CCAAGGGCCG GGCGTTGCTA CCGGAGGGAG GCGGCTGGTT GCTCGTGGAG
TTTGGCGGTG AAACAAAGGC CGAGGCAGAC GAAAAAGCAA AGCGCCTGAT GCAGGCTCTT
AAGCAATCGG ACAATCCACC GAATATGAAA CTCTTCGATG ATCCCGAGGA AGAAGAACGT
ATTTGGAAAA TACGGAAAGC GGGCTTAGGA GCCACGGCGC ATCTACGGGG CGAGGAAGAT
ACCTGGCCCG GCTGGGAGGA TGCCGCTGTC TCTCCGGAGA AAGTGGGGCC GTATCTACGG
GACTTCCGTC AGTTGCTGAA ACGCTACCAT TATGACTGTT CCCTCTATGG CCATTTTGGC
GATGGTTGTA TTCACGTCCG GATCGATTTC GATCTTATTA CCAAGGAAGG CATTAAAAAT
TTTAAAGCCT TCACCCATGA TGCCGCGGAT CTGGTCTTGA GCTACGGCGG CTCCCTGTCG
GGAGAACATG GCGATGGCCA AGCCCGCGCC GATCTGCTGC CAAAAATGTA TGGGGAGGAA
TTGATCCAAG CCTTCCGAGA GTTTAAGACC CTCTGGGACC CGCTAAATCA CATGAATCCA
GGCAAGGTGG TTGATCCTTA TCCACGGGAT TCCAATTTAC GGCTGGGCGC TGATTTCCGC
CCCCCTACGC TCAAGACCGT CTTTGCCTTT TCCGAAGACG ACGGCAGCTT TTCCAAAGCT
TCCTTACGTT GCGTGGGGGT AGGCGAATGC CGTCGTAACC ACCAGGGCGT GATGTGCCCG
AGCTACATGG CCACGAAAGA GGAAATGCAT TCCACCCGTG GCCGGGCTCG CTTGCTCTAT
GAAATGATCT ACGGCATGAC CCATGAGGAT GCGCCCCTAA CGGCAGGCTG GAAAAGCAAA
GCGGTCTATG ATTCCTTGGA TCTTTGCCTA TCCTGCAAGG GGTGCCTTAG TGACTGCCCA
GTAGATGTGG ATATGGCCAC CTACAAGGCC GAGTTTAACT ATCATCATTT TCAAGGCCGG
CTGCGGCCGC GGGTGGCTTA TTCCATGGGC GGGATTTATG AGGCGAGCCG GTTAGCTGCT
CTAGCGCCTT GGGCGGTTAA TTTTTTCTCT CAAACCCCTA TTTTCTCCCA TGTAGTCAAA
TTCGCCGCCG GTATTGCCCA GGAACGGCGG CTACCCCGCT TTGCGCCTCA GACCTTTAAA
CGCTGGTACC AACAACGGCC ACATCGCCCC GGCAACCCAG GGGGCCATAA GGTGCTCCTG
TGGCCCGATA CGTTTAACAA CCATTTGCAC CCGGAAATTC TGGTGGCTGC CGTAGAAGTG
CTGGAAGCTG GCGGCTTCCA GGTGATCGTG CCCACGCCTG CCCTTTGCTG TGGCCGCCCT
CTCTATGCCT GGGGCATGTT GGATAAAGCC AAAAAGCGGC TCACCCACTT GCTCGATGCC
CTGACCCCGG AAGTCTCTCA AGGAGTTCCC ATCGTGGGTC TAGAACCGTC CTGCGTCGCC
GCCTTTAGAG ATGAATTAAT CAAGCTGTTT CCCAAGGATG ATCGAGCACG ACGCATCAGC
CAACAGACCT TTATGCTGGG CGAGTTTTTA GCCCAACAGA AAAATTATGA ACCGCCACTA
CTCCACCGCA AGGCCGTGGT TCATGCCCAT TGCCACCACC ACGCGGTGAT CGGCCTGGAA
GGGGAAAAGC AAATTCTGGA ACGGATGGGA CTTCAGTATC ATTGGCTGGA TTCTGGTTGC
TGCGGCATGG CGGGTCCTTT TGGCTTCGAG GCGGATCATT ATGAGCTATC GCTTAAAATC
GGAGAGCGGG TACTGCTGCC AGCCGTGCGC GCGGCTGAAA AGGATACCTT GATTATCACC
GATGGTTTTG CCTGTCAGGA ACAAATCACT CAAACGACCG ATCGTTCTGC GCTGCATTTA
TCGCAAATTT TGCTGATGGC CCTCCGGGAA GGCCAACGAG GAACCCGTGG CGCTTATCCG
GAACGAAAAT GGTGA
 
Protein sequence
MASRTNNFPP AAAETLAADL RQRIQGEVRF DHGSRALYAT DGSNYRQVPI GVVVPQNRED 
IIETMAVCRE HQAPVLARGG GTSLAGQCCN TAVIMDMSKY LRRVLELDPE RRRARVEPGC
VLDDLRDEAE QHHLTFGPDP STHDHNSLGG MIGNNSCGVH SIMAGRTADN VNALEILTYD
GLRLWVGPTS EEKLEQIIRT GGRRGEIYSG LKAIRDKYAD LIRQRYPKIP RRVSGYNLDE
LLPENGFNVA RALVVGTEGT CVTVLQADLC LIPSPPSRTV VVLGYPDVYT AGDHIPQILE
YAPIGLEGMD NLLLKYMKKE HMYPKGRALL PEGGGWLLVE FGGETKAEAD EKAKRLMQAL
KQSDNPPNMK LFDDPEEEER IWKIRKAGLG ATAHLRGEED TWPGWEDAAV SPEKVGPYLR
DFRQLLKRYH YDCSLYGHFG DGCIHVRIDF DLITKEGIKN FKAFTHDAAD LVLSYGGSLS
GEHGDGQARA DLLPKMYGEE LIQAFREFKT LWDPLNHMNP GKVVDPYPRD SNLRLGADFR
PPTLKTVFAF SEDDGSFSKA SLRCVGVGEC RRNHQGVMCP SYMATKEEMH STRGRARLLY
EMIYGMTHED APLTAGWKSK AVYDSLDLCL SCKGCLSDCP VDVDMATYKA EFNYHHFQGR
LRPRVAYSMG GIYEASRLAA LAPWAVNFFS QTPIFSHVVK FAAGIAQERR LPRFAPQTFK
RWYQQRPHRP GNPGGHKVLL WPDTFNNHLH PEILVAAVEV LEAGGFQVIV PTPALCCGRP
LYAWGMLDKA KKRLTHLLDA LTPEVSQGVP IVGLEPSCVA AFRDELIKLF PKDDRARRIS
QQTFMLGEFL AQQKNYEPPL LHRKAVVHAH CHHHAVIGLE GEKQILERMG LQYHWLDSGC
CGMAGPFGFE ADHYELSLKI GERVLLPAVR AAEKDTLIIT DGFACQEQIT QTTDRSALHL
SQILLMALRE GQRGTRGAYP ERKW