Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0639 |
Symbol | |
ID | 3706871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 684968 |
End bp | 689836 |
Gene Length | 4869 bp |
Protein Length | 1622 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637737147 |
Product | hypothetical protein |
Protein accession | YP_342688 |
Protein GI | 77164163 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGGGGG CGCCGTGCAC GAATGAAGAA GAGCATCAGT TACTTGCACA TTTCGTGCTC ATACAGTTCG ACTTCCTTCG CGAAGGTGCC ACTGATCCCC CTGATGCCAT TAATCGTGTT CGGGATTGCC TAGCGCCAGA TGACACAGCC AAGGCACCGC TTGTCTGGTC GCGAGTAGTT CAGCTAGCCC GTTCATCAGC TGGAAAAGCT GGTCAGTTTG ATCGCATACG ACTTGTTCGC TCGATTTCAC CTGTCGCACG CCTACGCGGC GCGACGTCAT TACGTCTCAA CCTGGACAAG TTGACAGAGC TCGCGAAGAG CTATGCGAAC CTAATTCCGG ACGATATTGG GGGAACAAAG CTCGACCGCA TCTCGCTCCT TGAGAGCATT GATGCAAAAC TCGCCACAGC TCGTGCCGTC CAGGTGCGCG GCCTGCCAGG AAGTGGCAAA TCAGTCGTGG TGAGGCGAGC GGTACAGCGC GCGTTAGAAC ACGGGCCGAC TCTCTTCCTC AAAGCCGAAC AGCTCGAAGG GACCAGCTGG ATCAGCTATG CGAACTCACA GGGTTTATCG GGTGCTCATC TAGAGCAACT TCTCGTAGAG ATCGGTGCTG CCGGTACCCC CGTACTCTTT GTCGATGCGA TCGATCGCAT TGACAAAGAA CACCAGCCCA TCATCGTCGA TGTGATTCAC ACCATTGTGG AATCACCACT GCTTGATAAC TGGCGCGTAG TCGTTTCCCT TCGTGACACC GGCATTGAAA TGCTTCGTAA TTGGTTGGGT GAATTCCTCG ATACCCTTAA TGTCGAGACG CTAAGCGTTG GTCAGCTGAG CGACGATGAG GCTGAGTCGC TCGCAAAGGC CAAGCCACAC CTGAGATCGC TCCTGTTTGG ATCCGCTCAG GTGCGAGAAA TCGTCCGACG GCCCTTTTTC GCGAAAGTGC TGAACCAGAG TTACATGGTC GACTCCAGCA GTTCGACATT CGCCCCCCAA TCTGAAGTCG GTTTGATTGA GAATTGGTGG CGGAGGGGTG GCTACAATGA GACGGGTCAA AACGCGTTAG AACGGCAGCG CGCACTGCTC GACTTGGCAA GTGTGCGTGC CCGCCAGCTT AGTCAGCCGA TTGCCCTTAG CCAGCTAACA TCCGTCGCAC ACATAGATGG TCTAAGATCA GACGGCATAC TCCAAAATGC CCGTGAAGGG GTTTCCGTCC GCTTTGCACA CGATATCTTC TTCGAATGGG CCTTCTTCCA TGTCCTAGCC GATCGCGGAC TCCAATGGGT GGAGGTTATC AAGGCCTGCG GCGAGCCACC GGCAGTAGCC CGCGTCGTCG AGCTGACTTC GCAATGGGAA TATGCGCAGG GAACAAACTG GCCAGCGCAC CTTGCCCAAA CGGAGAGTTC GGATCTTAGG TCACAATGGC TGCGAGCGTG GCTGGTCGGT CCTCTTGGGA CTGCGAGATT CGATGATGAT GAAGACCAAT TCGCGACGGC CGTCATTGCC AACGACTTTC GACTTTTCAG GAAGACGCTC GTCTGGTTTC AGGCTGAGAA GACCACCCCG AATGAGAGTA TCCTTACTGG TGGTCTCCCA CAGGAACAAC GCCAACGGTT CGCTGATCTT CTCGGATGGC CTTCCGACTT CTCCGCGTGG CGTCGCCTCA TCAATTTTAT ACTACAGCGC ATTGCAGATA TTCCGCAGCG GCTTTATCCA GAGATCGTCT CCATCTTCGA GGTCTGGCAG AATGCCTTGG CCGACCTCCG CAACCCGACG TCTCATGCAC TTCTGCAACA ATGCGCTGTC TGGCTGGCCG CCATCGACGC GATCAGTACT GCCGATAAGC CTGATGAGAA CTCTGCTTAT TGGGGGGCGG TTCCCGGCTT GGGTGCTCTC AGAAAATCGC TGGGCCAGCT CCTCTTAAGA TCGTCGAGAG CCGAGCCATC ACTTTCAGCC GACTATTTGC GACGGGTCGC CAACTCGGAG CGCATCCGAG ACGATGCATT TCATGACATC ATCGCCTACT CACCCGTCCT TGCCCAATCG CTACCTCAGT CTGTGGTCGA GCTTTCGCTA GCGTTCCTGC TTGGGGAGCT TCCGGATGAG CAAGTCGCCC GAGAAAAACA GGAACTTCAT GATACGGCCG AGTGGCGCAA AGCAGTTTTA GCAAAGCCGG AAGCCGAACA GACGCGTAAA GAAAAAATGG CACTTTCGGG CGGATTCTAT CTGCGAACCG TCGGCGATTT CAGCTACCAC GACTGGGAAA GGCTTTCGAT CCACGACGAC CACCGAAACT TCTGGCCGCC TTCGCCACTT CGAGAGCCCT TTCACTCGCT TTTCCAATCA TCTCCTGACC ATGCACTGCG GCTCCTTCGA GAACTCTGTA ATCATGCGAT GACCGCATGG CGACAGCTAC ATCATCACTC ACACGACCGT GGGGGCACCC CTATACCGCT CGAACTTACG TTTCCCTGGG GTACCCAAAT TTTCTGGGGC ACCGATCGAG AGTATCTTTG GTTTCGGTCG ACGTGGGCAC CTAAAGCCAT CGGCTGCGCA TTCATGGCGC TCGAGGAGTG GTGCTTCACC GAGCTTGAGC AAAGCCGGAC TGTTGACGAG CTGATCCAGC AAATCGTCGA GGGGAACGAG TGCATCGCCA TCCTAGGAAT GGCGTCTATG CTCGCTCTCC ACACCGAGTG GGTGTCTGAA ACAACATTGC CGCTCTTCAC CTCGCAGCGC CTGTTGGCCG CTGACCATAA TCGAATGGCG CAAGACCTTT CATCATCGAC GAACCTGATC GGTTTCACGA GTAGTACTGA CAAACCCCAC ATTGAGGCAA TCCAGATGGC AAATGCTAGG ACTGTCCGTA AGACACAGCT TAGTTGGATG GTACCCAGGT TTGTCTTTGC CACGGAGCCA TTCCGCGATC GAGCACGCGA AGCGATCCTC AATTTCAAGA ACGATCTGCC CTTCCAGTAC GAAGAACACC GCGACATTCC GGAGGCACGG GAGTACCTCA CAAAGCAGGC CCTCGAATAT GCCGAGTTGG CGGATCCGGA AAACTATCAG GCCTACCGCA CCGAAGAGGG TTCGGACCAA ATAGCGATTG TCCACGTCAG CCCATCTGCT GCTCAACCTG AAAATATCGC CAGAGCCGAA GAGGCTAATA AGTACCTCAG GCAGACCGGC CTCTGGACAT GGGCATCCAA ATCATTCGAA GAGACGACAT TGAACGACAC TTATACGATC GAGGATGCCA TCGTATTGTC CAAGGAAGCG GACGCCAGCG ACTTATTCGA ACACCCGAAC AGCGAGAATG AAGAAGAGCA GTTGGGGATG CGTCGAGGTG CAGTCGCCGC CACAGCAGCC ATAGCACTTA ACTTTAGAGA AGGGTGTACA CACGAAGACC TTGAGTGGGC TCGTGGTGTC CTAGGACGCG CAATCCGTCT GCCAGAAAAG TTCGAATCGA TGTGGTCCCC TGGTTCCGTC GTCCCGTGGC ACCAGGGGAT CTATGTGGCA CGGGGGCTCG CAGCGGATCT CCGGGAAGGT ACAGCAGCGC GCAGCACGGC CAATGACCTC CTCGGGCTAA TCGCGCACCC ACTTGAAATT GTTGCGTTGA CCGCACTCGA AGAGGCCTGC AAGCTTTGGC CTAATGACTC GAAGTTGACT TGGGCCGCGC TGATATTGGC GTTTTCACTC TGCCACGTTC CTCCGCGGCC GCGTGACCAG CCCCGTCAAC ACGGTGAGGC ACTCCATACA TCAAATGAAG CACAAGCCGC TGTCAACGCG GCGCTTGCGT TCTATGAGCA CGGAAGCGAA TGGGCGCCTC TCCCTTTACC GCCTCCAGCC TGGGTGAAGG TCGAACCTGA GAAGGGCCGG CGTGGGTATC AAAGATATGA AGATTACGAC TTGGATGATG CAACCGACGC TGCTGAAGTA TGGGGTGAAC CGGATGTCTT CTGGCACTCA AAGCAGGCTG CAGAAGTCCT TCAGCGCATC CCCTTGGACG AGGTTCTAAA CAGCAGTGCA AAGAGCGTGC TTCTCGACTT CCTTGCCAGT GTTCTCGACT GGACCAACCA GAAGAATGCA CCACCGTGGG TGAAGCCCGG ACGCCGCGAT CGGTCGGCGA CCCAAATCTT TGAGTGGACG CATACGCTCG GATCAAGGCT GGGATACATG GCTGGCCTTA TGCCGCTTGC CGACTTTCAG GCGCGCTTTC TTGATCCGAT TTTGGACCTC GAAGGCGACA ATTGCTGGTC TCTTCTTTCG CCATTCACGA GCACCTTTGT CTGCGCTTAC GTGTACGATG CGCCGGTCGC TCCAGTCGAT ACAGTTGCGA TACTCGATCT CTGTCTCGCG CGGCTCCTCC AGGACCGTAC CTTCAAGCGT GACACCTACC GGAGCGGAGA ATTTTCAGGC TTCGATCAAC CCGAGCTTGT TCGTACGTTG ATGTTTGTCT CAGTCGAACG CGCCGATTTG GCCGCTCGCT ACGTGAATGG CGACTGGTCC GAAATCAACC GCATCCTGCC CTTGATTGAT CGATTTATTC GCGCTGGCGG ATGGGCTGCT TCAGTGATGG ACTCATTCTT GACGCTCTGT GAACGAGCAA GAGCTAACTA CCCGGCCGAA GCCTTCGCAG AGCAAGTGCT CGCAATTATC GGCGACGGTC CCGACAGCCT GAAAGGTTGG CATGGAACAT TCATCCCAGC ACGCATTGCA GAGCTGGTGC AACACTTTGC ACATCGTGAC GCGCCGATGA CGCTGGCCCT GGCACAGAAG TTCCTGCGAA TCCTTGACAT GTTAGTCGAC ATGGGAGATC GGCGAAGTGC CGCGCTGCAG CTTGGTGAAT CGTTCCGCGA GATTAGTGCG CTCTCTTAG
|
Protein sequence | MKGAPCTNEE EHQLLAHFVL IQFDFLREGA TDPPDAINRV RDCLAPDDTA KAPLVWSRVV QLARSSAGKA GQFDRIRLVR SISPVARLRG ATSLRLNLDK LTELAKSYAN LIPDDIGGTK LDRISLLESI DAKLATARAV QVRGLPGSGK SVVVRRAVQR ALEHGPTLFL KAEQLEGTSW ISYANSQGLS GAHLEQLLVE IGAAGTPVLF VDAIDRIDKE HQPIIVDVIH TIVESPLLDN WRVVVSLRDT GIEMLRNWLG EFLDTLNVET LSVGQLSDDE AESLAKAKPH LRSLLFGSAQ VREIVRRPFF AKVLNQSYMV DSSSSTFAPQ SEVGLIENWW RRGGYNETGQ NALERQRALL DLASVRARQL SQPIALSQLT SVAHIDGLRS DGILQNAREG VSVRFAHDIF FEWAFFHVLA DRGLQWVEVI KACGEPPAVA RVVELTSQWE YAQGTNWPAH LAQTESSDLR SQWLRAWLVG PLGTARFDDD EDQFATAVIA NDFRLFRKTL VWFQAEKTTP NESILTGGLP QEQRQRFADL LGWPSDFSAW RRLINFILQR IADIPQRLYP EIVSIFEVWQ NALADLRNPT SHALLQQCAV WLAAIDAIST ADKPDENSAY WGAVPGLGAL RKSLGQLLLR SSRAEPSLSA DYLRRVANSE RIRDDAFHDI IAYSPVLAQS LPQSVVELSL AFLLGELPDE QVAREKQELH DTAEWRKAVL AKPEAEQTRK EKMALSGGFY LRTVGDFSYH DWERLSIHDD HRNFWPPSPL REPFHSLFQS SPDHALRLLR ELCNHAMTAW RQLHHHSHDR GGTPIPLELT FPWGTQIFWG TDREYLWFRS TWAPKAIGCA FMALEEWCFT ELEQSRTVDE LIQQIVEGNE CIAILGMASM LALHTEWVSE TTLPLFTSQR LLAADHNRMA QDLSSSTNLI GFTSSTDKPH IEAIQMANAR TVRKTQLSWM VPRFVFATEP FRDRAREAIL NFKNDLPFQY EEHRDIPEAR EYLTKQALEY AELADPENYQ AYRTEEGSDQ IAIVHVSPSA AQPENIARAE EANKYLRQTG LWTWASKSFE ETTLNDTYTI EDAIVLSKEA DASDLFEHPN SENEEEQLGM RRGAVAATAA IALNFREGCT HEDLEWARGV LGRAIRLPEK FESMWSPGSV VPWHQGIYVA RGLAADLREG TAARSTANDL LGLIAHPLEI VALTALEEAC KLWPNDSKLT WAALILAFSL CHVPPRPRDQ PRQHGEALHT SNEAQAAVNA ALAFYEHGSE WAPLPLPPPA WVKVEPEKGR RGYQRYEDYD LDDATDAAEV WGEPDVFWHS KQAAEVLQRI PLDEVLNSSA KSVLLDFLAS VLDWTNQKNA PPWVKPGRRD RSATQIFEWT HTLGSRLGYM AGLMPLADFQ ARFLDPILDL EGDNCWSLLS PFTSTFVCAY VYDAPVAPVD TVAILDLCLA RLLQDRTFKR DTYRSGEFSG FDQPELVRTL MFVSVERADL AARYVNGDWS EINRILPLID RFIRAGGWAA SVMDSFLTLC ERARANYPAE AFAEQVLAII GDGPDSLKGW HGTFIPARIA ELVQHFAHRD APMTLALAQK FLRILDMLVD MGDRRSAALQ LGESFREISA LS
|
| |