Gene Noc_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0639 
Symbol 
ID3706871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp684968 
End bp689836 
Gene Length4869 bp 
Protein Length1622 aa 
Translation table11 
GC content56% 
IMG OID637737147 
Producthypothetical protein 
Protein accessionYP_342688 
Protein GI77164163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGGGGG CGCCGTGCAC GAATGAAGAA GAGCATCAGT TACTTGCACA TTTCGTGCTC 
ATACAGTTCG ACTTCCTTCG CGAAGGTGCC ACTGATCCCC CTGATGCCAT TAATCGTGTT
CGGGATTGCC TAGCGCCAGA TGACACAGCC AAGGCACCGC TTGTCTGGTC GCGAGTAGTT
CAGCTAGCCC GTTCATCAGC TGGAAAAGCT GGTCAGTTTG ATCGCATACG ACTTGTTCGC
TCGATTTCAC CTGTCGCACG CCTACGCGGC GCGACGTCAT TACGTCTCAA CCTGGACAAG
TTGACAGAGC TCGCGAAGAG CTATGCGAAC CTAATTCCGG ACGATATTGG GGGAACAAAG
CTCGACCGCA TCTCGCTCCT TGAGAGCATT GATGCAAAAC TCGCCACAGC TCGTGCCGTC
CAGGTGCGCG GCCTGCCAGG AAGTGGCAAA TCAGTCGTGG TGAGGCGAGC GGTACAGCGC
GCGTTAGAAC ACGGGCCGAC TCTCTTCCTC AAAGCCGAAC AGCTCGAAGG GACCAGCTGG
ATCAGCTATG CGAACTCACA GGGTTTATCG GGTGCTCATC TAGAGCAACT TCTCGTAGAG
ATCGGTGCTG CCGGTACCCC CGTACTCTTT GTCGATGCGA TCGATCGCAT TGACAAAGAA
CACCAGCCCA TCATCGTCGA TGTGATTCAC ACCATTGTGG AATCACCACT GCTTGATAAC
TGGCGCGTAG TCGTTTCCCT TCGTGACACC GGCATTGAAA TGCTTCGTAA TTGGTTGGGT
GAATTCCTCG ATACCCTTAA TGTCGAGACG CTAAGCGTTG GTCAGCTGAG CGACGATGAG
GCTGAGTCGC TCGCAAAGGC CAAGCCACAC CTGAGATCGC TCCTGTTTGG ATCCGCTCAG
GTGCGAGAAA TCGTCCGACG GCCCTTTTTC GCGAAAGTGC TGAACCAGAG TTACATGGTC
GACTCCAGCA GTTCGACATT CGCCCCCCAA TCTGAAGTCG GTTTGATTGA GAATTGGTGG
CGGAGGGGTG GCTACAATGA GACGGGTCAA AACGCGTTAG AACGGCAGCG CGCACTGCTC
GACTTGGCAA GTGTGCGTGC CCGCCAGCTT AGTCAGCCGA TTGCCCTTAG CCAGCTAACA
TCCGTCGCAC ACATAGATGG TCTAAGATCA GACGGCATAC TCCAAAATGC CCGTGAAGGG
GTTTCCGTCC GCTTTGCACA CGATATCTTC TTCGAATGGG CCTTCTTCCA TGTCCTAGCC
GATCGCGGAC TCCAATGGGT GGAGGTTATC AAGGCCTGCG GCGAGCCACC GGCAGTAGCC
CGCGTCGTCG AGCTGACTTC GCAATGGGAA TATGCGCAGG GAACAAACTG GCCAGCGCAC
CTTGCCCAAA CGGAGAGTTC GGATCTTAGG TCACAATGGC TGCGAGCGTG GCTGGTCGGT
CCTCTTGGGA CTGCGAGATT CGATGATGAT GAAGACCAAT TCGCGACGGC CGTCATTGCC
AACGACTTTC GACTTTTCAG GAAGACGCTC GTCTGGTTTC AGGCTGAGAA GACCACCCCG
AATGAGAGTA TCCTTACTGG TGGTCTCCCA CAGGAACAAC GCCAACGGTT CGCTGATCTT
CTCGGATGGC CTTCCGACTT CTCCGCGTGG CGTCGCCTCA TCAATTTTAT ACTACAGCGC
ATTGCAGATA TTCCGCAGCG GCTTTATCCA GAGATCGTCT CCATCTTCGA GGTCTGGCAG
AATGCCTTGG CCGACCTCCG CAACCCGACG TCTCATGCAC TTCTGCAACA ATGCGCTGTC
TGGCTGGCCG CCATCGACGC GATCAGTACT GCCGATAAGC CTGATGAGAA CTCTGCTTAT
TGGGGGGCGG TTCCCGGCTT GGGTGCTCTC AGAAAATCGC TGGGCCAGCT CCTCTTAAGA
TCGTCGAGAG CCGAGCCATC ACTTTCAGCC GACTATTTGC GACGGGTCGC CAACTCGGAG
CGCATCCGAG ACGATGCATT TCATGACATC ATCGCCTACT CACCCGTCCT TGCCCAATCG
CTACCTCAGT CTGTGGTCGA GCTTTCGCTA GCGTTCCTGC TTGGGGAGCT TCCGGATGAG
CAAGTCGCCC GAGAAAAACA GGAACTTCAT GATACGGCCG AGTGGCGCAA AGCAGTTTTA
GCAAAGCCGG AAGCCGAACA GACGCGTAAA GAAAAAATGG CACTTTCGGG CGGATTCTAT
CTGCGAACCG TCGGCGATTT CAGCTACCAC GACTGGGAAA GGCTTTCGAT CCACGACGAC
CACCGAAACT TCTGGCCGCC TTCGCCACTT CGAGAGCCCT TTCACTCGCT TTTCCAATCA
TCTCCTGACC ATGCACTGCG GCTCCTTCGA GAACTCTGTA ATCATGCGAT GACCGCATGG
CGACAGCTAC ATCATCACTC ACACGACCGT GGGGGCACCC CTATACCGCT CGAACTTACG
TTTCCCTGGG GTACCCAAAT TTTCTGGGGC ACCGATCGAG AGTATCTTTG GTTTCGGTCG
ACGTGGGCAC CTAAAGCCAT CGGCTGCGCA TTCATGGCGC TCGAGGAGTG GTGCTTCACC
GAGCTTGAGC AAAGCCGGAC TGTTGACGAG CTGATCCAGC AAATCGTCGA GGGGAACGAG
TGCATCGCCA TCCTAGGAAT GGCGTCTATG CTCGCTCTCC ACACCGAGTG GGTGTCTGAA
ACAACATTGC CGCTCTTCAC CTCGCAGCGC CTGTTGGCCG CTGACCATAA TCGAATGGCG
CAAGACCTTT CATCATCGAC GAACCTGATC GGTTTCACGA GTAGTACTGA CAAACCCCAC
ATTGAGGCAA TCCAGATGGC AAATGCTAGG ACTGTCCGTA AGACACAGCT TAGTTGGATG
GTACCCAGGT TTGTCTTTGC CACGGAGCCA TTCCGCGATC GAGCACGCGA AGCGATCCTC
AATTTCAAGA ACGATCTGCC CTTCCAGTAC GAAGAACACC GCGACATTCC GGAGGCACGG
GAGTACCTCA CAAAGCAGGC CCTCGAATAT GCCGAGTTGG CGGATCCGGA AAACTATCAG
GCCTACCGCA CCGAAGAGGG TTCGGACCAA ATAGCGATTG TCCACGTCAG CCCATCTGCT
GCTCAACCTG AAAATATCGC CAGAGCCGAA GAGGCTAATA AGTACCTCAG GCAGACCGGC
CTCTGGACAT GGGCATCCAA ATCATTCGAA GAGACGACAT TGAACGACAC TTATACGATC
GAGGATGCCA TCGTATTGTC CAAGGAAGCG GACGCCAGCG ACTTATTCGA ACACCCGAAC
AGCGAGAATG AAGAAGAGCA GTTGGGGATG CGTCGAGGTG CAGTCGCCGC CACAGCAGCC
ATAGCACTTA ACTTTAGAGA AGGGTGTACA CACGAAGACC TTGAGTGGGC TCGTGGTGTC
CTAGGACGCG CAATCCGTCT GCCAGAAAAG TTCGAATCGA TGTGGTCCCC TGGTTCCGTC
GTCCCGTGGC ACCAGGGGAT CTATGTGGCA CGGGGGCTCG CAGCGGATCT CCGGGAAGGT
ACAGCAGCGC GCAGCACGGC CAATGACCTC CTCGGGCTAA TCGCGCACCC ACTTGAAATT
GTTGCGTTGA CCGCACTCGA AGAGGCCTGC AAGCTTTGGC CTAATGACTC GAAGTTGACT
TGGGCCGCGC TGATATTGGC GTTTTCACTC TGCCACGTTC CTCCGCGGCC GCGTGACCAG
CCCCGTCAAC ACGGTGAGGC ACTCCATACA TCAAATGAAG CACAAGCCGC TGTCAACGCG
GCGCTTGCGT TCTATGAGCA CGGAAGCGAA TGGGCGCCTC TCCCTTTACC GCCTCCAGCC
TGGGTGAAGG TCGAACCTGA GAAGGGCCGG CGTGGGTATC AAAGATATGA AGATTACGAC
TTGGATGATG CAACCGACGC TGCTGAAGTA TGGGGTGAAC CGGATGTCTT CTGGCACTCA
AAGCAGGCTG CAGAAGTCCT TCAGCGCATC CCCTTGGACG AGGTTCTAAA CAGCAGTGCA
AAGAGCGTGC TTCTCGACTT CCTTGCCAGT GTTCTCGACT GGACCAACCA GAAGAATGCA
CCACCGTGGG TGAAGCCCGG ACGCCGCGAT CGGTCGGCGA CCCAAATCTT TGAGTGGACG
CATACGCTCG GATCAAGGCT GGGATACATG GCTGGCCTTA TGCCGCTTGC CGACTTTCAG
GCGCGCTTTC TTGATCCGAT TTTGGACCTC GAAGGCGACA ATTGCTGGTC TCTTCTTTCG
CCATTCACGA GCACCTTTGT CTGCGCTTAC GTGTACGATG CGCCGGTCGC TCCAGTCGAT
ACAGTTGCGA TACTCGATCT CTGTCTCGCG CGGCTCCTCC AGGACCGTAC CTTCAAGCGT
GACACCTACC GGAGCGGAGA ATTTTCAGGC TTCGATCAAC CCGAGCTTGT TCGTACGTTG
ATGTTTGTCT CAGTCGAACG CGCCGATTTG GCCGCTCGCT ACGTGAATGG CGACTGGTCC
GAAATCAACC GCATCCTGCC CTTGATTGAT CGATTTATTC GCGCTGGCGG ATGGGCTGCT
TCAGTGATGG ACTCATTCTT GACGCTCTGT GAACGAGCAA GAGCTAACTA CCCGGCCGAA
GCCTTCGCAG AGCAAGTGCT CGCAATTATC GGCGACGGTC CCGACAGCCT GAAAGGTTGG
CATGGAACAT TCATCCCAGC ACGCATTGCA GAGCTGGTGC AACACTTTGC ACATCGTGAC
GCGCCGATGA CGCTGGCCCT GGCACAGAAG TTCCTGCGAA TCCTTGACAT GTTAGTCGAC
ATGGGAGATC GGCGAAGTGC CGCGCTGCAG CTTGGTGAAT CGTTCCGCGA GATTAGTGCG
CTCTCTTAG
 
Protein sequence
MKGAPCTNEE EHQLLAHFVL IQFDFLREGA TDPPDAINRV RDCLAPDDTA KAPLVWSRVV 
QLARSSAGKA GQFDRIRLVR SISPVARLRG ATSLRLNLDK LTELAKSYAN LIPDDIGGTK
LDRISLLESI DAKLATARAV QVRGLPGSGK SVVVRRAVQR ALEHGPTLFL KAEQLEGTSW
ISYANSQGLS GAHLEQLLVE IGAAGTPVLF VDAIDRIDKE HQPIIVDVIH TIVESPLLDN
WRVVVSLRDT GIEMLRNWLG EFLDTLNVET LSVGQLSDDE AESLAKAKPH LRSLLFGSAQ
VREIVRRPFF AKVLNQSYMV DSSSSTFAPQ SEVGLIENWW RRGGYNETGQ NALERQRALL
DLASVRARQL SQPIALSQLT SVAHIDGLRS DGILQNAREG VSVRFAHDIF FEWAFFHVLA
DRGLQWVEVI KACGEPPAVA RVVELTSQWE YAQGTNWPAH LAQTESSDLR SQWLRAWLVG
PLGTARFDDD EDQFATAVIA NDFRLFRKTL VWFQAEKTTP NESILTGGLP QEQRQRFADL
LGWPSDFSAW RRLINFILQR IADIPQRLYP EIVSIFEVWQ NALADLRNPT SHALLQQCAV
WLAAIDAIST ADKPDENSAY WGAVPGLGAL RKSLGQLLLR SSRAEPSLSA DYLRRVANSE
RIRDDAFHDI IAYSPVLAQS LPQSVVELSL AFLLGELPDE QVAREKQELH DTAEWRKAVL
AKPEAEQTRK EKMALSGGFY LRTVGDFSYH DWERLSIHDD HRNFWPPSPL REPFHSLFQS
SPDHALRLLR ELCNHAMTAW RQLHHHSHDR GGTPIPLELT FPWGTQIFWG TDREYLWFRS
TWAPKAIGCA FMALEEWCFT ELEQSRTVDE LIQQIVEGNE CIAILGMASM LALHTEWVSE
TTLPLFTSQR LLAADHNRMA QDLSSSTNLI GFTSSTDKPH IEAIQMANAR TVRKTQLSWM
VPRFVFATEP FRDRAREAIL NFKNDLPFQY EEHRDIPEAR EYLTKQALEY AELADPENYQ
AYRTEEGSDQ IAIVHVSPSA AQPENIARAE EANKYLRQTG LWTWASKSFE ETTLNDTYTI
EDAIVLSKEA DASDLFEHPN SENEEEQLGM RRGAVAATAA IALNFREGCT HEDLEWARGV
LGRAIRLPEK FESMWSPGSV VPWHQGIYVA RGLAADLREG TAARSTANDL LGLIAHPLEI
VALTALEEAC KLWPNDSKLT WAALILAFSL CHVPPRPRDQ PRQHGEALHT SNEAQAAVNA
ALAFYEHGSE WAPLPLPPPA WVKVEPEKGR RGYQRYEDYD LDDATDAAEV WGEPDVFWHS
KQAAEVLQRI PLDEVLNSSA KSVLLDFLAS VLDWTNQKNA PPWVKPGRRD RSATQIFEWT
HTLGSRLGYM AGLMPLADFQ ARFLDPILDL EGDNCWSLLS PFTSTFVCAY VYDAPVAPVD
TVAILDLCLA RLLQDRTFKR DTYRSGEFSG FDQPELVRTL MFVSVERADL AARYVNGDWS
EINRILPLID RFIRAGGWAA SVMDSFLTLC ERARANYPAE AFAEQVLAII GDGPDSLKGW
HGTFIPARIA ELVQHFAHRD APMTLALAQK FLRILDMLVD MGDRRSAALQ LGESFREISA
LS