Gene Noc_1993 details


Gene Information

Locus tag: Noc_1993
Symbol:
ID: 3704877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2291968
End bp: 2295045
Gene Length: 3078 bp
Protein Length: 1025 aa
Translation table: 11
GC content: 52%
IMG OID: 637738469
Product: acriflavin resistance protein
Protein accession: YP_343985
Protein GI: 77165460
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR01297] cation diffusion facilitator family transporter
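
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2291968, 2295045)
print(length, protein_length(length))
```

For this record the two values agree with the listed 3078 bp and 1025 aa.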


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGATCC GTTTTTTCCA CAATCACGTT CTAGCCAATC TCACCTTTGT GCTAGTGCTC 
ACGGCAGGTG CCCTAGCCTA TCTTCACCTC CCTAAGGAAC AAAACCCGCC GGTTAATTTC
AACTGGGTAC AAATTAGCAC GGTATTTCCA GGCGCCTCCG CGGAAGATGT GGAGAAACTA
ATTACCGAAC CGCTGGAAGA TGCGCTCCGT CAAGTCCAGG ATATCCGTTT TGCTTCCAGC
ACTAGCCGGG AAGATCTATC ACTCATTCTG GTGCGGTTTC AAGACATCAA TGAGCGCACC
TATGATAAAC GTGTCACTGA TCTGCGCCGG GAAGTGCAAA GTAAGGCAAA TCAAGAACTG
CCAGAAGCCG CCGAAGAACC CGAGTTTTTC GAGTTGAGTA CCGCGAATAT GTTTCCCGCC
GCCATGATTT TGGTCAGCGG AGTAGCGGAC AATGAGGTGC TACGGCGTAA CGCCCGCATC
ATTCAGGAAG ATCTAGAGCG CATACCCGGG GTAAAAAACG TGGGCACGGT GGGACTGCGG
GACCCAGAAT TGCAAGTGGA GTTTTCGCCA CGGCGTCTAT ATGCCGTTGG AGCCACCCCC
GAGGATTTGG CTGAAACGGT GGCTGCCTAT TTTCAGGATG CCGCCGCGGG AAGCGTCCGC
CAGGCAAACC AACAATGGCT GATACGCCTA GTGGGAACAA GCAGCGATCC CCAGGTATTA
GGCCGTTTCC CGCTCAAAAC TGCTTTGGGT GAAACGCCAA TTAATAGCGT CGCCGCAGTA
AGCCGCGGCC GGGAGGAACC AGAGGAATTA GTTCGTCATA AAGGCCAGCC GGGAGTATTG
CTGGTTATAG ACAAAAAGGG CAGCACCGGC TCTCTGAAAT TCATAAAAAG CCTTAAAGCC
TATATTGCCT CCCACAATGA GATCAGCCGG GAAAGCGGCG TCAAACTGAT CTTAGTGGAT
GATCAGACGC CCCGCACCGA AGCCGCATTG AAAATCATGG AAAATAATGC GGTAGTTGGC
TTGGTGCTGG TAATGCTGAC CACCTGGCTA TTTCTTGGAT CGCGGATCTC GTTTTTTATC
GGGATCGGCA TTCCCTTTAC TCTAGCAGGA ACCTTCCTGT TACTTTATCT GCTCGGGGAA
AGTCTGAATA TCTCGGTATT TCTCGGGGTA GTGATCGCTT TGGGTATGTT GGTGGACGAT
GCGGTGGTCG TGGTAGAGGC TATTTACTAC CGCTTACAGC GAGGGGTAGC ACATGCGGTT
GCAGTGTCCC AGGGGCTTTC GGAAGTGGCT GCTCCTGTGG CAACCTCGGT GTTAACCACG
ATAGCCGCTT TTCTTCCGCT TATGCTCACG CCGGGTATCC TCGGAAAATT TATGTACGTC
ATTCCCCTGG TAGTTACCAT TGCTCTGCTG ATTAGTCTGT TCGAAGCCTT TTGGATACTG
CCGGTGCATG TCATGGGCTC GAAAACCGGC GCCCATAACC CTTCCCCCTT GCAACGTTAT
CGCGTGCGGT TTGCTCACTA TTTGCGGGTT AAATATACCC ACGCATTGAT TACCCTTCTA
AAGCGACCTA AACGTTCCGT GCTGGCTATA TTGCTGCTCG TTGCTGCTTT TGGCGTCCTA
GCGGGCAGCA TGGTACGAAC TGATTTTTTT GCCGCTGACC CGATTCGGTT GTTCTATGTG
AATGTGGAAA TGGCGCCTAA TACGACCCTG GATGAAACCC TTCGTTTAAT TGAAACATTA
GATAGCAAGG TGGGAAAACA TCTCAAGCCA GAAGAATTAC GCCAAACTGT GGGCTATGCA
GGCATGATGT TCACAGGCGC GGAAGTGCTG TCAGGCGACC AGTACGGCCA GGTCGTCGTT
AGTTTGAAGC CTCAAAACGG GCAACTTCGC AGCGTCGATG AAACCATTGA AACCATGCGG
AAAAACCTCG TAAAAACACC GGGACCCGTA CGCATTTCCT TCCTCCGCAT CTCTGAAGGC
CCCCCCGTCT CTAGACCCAT CAGTATTAAG GTGCGTGGTG ATGATCTAGA GAAATTGCAC
CAAGCAGTGG CCGCCTTGGA AGCTACTTTA CAGCGTCTTC CCGGCGTCAA AGACATCACC
AGTGATAACA TTCCTGGCAA ACTGCAACTC AACCTGCGCC TAAATGGAGA CGCTATCAAG
CGTTCCGGAC TAGATCCGGC TTCCGTCACC CGCATTATTC GGCTGCTATT TGACGGAGAG
ATCGTTGCCA GCACGCGCGA ACAGGGAGAA AAGCTGGAGG TACGGGTACG CGCTAAACGA
GCCTCGGTAC CTGATATCAA TGCCCTATTC AGACAGCCTA TTGCCTTACC CGGCGGCGGC
GAGATCGCCC TAGGCCGGCT CGTAGAGGTA GATAAAAGCC TTGGGCAAGT AGCAATTCGG
CACTATAATT TTCGCCGCAC CATTACTTTG GAGGCTGACA TTAATCGAGA AATGACCGAT
GTAGTGGCTG TTAATGAGCA TATCCAGCAA GAGTGGGAAA ACCTGCGCGC CCGTTTTCCC
GGCATTGACC TGGATTTCTC CGGTGCCTTT GAAGATATTC AGGAAAGCAT CCAGGCCCTC
ACCTTGCTAT TTCTACTAGG TATAGGATTA ATCTATCTTA TCTTGGGCAC CCAGTTTCGC
AGCTACTGGC AGCCCCTGAT AATTCTCGCC GCCGTACCCA TGGCGTTTAC CGGTGTGCTG
TTCGGGCTCA TGGTGACTCG TAATCCCCTA AGTCTTTATA CGCTCTACGG TGCCGTAGCA
CTAACTGGCA TTGCCGTCAA TGCGGCGATC GTACTCATCG TGGCGGCCAA TGAACGCTTA
AACCGGGGCA TGAGTGTATT ACATGCTGCG GTATATGCAG CCCGGCGGCG CCTTATCCCT
ATATTAATTA CTAGCCTTAG CACTATTGCC GGTCTATTTG CCCTTGCCAC CGGCCTAGGC
GGGCGCTCCC TGCTGTGGGG CCCCATGGCC TCGGCGATTG TATGGGGCGT GGGAATTTCC
ACCTTGCTGA CTCTTTTTGT TATTCCTTTG TTATACCAAT TATCCATGAC CAACAAGAAC
AAAGGTTACC GCCCTTAA
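
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such a listing could be cleaned up, its GC fraction computed, and its reading frame translated is given below. It assumes the sense-strand sequence is read 5' to 3' starting at an ATG; the elongation codon assignments of translation table 11 are the same as those of the standard code, so a single lookup table is used. All names here are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Assumption: the sequence starts with ATG, so the alternative start
    codons allowed by table 11 need no special handling.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGCTGATCC GTTTTTTCCA CAATCACGTT CTAGCCAATC TCACCTTTGT GCTAGTGCTC"
print(translate(first_row))
```

Translating the first row of the listing gives the first twenty residues of the protein sequence, and the last six codons give KGYRP followed by a stop.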
 
Protein sequence
MLIRFFHNHV LANLTFVLVL TAGALAYLHL PKEQNPPVNF NWVQISTVFP GASAEDVEKL 
ITEPLEDALR QVQDIRFASS TSREDLSLIL VRFQDINERT YDKRVTDLRR EVQSKANQEL
PEAAEEPEFF ELSTANMFPA AMILVSGVAD NEVLRRNARI IQEDLERIPG VKNVGTVGLR
DPELQVEFSP RRLYAVGATP EDLAETVAAY FQDAAAGSVR QANQQWLIRL VGTSSDPQVL
GRFPLKTALG ETPINSVAAV SRGREEPEEL VRHKGQPGVL LVIDKKGSTG SLKFIKSLKA
YIASHNEISR ESGVKLILVD DQTPRTEAAL KIMENNAVVG LVLVMLTTWL FLGSRISFFI
GIGIPFTLAG TFLLLYLLGE SLNISVFLGV VIALGMLVDD AVVVVEAIYY RLQRGVAHAV
AVSQGLSEVA APVATSVLTT IAAFLPLMLT PGILGKFMYV IPLVVTIALL ISLFEAFWIL
PVHVMGSKTG AHNPSPLQRY RVRFAHYLRV KYTHALITLL KRPKRSVLAI LLLVAAFGVL
AGSMVRTDFF AADPIRLFYV NVEMAPNTTL DETLRLIETL DSKVGKHLKP EELRQTVGYA
GMMFTGAEVL SGDQYGQVVV SLKPQNGQLR SVDETIETMR KNLVKTPGPV RISFLRISEG
PPVSRPISIK VRGDDLEKLH QAVAALEATL QRLPGVKDIT SDNIPGKLQL NLRLNGDAIK
RSGLDPASVT RIIRLLFDGE IVASTREQGE KLEVRVRAKR ASVPDINALF RQPIALPGGG
EIALGRLVEV DKSLGQVAIR HYNFRRTITL EADINREMTD VVAVNEHIQQ EWENLRARFP
GIDLDFSGAF EDIQESIQAL TLLFLLGIGL IYLILGTQFR SYWQPLIILA AVPMAFTGVL
FGLMVTRNPL SLYTLYGAVA LTGIAVNAAI VLIVAANERL NRGMSVLHAA VYAARRRLIP
ILITSLSTIA GLFALATGLG GRSLLWGPMA SAIVWGVGIS TLLTLFVIPL LYQLSMTNKN
KGYRP