Gene Noc_0201 details

Gene Information

Locus tag: Noc_0201
Symbol:
ID: 3706236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 218757
End bp: 221843
Gene Length: 3087 bp
Protein Length: 1028 aa
Translation table: 11
GC content: 53%
IMG OID: 637736718
Product: acriflavin resistance protein
Protein accession: YP_342262
Protein GI: 77163737
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
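
The coordinate and length fields above can be cross-checked against each other. The short Python sketch below does that arithmetic. It assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state explicitly, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 218757
end_bp = 221843
gene_length_bp = 3087
protein_length_aa = 1028

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not part of
# the reported protein length.
residues = codons - 1

print(span, codons, remainder, residues)
print(span == gene_length_bp, residues == protein_length_aa)
```

Under these assumptions the span, the gene length, and the protein length agree with one another.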


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA CCGATCTCTT TGTTCGTCGC CCGGTGCTGG CCAGCGTGGT GAGCTTGCTA 
ATTCTACTGA TTGGGATGCG CTCTTTAACT CTACTGGAGG TGCGGCAGTA TCCTGAGACC
GAGAATACAG TGGTGACGGT GTCTACCGCT TATCCGGGGG CCAGCAGTGA ATTGATTAAC
GGCTTTATTA CCACCCCCTT GCAACAGGCA ATTGCCGAGG CGGAAGGCAT AGATTATCTA
GTAGCGACTA GCACCCAGGG CCACTCGATC ATCGAAGCCC ACATGGTGTT GAACTATGAT
TCCAATGCCG CAGTTGCCGA AATTCAAGCT AAGGTCGCCA GTCAGCGCAA CGTACTGCCC
GAAGAAGCCG AGAACCCGGT CATCGACTCT ACCACGGGCG ATTCTACTGC ATTGATGTAC
ATGGCCTTCT ATAGCGAGGC GATGTCACCG GCCCAAATCA CGGACTATCT GCTGCGGGTG
GTACGGCCCA AGCTCCAGGC CGTGCCGGGT GTATCCAAGG CGCAGCTTAT TGGTAATAAA
ACCTTTGCCA TGCGTATCTG GCTCGATCCG CGCAGGATGG CGGCCTTGGG GGTGACAGCT
AACGATGTGA GAGAAGTCCT GCTGGAAAAT AATTATCTTG CCAGCGTGGG GCACACCAAA
GGGGCCTACG TCGCTGTTGA TCTTAGTGCG ACTACGGATA TTAGCCGGGA GGAAGATTTC
CTAAATCTGG TGGTGCGGGA AGCCGATGGC GCCCTGGTAC GTCTGCGGGA TGTCGCCGAA
CCCCAACTCG GCGCCGAAGA CTACGATTCC AGTAATTGGT ATAACGGAAA GCCGGCCATC
TTTATTGGCA TTGAGCAGGC GCCAGGCGCC AATCCGCTAG ATATTGCCGG GCACCTCCAT
GATCTGATGC CGGAGATACG CCGCCAACTC CCCTCCGGAG TGGACGGTTA TATTGTCTAT
GATGCCAGCG CCTATATCGA GGATGCGATC CGGGAGGTAT TCCGCACCCT CGCGGAAGCG
GTGCTGGTTG TGCTTGTAGT GATTTTTCTG TTTCTTGGCT CCCTGCGAGC GGCGCTAGTA
CCTGCTATTG CCGTTCCTCT ATCCTTGATT GGTGGGGCTT TTTTGATGCT GGTTTTAGGA
TTCTCCTTGA ACTTACTCAC CTTGCTGGCC ATGGTATTGG CCATTGGGTT GGTGGTCGAT
GACGCCATCA TCGTGGTGGA AAATATCCAC CGGCATCTGG AGCATGGGGA GTCCCGTTTT
CAGGCTGCAA TCCACGGCGC TAGGGAATTG GGACTGCCTA TTATTGCCAT GACGACGACC
CTGGTAGCCG TTTATGCGCC CATTGGTTTT ATGGGCGGTC TGGTGGGAAC CCTTTTCACT
GAATTTGCTT TCACGCTAGC GAGCGCAGTG TTGGTATCGG GGGTGGTAGC GCTCACCCTC
TCTCCCATGC TTTCTTCTAA GATGTTAAGA CCAGTGAGCG AAGCAGGCCG CTTTGAGCAG
TGGGTGGAAC GCTTCTTTAC CCGCCTTGCT GGATACTATC TGCGCCTCTT GCGTTACGCG
CTGGAGAGTC TACCCGTTGT CATTGTCTTT GCGGGAGCCA TACTGTGTAG TATTTATTTT
ATGTATGTAA CCAGCCAAAA TGAATTAGCG CCCACCGAAG ATCAAAGTAT TTTGTTTTTT
CAGGCCACTG CACCCCAGAC TGCCACTATT GATTACGATG AGGCCTATTC CCGCCAAATT
ATCGATATCT TCGAGTCCTT CCCGGAATAT CATGAGAGTT TTCTGCTGCT TGGCCAAGGC
GGCGATCCCA GTACTGTTTT TGGTGGTTTC AAGATGCCCG TGCCTTCCCA GAGGGAGCGC
TCCCAGATGG AGATCCAGCC TGAGATGCAG CAAAAGTTGC AAGGGATTGC AGGATTTCAA
ATCGCGGTTT TCCCCCGACC TAGTTTGCCT GGCTCCGGCG GTGGTCTGCC GCTTCAATTC
GTGATCACCT CGGCTGCGGA TTTTTCCCGT TTGGATCAGA TTGCCGAAGA GCTGATTAGA
CAATCCATGG CAAGCGGCAA GTTTGCTTTT CTGCAAAAAT CCGTCAAATT TTCTCGCCCT
AAGACTACCC TTAAAATTAA CCGGGATCTG GCGGGGGATC TTGGCATCCG TATGAAAGAC
ATTGGGCAGA ATTTGGGGAT CATGCTAGGG GGCGGCTATA TCAACTGGTT CAACTTAGAA
GGACGCAGCT ATAAGGTGAT TCCGCAAGTG GACCGCCGTT ATCGTCTCGA TCAGGAAATG
CTGGAAAATT ACTATATTCG CACTGGTGCG GGCGAGCTCA TTCCCCTGGC GACGCTGATT
TCCTTTGAGG AAACAGTGGA GCCGAGCAAG CGAGTCCAGT TTCAACAGCT TAATTCATTG
ACCGTTCAGG GCGTGATGGC TCCGGGGGTA GCGGTGGGCG AGGCGTTGGC TTACCTGGAG
GATAAGGCCC GGGAGATTTT TCCTTCCGCT TTTAGCTGGG ATTACGCGGG GGAGTCCCGC
CAGTATACCC AGCAGGGCAG TGCCTTGATG GTGACTTTCT TTTTCTCCCT GCTGGTGATC
TACCTTGTTT TGGCAGCACA ATTTGAGAGC TGGCGTGACC CGGTTATTAT TTTGATGTCT
GTTCCTATGT CTATTGCTGG GGCACTGGTG TTTCTTACCT TGGGTTTTGC GACTGTCAAT
ATCTATACCC AAGTGGGGCT GATCACGCTG ATTGGTCTAA TTGCTAAAAA CGGCATCCTC
ATCGTGGAGT TCGCTAATCA ACTGCAATTG CAGGAGGGGC TTGATAAACA AGCGGCGGTA
GAGAAGGCCT CCAGCATCCG GTTACGCCCC ATTCTAATGA CAACGGTTTC CATGATAGTG
GCGATGGTTC CCCTGCTAAT GGCAAGCGGT CCTGGCGCGG TGAGTCGTTT CGATATCGGG
TTGGTGGTGG CCAGCGGTTT GGGGATTGGG ACTTTGTTTA CCCTGTTCGT GGTTCCGGCG
GTGTATTTGC TAGTGGCCGG TGACCATAGG GAAGAAAAGG AGGCAGTCCA AGAGCCCATG
GAACAGGAGT CCGGCTTTAA AGCCTAG
 
Protein sequence
MKFTDLFVRR PVLASVVSLL ILLIGMRSLT LLEVRQYPET ENTVVTVSTA YPGASSELIN 
GFITTPLQQA IAEAEGIDYL VATSTQGHSI IEAHMVLNYD SNAAVAEIQA KVASQRNVLP
EEAENPVIDS TTGDSTALMY MAFYSEAMSP AQITDYLLRV VRPKLQAVPG VSKAQLIGNK
TFAMRIWLDP RRMAALGVTA NDVREVLLEN NYLASVGHTK GAYVAVDLSA TTDISREEDF
LNLVVREADG ALVRLRDVAE PQLGAEDYDS SNWYNGKPAI FIGIEQAPGA NPLDIAGHLH
DLMPEIRRQL PSGVDGYIVY DASAYIEDAI REVFRTLAEA VLVVLVVIFL FLGSLRAALV
PAIAVPLSLI GGAFLMLVLG FSLNLLTLLA MVLAIGLVVD DAIIVVENIH RHLEHGESRF
QAAIHGAREL GLPIIAMTTT LVAVYAPIGF MGGLVGTLFT EFAFTLASAV LVSGVVALTL
SPMLSSKMLR PVSEAGRFEQ WVERFFTRLA GYYLRLLRYA LESLPVVIVF AGAILCSIYF
MYVTSQNELA PTEDQSILFF QATAPQTATI DYDEAYSRQI IDIFESFPEY HESFLLLGQG
GDPSTVFGGF KMPVPSQRER SQMEIQPEMQ QKLQGIAGFQ IAVFPRPSLP GSGGGLPLQF
VITSAADFSR LDQIAEELIR QSMASGKFAF LQKSVKFSRP KTTLKINRDL AGDLGIRMKD
IGQNLGIMLG GGYINWFNLE GRSYKVIPQV DRRYRLDQEM LENYYIRTGA GELIPLATLI
SFEETVEPSK RVQFQQLNSL TVQGVMAPGV AVGEALAYLE DKAREIFPSA FSWDYAGESR
QYTQQGSALM VTFFFSLLVI YLVLAAQFES WRDPVIILMS VPMSIAGALV FLTLGFATVN
IYTQVGLITL IGLIAKNGIL IVEFANQLQL QEGLDKQAAV EKASSIRLRP ILMTTVSMIV
AMVPLLMASG PGAVSRFDIG LVVASGLGIG TLFTLFVVPA VYLLVAGDHR EEKEAVQEPM
EQESGFKA
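
The protein sequence is the conceptual translation of the gene sequence under translation table 11. The Python sketch below shows one way to check that correspondence without third-party libraries. It is a minimal illustration, not the database's own method: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard NCBI ordering; for elongation codons, table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons. Only the first and last lines of the gene sequence are translated here; the whole sequence can be passed in the same way.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest,
# bases ordered T, C, A, G). Translation table 11 uses these same
# amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna):
    """Translate in frame 0; any trailing partial codon is ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence in this record.
first_line = "ATGAAATTTA CCGATCTCTT TGTTCGTCGC CCGGTGCTGG CCAGCGTGGT GAGCTTGCTA"
last_line = "GAACAGGAGT CCGGCTTTAA AGCCTAG"

print(translate(first_line))  # MKFTDLFVRRPVLASVVSLL
print(translate(last_line))   # EQESGFKA*
```

The first result matches the opening two blocks of the protein sequence, and the second matches its final block followed by the stop codon. Each full line of the gene sequence holds 60 bases, that is 20 whole codons, so every line starts in frame and can be translated on its own.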