Gene Noc_1724 details

Gene Information

Locus tag: Noc_1724
Symbol:
ID: 3705039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1929219
End bp: 1931435
Gene Length: 2217 bp
Protein Length: 738 aa
Translation table: 11
GC content: 50%
IMG OID: 637738205
Product: organic solvent tolerance protein
Protein accession: YP_343726
Protein GI: 77165201
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID:
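
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 1929219
END_BP = 1931435


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2217
print(protein_length(length_bp))  # 738
```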


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAACATA AAAGAAACAA TATCTTACTC GCGGGGCTTT TTTTCCTCCT GCTAGGGTTA 
GTCTCTATAG CTCGGGCACA GATACCCCAA TGGGAGAAAT GTGGAGCGTT TGTCGAGCCA
GCTTCGGAGG AGTTGCTGGA CCCTGAGCCT AGGGGTCCGG TGAATGTTGA GGCTGATCGA
GTTGAATCCG AAAAAAATGG GGTCTCTGTT TTTAGCGGCG AGGTCAAATT TAGACGCCGG
GGACAGTGGC TAGACGCTGA TGAAGTGCTT TATGACAAGC CGAATGATAC CGTGGAAGCT
TTTGGCGATG TGCGCTATCA GGACGCCACA ATGGATGTTA TTAGTGATTC CGCTAAAGTA
AATTTGGAGG CGGATATTGG TGAGGCGGAA AATGCTCGTT ATTTTCTGCG GGATTACCAT
GCCCGTGGAG AGGCGGGAGC GGTGGAACGG GAGGGCTCGG TTAAGACCGA ATTGCGAGAC
GCCACCTTTA CAACTTGTGA TATTGGGGAT AATGCTTGGC AATTGAAGGC GGATCGGGTG
AGTTTGGATC ATAAGGAAGG CGTGGGCTGG GCCCGCGGTG CCCGCCTTAG ACTCTGGGAT
ACGACGGTGT TCTATGTCCC TTTTCTGCGT TTTCCTATCG ATAATCGGCG TAAATCCGGT
TTTCTTGTCC CTTCAGGCGG AAGCTCCAGT AATTCAGGTA TAGGTATTAG CACTCCCTAT
TATTGGAATA TTGCCCCTAA TATGGATGCT ACCATTACGC CACGTTATCT CTCCGATAGA
GGCCCCATGA TGGAAGGAGA AGTGCGTTAT CTCAATCCTA GTAATTTTGG CCGAATAAGG
GGGTCTTTTT TGCCCCATGA TGCGAAAAGA GACGACTATC GCGGCGCTTT TTCCTATCGT
CATAGTGGCA GCCCCCGGCC ACGCTGGTTT ACTAACCTTG ATCTCAATCT TGTTTCCGAT
GATAGATATT TTGAGGATTT TGGTAATAGC CTAAGTATCG CGAGCACCAC TGTCTTAAAT
AATTCCTTGG ATATAGGCTA CCAAGGTAAC GGCTGGAATG CCCTAGGGCG TTTTCAGGGA
TTTCAAACCA TTGATCGGAG CATTCCTGCT TTTGCTCGAC CCTACCAGCG TTTGCCTCAG
TTCTTGGTGG ATGGATTTTT CCCGGATCGG TTTTTAGGAC TGGATGTAGA TTTTCACGGG
GAAGTGGTAC GTTTTGATCG GGATGCCGCC CCGCCCACGG GAGGCGTACG TTTAGACTTT
TGGCCGACCG TGAGTTTACC TTTTCGGACT CCAGGTACTT TCTTTACCCC TAGTATCGGC
GTGCGGGATA CCCGTTATTT TCTAGAGGAT GCTCCTCCAG GCACGGACAG TACATTAAGC
CGTACCTTGC CTATTGTTAG TATGGATACA GGGGCTATAT TCGAGCGTTC ACTGACTTTG
TGGGGAAGTG ATTTGCGCCA AACGCTGGAA CCGCGCGCTT ACTATCTGTA TGTCCCTTTT
GAAGACCAAT CGGCTTTTCC AGTGTTTGAT AGCGCCCCGC TGGATTTTTA TTTCAGCCGG
CTTTTCCAAC CCAACCGTTT TACAGGTGCC GATCGTCTTA ACGATGCCAA TCAGCTCACG
CTGGCGGTAA CGACCCGTTT GCTTCAGTCC GATACGGGAG CAGAGCTGCT TCGTGCATCT
ATTGGCCAGA TTCAGTTTTT TCGTGATCGC AGGGTTACGA TGCCTGGTGC CGCCAAGGAG
ACGGATTCAA GCTCGCTGGT TATTGCTGAA GTCGCTGCAC GACTGGCACG GGAGTGGTCC
CTGCGAGGCG AATTGCGTTT CGATCCCCAT AAAAAACAAA CTGATTTAGG CGCGGCTGAG
TTGCACTACC GTGGTGATGA GGGCGGTCTG CTAAATATCA ATTACCGTTT CCGCCGGAAT
TTTCTAGAAC AACTCAATGT CTCTGGCCGC TATCCAATTG CCGATAACTG GAGTGTGGTG
GGGCGTTGGT ACCAGTCAAT CGCCGATGGC CGCCTCCTTG AACTCCTGGG AGGGGTGGAA
TATGACAGTT GTTGCTGGGC AATACGCTTG GTGGGTCGTA GCTATATTAC CAATATCGAG
GGAGACAGGA ATAATTCGGT ATTGGTCCAA TTGGAGTTAA AAGGATTAGG TAATTTGGGC
CAGAACGTGG AAAGGTTGCT GGAGCGCTCG GTATTGGGCT ATGGGCAGCC GTTCTAA
 
Protein sequence
MEHKRNNILL AGLFFLLLGL VSIARAQIPQ WEKCGAFVEP ASEELLDPEP RGPVNVEADR 
VESEKNGVSV FSGEVKFRRR GQWLDADEVL YDKPNDTVEA FGDVRYQDAT MDVISDSAKV
NLEADIGEAE NARYFLRDYH ARGEAGAVER EGSVKTELRD ATFTTCDIGD NAWQLKADRV
SLDHKEGVGW ARGARLRLWD TTVFYVPFLR FPIDNRRKSG FLVPSGGSSS NSGIGISTPY
YWNIAPNMDA TITPRYLSDR GPMMEGEVRY LNPSNFGRIR GSFLPHDAKR DDYRGAFSYR
HSGSPRPRWF TNLDLNLVSD DRYFEDFGNS LSIASTTVLN NSLDIGYQGN GWNALGRFQG
FQTIDRSIPA FARPYQRLPQ FLVDGFFPDR FLGLDVDFHG EVVRFDRDAA PPTGGVRLDF
WPTVSLPFRT PGTFFTPSIG VRDTRYFLED APPGTDSTLS RTLPIVSMDT GAIFERSLTL
WGSDLRQTLE PRAYYLYVPF EDQSAFPVFD SAPLDFYFSR LFQPNRFTGA DRLNDANQLT
LAVTTRLLQS DTGAELLRAS IGQIQFFRDR RVTMPGAAKE TDSSSLVIAE VAARLAREWS
LRGELRFDPH KKQTDLGAAE LHYRGDEGGL LNINYRFRRN FLEQLNVSGR YPIADNWSVV
GRWYQSIADG RLLELLGGVE YDSCCWAIRL VGRSYITNIE GDRNNSVLVQ LELKGLGNLG
QNVERLLERS VLGYGQPF
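
The gene and protein sequences are printed in space-separated blocks of 10, so they need to be joined before any analysis. The sketch below is one possible way to do that, to compute a GC fraction, and to translate a CDS. It assumes the codon assignments of NCBI translation table 11 (the same as the standard code) and that an alternative start codon such as GTG is read as methionine at the first position. The helper names are hypothetical, and only the first printed line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; all are read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the rest."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First printed line of the gene sequence above.
first_line = "GTGGAACATA AAAGAAACAA TATCTTACTC GCGGGGCTTT TTTTCCTCCT GCTAGGGTTA"
print(translate_cds(clean_sequence(first_line)))  # MEHKRNNILLAGLFFLLLGL
```

The printed residues match the first 20 residues of the protein sequence listed above.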