Gene Noc_0655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0655 
Symbol 
ID3706887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp706395 
End bp709304 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content51% 
IMG OID637737163 
Producthypothetical protein 
Protein accessionYP_342704 
Protein GI77164179 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAA TAAAAACACC CAAAAAACTG ATTGAAGTCG CGCTACCGCT GGATGACATC 
AACACAGCGG CAGCGCGAGA AAAGTCAATT CGGCATGGTC ATCCCTCAAC CTTACATCTG
TGGTGGGCGC GCCGACCTTT GGCAGCAGCA AGGGCCGTGT TGTTTGCGCA AATGGTCAAC
GATCCCGGAT ACCAGCAAGG AGAAGGATTC AAGTATGGGG TTAACAAGAA AGAGGCCGAG
ATCAAACGCG AGAAACTCTT TCAGATAATC CGTGATTTGG TGAAGTGGGA AAACACCAAT
AATGAGGAGG TGCTGAATCG CGCACGTGAA GCGATCTGGG AAAGCTGGCG CGAAACTTGT
CATCTAAACC GCAATCATCC CCAAGCAGCA GAACTTTTCA ATCCGGATAA ACTCCCGGCC
TTTCATGATC CATTTGCGGG CGGAGGGGCG ATCCCACTGG AGGCGCAGCG ACTGGGATTA
GAAAGTTATG CCAGTGACTT GAATCCAGTG GCGGTGATGA TCAACAAAGC CATGATCGAG
ATCCCTCCGA AGTTTGCCGG GCAACGACCA GTAGGTCCTC TGCCCCAAGG CGAGAAGCAA
GGCAAGTTGA TGGATGACTG GTCTGGCGCA AGAGGCCTGG CCGAGGATGT GCGTCGTTAC
GGCCACTGGA TGCGGGAGGA AGCATTCGAG CGTATCGGTA ATCTCTACCC TAGGATCAAG
ATCACGCAGG AGATGGTCGC AGAACGACCC GACCTGAAAC CATACCAGGG ACAGGAACTG
ACTGTTATCG CCTGGCTCTG GGCCAGAACC GTGAAAAGCC CCAATCCGGC TTTCAGCCAT
GCCGATATCC CGCTAGCGTC TAGCTTTCTA CTTTCCACCA AGAAGGGAAA AGAGTCTTAT
GTTAATCCGT TAGTCGAAGG ACATAACTAT CAATTCGAAG TGTGTATGGG AGTTCCGCCA
GCAGAAGCAA GGAACGGAAC TAAACTGGGT CGTGGGGCCA ACTTTACCTG CCTACTATCA
GATACACCTA TTGACCCGAA ATACATTTAT GCACAGGCAC AATCCGGAAA TCTGGGTCAG
CGACTAATGG CGGTTGTAGC TGAGGGGAAA AGTGGGCGTA TCTATCTTAC GCCCACCGCA
GAAATGGAGC AGGCCGCAAG TGCTGCATCA CCGGACTGGA AGCCAGATGC ATTAATGCCA
GAAAACCCTC GTTGGTTTTC ACCACCGATG TACGGTATGA AATCCTATGG CGATCTTTTC
ACTCCCCGCC AACTTGTCGC TCTAAATACA TTCTCTGATC TAGTTCAGGA AGCCTGCTAC
AAAGCCATCG CTGATGCCAA AGCAGCGGGA ATGACCGACG ATGGAATCGG TATTGATGAT
GGAGGCAGAG GTGCGACTGC TTATGGCGAT GCTTTGGCGG TTTATTTGAC CTTCGCGATA
AATAAATTGG CAGACAGAGG CTCAACCATT TGCACATGGG ATTCATCGAG AAGTAGCACT
CGAAACACAT TTGGCCGCCA AGCTATACCA ATGACATGGG ATTTTGCAGA ACCAAATCCA
CTCTCAGACT CCACTGGAAA CTTTATGGGA GGAATTGGAT GGGCAAATGA CGTGCTTAGC
CGAATGATTC CATCTAGTGG TGGAATAGCA GTTCAACAAG ACGCAGCAAC CCAAAACATT
AGTGCTGAAA AGGTCATTTC AACTGACCCG CCTTACTATG ACAATATTGG TTATGCCGAC
CTGTCTGATT TTTTTTACGT ATGGATGAGG CGCTCATTAA AATCATTCTA TCCAAGTTTG
TTCGCTACTA TGGCGGTTCC AAAGGCTGAG GAATTGGTCG CTATACCTTA TCGTCATGGC
ACAAAAGAAA AAGCAGAGAC CTTCTTTCTA GATGGTATGA CACAAGCCAT TCATAATATG
GCTGACAAAG GGCATCCTGC TTTCCCTGTA TCTATTTACT ACGCGTTCAA ACAATCAGAA
ACCAAAGAAG GGGCCACATC TAATACAGGC TGGGAAACAT TTTTAGAGGC TGTGATTAGA
GCAGGCTTTT CTATCGATGG TACTTGGCCA ATGAGAACTG AAATGTCCAA TCGTATGATT
GGCTCTGGTA CTAATGCTTT GGCTTCCTCT GTTGTTCTGG TATGCAAAAA ACGTGAAATC
GAAGCCGAAT CTATTTCACG ACGCGACTTC CAACGTGAAC TGCGTGAACA GATGCCCGAC
GCACTAGAAG CCATGATCGG CGGAGAAACG GGCACTACAC CCATCGCTCC CGTGGATTTA
GCCCAGGCCG CGATTGGCCC GGGGATGGCC ATCTTCTCCA AGTACGAGGC CGTACTGAAT
CAGGATGGTT CACGTATGAG CGTGCATGAT GCGCTAATCC TGATCAACCG CGCCATCACG
GAATACCTCA GCCCGGAATC CGGCAGTTTC GATGCCGACA CCCAGTTCTG CTCCAGTTGG
TTTGATCAGT ACGGATGGAG TACTGGTCCC TTTGGTGAAG CGAACGTGCT AGCGCAAGCA
AAGGGCACGA CGGTAGATGG CGTAAATACA GCCGGGGTCG TCGAATCCGG CGGCGGCAAG
GTTCGCCTAT TGAAATGGGC GGAGTACGAA GCCGATTGGG ATCCCATTAA AGACAACCGC
ACACCTATCT GGGAAGCCTG CCACCAAATG ATTCGCAGTC TCAACAACCA GGGTGAATCG
GCCGCTGGCG AACTACTGGC CAAGATGCCG GAGAAAGGAG AACCCATTCG TCAGCTCGCC
TATCACCTGT ACACCCTGTG CGAACGCAAG AAGTGGGCCG AAGATGCCCG CGCCTACAAC
GAATTGATCG GTTCCTGGCA TGCCATTGTC ACCGCCTCCC ACGAGGTTGG CCACAGTGGC
TCGCAAGCCG AACTCGGACT GGATTTTTGA
 
Protein sequence
MAEIKTPKKL IEVALPLDDI NTAAAREKSI RHGHPSTLHL WWARRPLAAA RAVLFAQMVN 
DPGYQQGEGF KYGVNKKEAE IKREKLFQII RDLVKWENTN NEEVLNRARE AIWESWRETC
HLNRNHPQAA ELFNPDKLPA FHDPFAGGGA IPLEAQRLGL ESYASDLNPV AVMINKAMIE
IPPKFAGQRP VGPLPQGEKQ GKLMDDWSGA RGLAEDVRRY GHWMREEAFE RIGNLYPRIK
ITQEMVAERP DLKPYQGQEL TVIAWLWART VKSPNPAFSH ADIPLASSFL LSTKKGKESY
VNPLVEGHNY QFEVCMGVPP AEARNGTKLG RGANFTCLLS DTPIDPKYIY AQAQSGNLGQ
RLMAVVAEGK SGRIYLTPTA EMEQAASAAS PDWKPDALMP ENPRWFSPPM YGMKSYGDLF
TPRQLVALNT FSDLVQEACY KAIADAKAAG MTDDGIGIDD GGRGATAYGD ALAVYLTFAI
NKLADRGSTI CTWDSSRSST RNTFGRQAIP MTWDFAEPNP LSDSTGNFMG GIGWANDVLS
RMIPSSGGIA VQQDAATQNI SAEKVISTDP PYYDNIGYAD LSDFFYVWMR RSLKSFYPSL
FATMAVPKAE ELVAIPYRHG TKEKAETFFL DGMTQAIHNM ADKGHPAFPV SIYYAFKQSE
TKEGATSNTG WETFLEAVIR AGFSIDGTWP MRTEMSNRMI GSGTNALASS VVLVCKKREI
EAESISRRDF QRELREQMPD ALEAMIGGET GTTPIAPVDL AQAAIGPGMA IFSKYEAVLN
QDGSRMSVHD ALILINRAIT EYLSPESGSF DADTQFCSSW FDQYGWSTGP FGEANVLAQA
KGTTVDGVNT AGVVESGGGK VRLLKWAEYE ADWDPIKDNR TPIWEACHQM IRSLNNQGES
AAGELLAKMP EKGEPIRQLA YHLYTLCERK KWAEDARAYN ELIGSWHAIV TASHEVGHSG
SQAELGLDF