Gene Information |
Locus tag | Noc_0987 |
Symbol | |
ID | 3707379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1089044 |
End bp | 1091779 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637737493 |
Product | hypothetical protein |
Protein accession | YP_343026 |
Protein GI | 77164501 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3857] ATP-dependent nuclease, subunit B |
TIGRFAM ID | [TIGR03623] probable DNA repair protein |
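The coordinate and length fields in this table are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the helper names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 1089044
END_BP = 1091779


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

With the values listed here, the result agrees with the reported Gene Length (2736 bp) and Protein Length (911 aa).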
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000754464 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCTGAAAT TACCAATTCC ATTGATTTGT GAGGATGATG TCTTTGCGGT ACTAGGAGCG GGGGCGCTCC TGCTAACGAT TAATAATCGT TTGGCGCGGG AACTTCAACA TCGTTATGAT CGGGTCCAGC AGGTAAAAGG ATTAACCGTC TGGGAAACGC CGCAAATTCT TCCTTGGTCG GTTTGGTTGC AGCGCTGTTA CGATTATCAA ACGTTGACTC TCACTGAAAC AGATAATTCT AGCCCTGCCC TGCTTAGTCC CTTACAGGAA CAAAGTTTAT GGGAGCGGGT GATCTATGAT TCGCCCTATA GCGGGGCGCT ATTACAAGTT CCTGCCACGG TTCGTACAGC CCAGGAAGCA TGGCGGCTGT GGCATGCCTG GCGCCTCCCC CTAGCGGGCC AATCCCTCTT TCTAACGGAA GATACCCAAG CTTTTCTAGA ATGGGCTCAG GTTTTTGAGG ACTATTGTCG GGTGGATCAT TGGTTGGATA ATGCCCGTTT ACCCGATGCT GTGGGCGGGA TGTTGGAAAG CGGGCAAATT CCTTTGCCGG GAACAGTTAT CTTGGCAGGA TTTGATGAAT ACACCCCCCA GCAACAAGAG CTTCTAGCCG TTCTTGAACG GCAGGGAGTT TTGCTTCAGG TCTTTGCTAA TCAGGGCGGC TCACAACAAA CCAGGCGGGT AGCCCTAGCC GATACGATAG AAGAGATTAC TGTGGCGGCC CGCTGGGCTC GGCATAGGCT AGAACACTCT CCTGGGGAGA AGATTGGAGT TATTGTCCCT GAGCTGGAAT TTTTACGGGT TCAGGTGGCG CGTATTTTCG ATGATATCCT TCATCCTGAA GCTGTATTGC CAGGCCGGGG GAGGATAGAG CGGGCTTACA ATCTTTCCTT GGCGCAGCCG TTAGCGGACA ATCCTTTAGT TCATACAGCG CTACTTATTT TGGAGCTTAG CAAAGGGGAA CTCTCCATGG TGGAGATGGG GGCGTTCTTG CGCTCGCCTT TTGTGGGTGC GGCAGAGCAG GAATTTTCTC ATCGCGCCCT TTTGGATGCT TATCTCCGCA AGACTCGGGA AGAGAGGGTG TCGTTAGAAC GCTTATGGAA GGCTGCTATT ACAGCGCGGG AGGAGGACGC TAACCACAGG TGTCCCGCCT TGGGAGAACG GCTCCAGCAG TTTAAGGTTG AGGTTGATTC ATTGCCGGCG AGACAGCCTC CGAGCGGTTG GGCCCAGAGT TTTACCTGCT GGCTTCAATT GTTAGGCTGG CCTGGAGAGC GCCCCCTTGA TAGTGAAGAA TATCAAGCAG TTTCGGCTTG GCATAAAAGC ATACAGTCTT TTTCCAGTTT GGATCGGGTA GTGCCCTCCT TAAAAAAAAA TGTAGCCATT GGAAAATTCC GGCACCTGCT CGTGGAAACT TTGTTTCAAC CCGAAAATCC TATTGTGCCG GTTCAGGTGA TGGGGGTTTT GGAAGCGGCT GGCGAGCAGT TCGATGCCGC CTGGATGCTG GGACTGCATG ATGGTATTTG GCCGACGGCG CCTCGCCCTA ATCCCCTGCT ACCCATCGAG CTGCAACGTC ACTATCGTTT GCCCCATGCT TCCGCCGAAC GGGAGCTGGC ATATACTCGC GTGGTAACGG AGCGTTTGCT TGCCAGCGCA CCTGTGGTCA TTGTCAGTCA CCCCCGCCGG GAGGGCGACA GGGATCTACG GCCTAGCCCC CTTATTGCTG AGCTAGCGTC TGTCCTTCCA GAAAGATTGC AATTAGCTTC GGTTGAGTCC TATGTCAATG AGATTCAGCA AACGGGGAGA 
ATGGAAACTT TGGTCGACGC GCAGGGACCT CCCTTGGAAG CAGGCGCCCA GGTGGGGGGC GGCACCGGAC TGCTTAAGGC TCAGGCTGCT TGTCCTTTCC GCGCTTTTGC CGAGTATCGC CTTGGCGCCA AAGGTCTTGA AGAACCCAGC GTGGGTTTAG AATCTTTGGA CCAAGGAATT TTGATTCACG TTGCTTTGCA ATATTTGTGG GAAAAATTGC AAAATCAGCA TACCCTTTTA TCCTGTAGCG CTGAAGAATT ACATGGGCTA ATAGCAGAGG CAGCAAAACA AGCGATTGCT ACGCAAACGG CAGTACGCCC CCGGATTTTT ACCGAGCGGT TTACTGCTAT AGAGCAGGAG CGGTTGGAGC AGTTGTTGCT GGAATGGCTG GAGCGGGACA AGCAACGGCC CCCTTTTGCC GTGTTGCATC AAGAGAGGTC TCAGCCCCTA AACCTTGGTG GTCTCAGTTT GGATACCAGG GCGGATCGGA TTGATCAGTT GGAAAGCGGG GAGCGGGTTA TTGTGGATTA TAAGACCGGT CGCTCTAATC CCCGGCACTG GTTTGGCGAG CGGCCCGAAG AGCCCCAGCT TCCTTTATAT TGTATCGCCC ATGAGGCACC TCTGGCGGCC GTCCTGTTTG CGCAAGTCCG TCGAGGAGAG ATGAAGTATC TGGGGGTTAC TAAAGAAGAA GGAAGTATGC CTGAAGTTGC TGTTTTTACT CGTGTTGCTG GCGGTCTAGA CAGTTGGGAA GAGTTATTAG CGCGATGGTT TCAGGTTCTC CACGCTCTTG CCGTGGAGGT CGTGGAGGGG TATGCGGCCG TAGCGCCAAG GGATGCTAAC AGTTGCGATT ATTGTGCTTT ACCTGGGTTA TGCCGGATTA AAGAGCTGGG CGGTGGCGCT GCTAAAAATA AAAAAGAACG GGACAGGAAT GATTAA
Protein sequence | MLKLPIPLIC EDDVFAVLGA GALLLTINNR LARELQHRYD RVQQVKGLTV WETPQILPWS VWLQRCYDYQ TLTLTETDNS SPALLSPLQE QSLWERVIYD SPYSGALLQV PATVRTAQEA WRLWHAWRLP LAGQSLFLTE DTQAFLEWAQ VFEDYCRVDH WLDNARLPDA VGGMLESGQI PLPGTVILAG FDEYTPQQQE LLAVLERQGV LLQVFANQGG SQQTRRVALA DTIEEITVAA RWARHRLEHS PGEKIGVIVP ELEFLRVQVA RIFDDILHPE AVLPGRGRIE RAYNLSLAQP LADNPLVHTA LLILELSKGE LSMVEMGAFL RSPFVGAAEQ EFSHRALLDA YLRKTREERV SLERLWKAAI TAREEDANHR CPALGERLQQ FKVEVDSLPA RQPPSGWAQS FTCWLQLLGW PGERPLDSEE YQAVSAWHKS IQSFSSLDRV VPSLKKNVAI GKFRHLLVET LFQPENPIVP VQVMGVLEAA GEQFDAAWML GLHDGIWPTA PRPNPLLPIE LQRHYRLPHA SAERELAYTR VVTERLLASA PVVIVSHPRR EGDRDLRPSP LIAELASVLP ERLQLASVES YVNEIQQTGR METLVDAQGP PLEAGAQVGG GTGLLKAQAA CPFRAFAEYR LGAKGLEEPS VGLESLDQGI LIHVALQYLW EKLQNQHTLL SCSAEELHGL IAEAAKQAIA TQTAVRPRIF TERFTAIEQE RLEQLLLEWL ERDKQRPPFA VLHQERSQPL NLGGLSLDTR ADRIDQLESG ERVIVDYKTG RSNPRHWFGE RPEEPQLPLY CIAHEAPLAA VLFAQVRRGE MKYLGVTKEE GSMPEVAVFT RVAGGLDSWE ELLARWFQVL HALAVEVVEG YAAVAPRDAN SCDYCALPGL CRIKELGGGA AKNKKERDRN D
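The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 predicts for an alternative start codon. The following is a minimal sketch of that translation, not the database's own pipeline: the function name `translate_cds` is hypothetical, and the start-codon set is a deliberately reduced assumption (table 11 permits more initiators than the three listed).

```python
# Codon table in the conventional T, C, A, G ordering. Table 11 uses the same
# amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the start codons allowed by table 11.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at the first stop."""
    seq = "".join(sequence.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in ASSUMED_START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGCTGAAAT TACCAATTCC ATTGATTTGT"))
```

Applied to the first 30 bases of the gene sequence, this gives the same ten residues that open the protein sequence in this record.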