Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1809 |
Symbol | |
ID | 3705326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2046956 |
End bp | 2049505 |
Gene Length | 2550 bp |
Protein Length | 849 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637738292 |
Product | hypothetical protein |
Protein accession | YP_343809 |
Protein GI | 77165284 |
COG category | [S] Function unknown [V] Defense mechanisms |
COG ID | [COG0286] Type I restriction-modification system methyltransferase subunit [COG2852] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR00497] type I restriction system adenine methylase (hsdM) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATA CGGAACAGCA ATTTCTCAAG TCCCTGGACA ATAAGCTCTG GAAGGCCGCC GACAAGCTGC GCGCTAACCT GGACGCCGCC AACTACAAGC ACGTGGTGCT CGGGCTTATC TTCCTCAAAT ACGTCTCTGA CGCCTTTGAA GAGCGCCAGG AGCAACTGCT GGCACTGTTC AAGGATGAAA GCAACGACAT CTACTACCTC TCTCCCGAAG ACTACGACGG TGACGCCGAC TATCAGCAGG CGTTGCGGGA CGAGCTGGAA ATCCTGGACT ACTATCGCGA AGCCAACGTA TTCTGGGTGC CCAAAGCGGC CCGCTGGAAT ACCGTGAAAG AAAAAGCCGT GCTGCCCGTG GGCACCGTGC TGTGGCAGGA CGATGCGGGC AATGATGTCA AGCTGCGCTC CGTCTCCTGG CTGATGGATA ACGCCCTGGA AGCTATCGAA AAGAGCAACG CCAAGCTCAG GGGCATCCTC AACCGCATCA GCCAGTACCA GTTGGAAAAC GAGAAGCTGC TGGGGCTGAT TAATACCTTC TCCGATACCT CTTTCACCAA GCCGGTGTAC GGCGGGGAGA AGCTACACCT GCACAGCAAA GACATCCTCG GCCATGTATA CGAATACTTC CTGGGCCAGT TCGCCCTGGC CGAAGGCAAG CAAGGCGGCC AATATTACAC CCCCAAGAGC ATCGTCACCC TGATTGTGGA AATGCTCGAA CCCTATTCGG GCCGGGTGTA CGACCCGGCC ATGGGCTCCG GCGGCTTTTT TGTCTCCAGC GACAAGTTCA TCGAGGAGCA CGCCAAGGAA CAGCACTACG ATCCCGCCGA GCAAAAAAAG CATATCTCCG TCTACGGCCA GGAATCCAAC CCCACCACCT GGAAGCTGGC CGCCATGAAC ATGGCCATTC GGGGCATCGA TTTCAACTTC GGCAAAAAGA ACGCCGACAC TTTCCTGGAC GACCAGCACC CGGATCTGCG GGCCGATTTC GTCATGGCCA ACCCGCCCTT CAACATGAAG GATTGGTGGA GCGAATCCCT GGCGGACGAT GCCCGCTGGC AATACGGCAC CCCGCCCAAG GGCAACGCCA ACTTTGCCTG GATGCAGCAC ATGATTCACC ATCTGGCACC CACCGGTAGC ATGGCCCTGC TGCTGGCCAA CGGCTCCATG AGCGCCCACA CCAACAACGA AGGGAAAATC CGCCAGCGCC TGATTGAAGA AGATCTGGTG GAGTGCATGG TGGCCCTGCC GGGGCAGCTC TTTACCAACA CCCAGATCCC GGCCTGCATC TGGTTTTTGA CCAAAGACAA AGCCGGTGGA AAACACCCCT CTCCCTCCAG TAGGGGAGAG CCAGCAAAAC ACTCCTTTCC CTCTAGCAGG GGAGAGTCAG CAAAACACCC CTCTCCCTCC AGCGGGAGAG AATCAGCAAA ATACCCCTCT CCCTCTGGGA GAGGGGCAGG GGGTGAGGGT AAAGAGGGCA AAGAGAGCAA GCGGGACCGC CGCAGGGAAT TCCTGTTTAT CGACGCCCGC AACCTGGGCT ACATGCGCGA CCGGGTGCTG CGGGACTTCA CCTTGGATGA TATTGCCAAA ATCGCCGACA CCTTCCATGC CTGGCAGCGT ATTCCCCCCC TTCCCTTGGG AGAGAGCCGG GGTAAGGGCC AAACCCCACC CAAGCTATTG CGATTCGCCC GCGAGCTAAG AAAAAACCAG ACCGATGGGG AAAATCTGCT TTGGCAACTG CTGCGAAATC GCCAGATGGC CAATGCCAAG TTCCGCCGCC AACAGCCCAT TGAGGATTAC ATCGCGGACT TTTATTGCCA TGAACACCGG CTGGTGGTCG AATTGGATGG CAGCCAGCAC CTGACGCCTG AAGGAAGGCA GCGGGATGCC CGTCGGACAC AACGTCTCCA GGAAATAGGG ATTCAAGTGC TGCGTTTTAA CAACCGGCAG GTTTTGACCG AGACGGAAGG CGTACTGGAG TCCATCTACA ACACGCTCAC CCTTTCCCTC ACCCAAACCC TCTCCCAGAG GGAGAGGGCT TCAACACCCC GCGTCACCTT GCCCCGGGGC GACGACCGGG GCAGCGAGGC CATTCCGGCA GGGGAGAATC AGCATATCCT TCCAGCAGGG GAGGGTCAGC AAAACACCCC TCTCCCTCTA ACAGGGGAGA GCCAGCAAAA CACCCCTCTC CCTCTAACAG GGGAGAGCCA GCGAAATACC CCTCTCCCTC TAGCAGGGGA GGGTCAGCAA AACACCCCTC TCCCTCTGGG AGAGGGGCCG GGGGTGAGGG CAAAGAGTGC AAAGAGGGCA ACCACCTACC AGGACATCCC CGGCTTCTGC AAATCCGTCA GCCTGGACGA CATCAAAAAG CACGACTTCG TACTGACGCC GGGCCGCTAC GTCGGCGCAC CGGAACAGGA AGACGACGGC GAACCCTTCG CCGAGAAAAT GATGCGACTG ACCGAGCAAC TGCGGGAGCA ATTTGCCGAG AGTGATCGAT TGGAAGCTGA AATCAAGCGG AATCTGGGGA GGCTGGGATA TGAGTTGTGA
|
Protein sequence | MNDTEQQFLK SLDNKLWKAA DKLRANLDAA NYKHVVLGLI FLKYVSDAFE ERQEQLLALF KDESNDIYYL SPEDYDGDAD YQQALRDELE ILDYYREANV FWVPKAARWN TVKEKAVLPV GTVLWQDDAG NDVKLRSVSW LMDNALEAIE KSNAKLRGIL NRISQYQLEN EKLLGLINTF SDTSFTKPVY GGEKLHLHSK DILGHVYEYF LGQFALAEGK QGGQYYTPKS IVTLIVEMLE PYSGRVYDPA MGSGGFFVSS DKFIEEHAKE QHYDPAEQKK HISVYGQESN PTTWKLAAMN MAIRGIDFNF GKKNADTFLD DQHPDLRADF VMANPPFNMK DWWSESLADD ARWQYGTPPK GNANFAWMQH MIHHLAPTGS MALLLANGSM SAHTNNEGKI RQRLIEEDLV ECMVALPGQL FTNTQIPACI WFLTKDKAGG KHPSPSSRGE PAKHSFPSSR GESAKHPSPS SGRESAKYPS PSGRGAGGEG KEGKESKRDR RREFLFIDAR NLGYMRDRVL RDFTLDDIAK IADTFHAWQR IPPLPLGESR GKGQTPPKLL RFARELRKNQ TDGENLLWQL LRNRQMANAK FRRQQPIEDY IADFYCHEHR LVVELDGSQH LTPEGRQRDA RRTQRLQEIG IQVLRFNNRQ VLTETEGVLE SIYNTLTLSL TQTLSQRERA STPRVTLPRG DDRGSEAIPA GENQHILPAG EGQQNTPLPL TGESQQNTPL PLTGESQRNT PLPLAGEGQQ NTPLPLGEGP GVRAKSAKRA TTYQDIPGFC KSVSLDDIKK HDFVLTPGRY VGAPEQEDDG EPFAEKMMRL TEQLREQFAE SDRLEAEIKR NLGRLGYEL
|
| |