Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1259 |
Symbol | |
ID | 3706371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1386199 |
End bp | 1389189 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637737761 |
Product | hypothetical protein |
Protein accession | YP_343290 |
Protein GI | 77164765 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGAATT CCGCGCTTAG AAGGGTAAGA ACATCACCGG CTTGGATAAG AAAACGATGG GTGCGCAACA GTGCTCTAGG GGTAGTAATT ATTCTCCTTA TCTATACCTT GGTTGGTTTT TTTCTTGTTC CCTATCTGCT AGAGAAACAA CTAATCAATT ACTTAAAAGA AAATCTGGGT GTAGAAGCAA AAGTTAAGGA AATCACGCTG AATCCCTATG CGTTGACTCT AGCGGTTAAT AATTTTTCCT TTCACAAGTC GGGCCATCCT AAGCTGTTTG GTTTTAAGCA ATTCTATGCC AATTTCGAGC TTTCAAGTAT ATTTCGTAAA GCGTGGGCCT TCCAAAAAAT TAGTCTAACT AAGCCTTACC TGAGGCTGCA AATCAATAAA AATGGACAGG TTAATCTTGC GGAATTACTG CCCGCGGATG AAACGCCGGC GCTAAAAGAG CGTAAAAAAG AAGTACCGCT GACAAGCGAT CAGATCCTTA TAACTGGAGG AGATATCCAT TTCATTGATT TGACTCAGCC TACTCCCTTT GAAAAAAAAC TAGAAGCCAT TAATGTGGAT TTAAAGAAAT TTAGCACCTT ACCTGAGAAT GATGGCAGTT ATTCCTTTAA GGCCACCACC CAGGCGGGGG AAATCCTGCG CTGGAAGGGA GAGGTTACCT TAAGTCCTTT ACATTCCAAG GGGACTTTTG AGCTTGTGGG AGGTAAAGCC CGTACACTCT GGAAGTATTT ACGAGACCAA GTGGCTTTTG AAATTACCAG TGGCCGGATG GATGGGCGTG GGAATTACAC CTTGGAGAGC CAGGAAAGGG GATTACAGAT TATTTTGAAG GGCGTAACGT TTGCATTGAC CCAGTTAGGT CTTAAGCCCA AAGAGGGTAA TAGGGAAATC CTTACAGTGC CAAAGTTGGG GTTTTCCGGG GGGCAGTTAC GGTGGCCGGA AAAAATCATT GGGGTTAAGT TTATCTACAT GGGCGGAACT CAGGTACGGG CCTGGCTGAA TAAGCAAGGC GCATTAAATT GGCAGAAATT ATTCAAAAGT AAAAAAGCCG ATAGAAAAAT GACGGAATCA ACCGCTAATT CTTCATCCTT AAGCTGGCGG GCGGCGATAA AAAAGATCAA GGTTGAAAAC GTTAGCGCCA GCCTGCAAGA TCGAACGATC GAACCACCAG CCACGTTAAA TATTAGCGAT TTAGGTTTAC AGCTTATTGA TGTCACTTCA GTACCCGGTT TGGCGTCTAC TTTTAATTTG CAATTCCATC TTAACAAGCA AGGTCAGCTT TTAGCTCGAG GTCATGTCAC GGCGTTGCCC CCGTCCGCTG ATTTAGAAAT CCACCTAGAA TCCCTTTCCC TATTACCTTT CCAGCCCTAT TTGAATCGTT TTCTTAAACT GAAGCTAATT TCAGGTAACC TGGAGGCTAA AGGAAATGTC GCCTACGCTC AAAGTAATGA TGCTCCTAAC TTCCAGTTTA AGGGCGATCT GGTTCTGCAA AAATTTGCCG CGGAGGATAC TTTGCTAGAT GAGCGTTTTC TAGGGTGGGA GAATTTAAAA TTTGAGCGGG TATTGGTTGG GTTATTTCCT TCTCGGATTC ATATCGATAA CATAGCACTG GATGCTCCTT ACGGAAAAGT GACGATTAAC GAGAATAAGA AGATCAATAT CAAAGAAGTA TTAAGTCCCT TAGCAGGGAA GAAAGAAAGC CAGTCTGCAG ACCCTTCCTC TACTTCCGCA TCTAAGCCGC TTCCTATTGC TATTAATTCA ATTCGAATTA AAAAAGGTTC GGCCAATTTT GCCGATCTAA GCGTGTCTCC GAAATTTTCC ATGGGTATCC ATTCGCTTCA GAGTGAAATT CAAAATCTAT CCTCGATGGA CCAAGGTAGA TCTTCCATTT CTTTGGAGGG AACCGTGGAG TCCTATGGAG AAATGAGTAT GACAGGAAAG AGCAATCTTT TTGCTCTAGA ACGCGCCACC GAGTTCAGCG CCTTTGCCAG GAATATTGCT CTTCCTGAAT TCACGCCTTA TGCAACGGAA TTTTTGGGAT ACCCGATAGA GAAGGGGAAA TTATCCCTTG ATCTCACCTA TCGGATTAAA GAAGACCAAA TCCAGGGTAA GAATGGGATT CTATTGAAAA ATCTGGATCT AGGAAAAAAA GTTGAAAGTC CAAAAGCCAT CGATGCGCCC ATAAAGCTGG CAATTGGATT GCTTAAGGAT TCTCAAGGAA AAATTGCTAT TCAGGTGCCT ATTGAAGGGA ATTTGAATGC ACCTAAGTTT AGCTATGGGC ATCTTATCGG TGAGGCTTTA ACGGGTGTTA TTGGTAAGGT CATTTCCTCG CCGTTTAGAC TGTTAGGAAG CCTAGTCGGC GCCAAAGAAG ATGTAGATTT AGGATTTATT GAATTTAGAC CTATGGGCAG CAAGTTGTTG CCTCCGGCTC AGGAGAAGCT TTTACAGTTG GCTAAGGCTT TAAAGAAGCG CCCGGAATTG CAGTTACAAA TACAAGGCAG GTATGATCCT ATCACGGACT CCAATTTTTG GAAAAAAGAA AAATTTGAAG TGATCCTGTC AGATCAACTT AAACAGCAAA GTGGTGCTTC GGACAAAGGC AAGAATGCCC TTGTTCGGCA ACAAGCATTA GAGCAGCTTT ATTTAAAGCA GTTTTCTATT AAGTCTCTTA ATCAGCAGCG CGCTCAATAT GGACTTCAAC CGGTAAAGAC AGGCGCAGGA AATGTTGAGC CGAATAATGC TTCTTCTCTA GAGAAGAAAT TGTCTTCTTA TCGGAAAGCA CTTGAGAAAA AACTTATTGA AGCGCAGCCA GTCAGTAAAA ATAAACTCCA GCAGTTAGGG CAGGAGCGGG CAAATGCCAT TAAGGCATAT CTGGTCAGCA AGGGAGGCAT TCAGGAAAAA CGTCTAGGGA TACTTCAAGT CGAATCGACT CAATCACCAG CGAAAGATTT CGTTCGTTGC CAGCTTCATA TCAGTAGCTA G
|
Protein sequence | MVNSALRRVR TSPAWIRKRW VRNSALGVVI ILLIYTLVGF FLVPYLLEKQ LINYLKENLG VEAKVKEITL NPYALTLAVN NFSFHKSGHP KLFGFKQFYA NFELSSIFRK AWAFQKISLT KPYLRLQINK NGQVNLAELL PADETPALKE RKKEVPLTSD QILITGGDIH FIDLTQPTPF EKKLEAINVD LKKFSTLPEN DGSYSFKATT QAGEILRWKG EVTLSPLHSK GTFELVGGKA RTLWKYLRDQ VAFEITSGRM DGRGNYTLES QERGLQIILK GVTFALTQLG LKPKEGNREI LTVPKLGFSG GQLRWPEKII GVKFIYMGGT QVRAWLNKQG ALNWQKLFKS KKADRKMTES TANSSSLSWR AAIKKIKVEN VSASLQDRTI EPPATLNISD LGLQLIDVTS VPGLASTFNL QFHLNKQGQL LARGHVTALP PSADLEIHLE SLSLLPFQPY LNRFLKLKLI SGNLEAKGNV AYAQSNDAPN FQFKGDLVLQ KFAAEDTLLD ERFLGWENLK FERVLVGLFP SRIHIDNIAL DAPYGKVTIN ENKKINIKEV LSPLAGKKES QSADPSSTSA SKPLPIAINS IRIKKGSANF ADLSVSPKFS MGIHSLQSEI QNLSSMDQGR SSISLEGTVE SYGEMSMTGK SNLFALERAT EFSAFARNIA LPEFTPYATE FLGYPIEKGK LSLDLTYRIK EDQIQGKNGI LLKNLDLGKK VESPKAIDAP IKLAIGLLKD SQGKIAIQVP IEGNLNAPKF SYGHLIGEAL TGVIGKVISS PFRLLGSLVG AKEDVDLGFI EFRPMGSKLL PPAQEKLLQL AKALKKRPEL QLQIQGRYDP ITDSNFWKKE KFEVILSDQL KQQSGASDKG KNALVRQQAL EQLYLKQFSI KSLNQQRAQY GLQPVKTGAG NVEPNNASSL EKKLSSYRKA LEKKLIEAQP VSKNKLQQLG QERANAIKAY LVSKGGIQEK RLGILQVEST QSPAKDFVRC QLHISS
|
| |