Gene Information |
Locus tag | Noc_0371 |
Symbol | |
ID | 3706542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 404477 |
End bp | 408265 |
Gene Length | 3789 bp |
Protein Length | 1262 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637736883 |
Product | hypothetical protein |
Protein accession | YP_342427 |
Protein GI | 77163902 |
COG category | [S] Function unknown |
COG ID | [COG2911] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
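The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names `gene_length_bp` and `protein_length_aa` are hypothetical, and it assumes the record uses 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon, which is consistent with the values listed here.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count, hence the +1.
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(404477, 408265)   # Start bp, End bp from the record
    print(length)                             # 3789
    print(protein_length_aa(length))          # 1262
```

Both results agree with the Gene Length (3789 bp) and Protein Length (1262 aa) fields in the record.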
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGCTAC TGCGTTCGAT TGGCGCTTTT ATTCTGCTGT TGTTAGTGCT GCTGGTAGGG GGGCTTGCCT ATTTGCTCTT GACTGAGGTG GGAACTCGGC AATTATTGGC CCAGGTGGCC CAGGTGATTC CGGGAGAGTT AGAAACCCAA CAGGTTGAGG GGACTTTGGG GGAGGCGCTT ACCCTGACCG GATTGCGCTA CCGGACTCCG GATTTTACTC TTGAAGTCGG GTATTTTCAC TTTGCCTGGC GGCCGGCAGC CTTGTTGGGG GCAACTTTCT GGGTGGAGCA GCTTCATCTT GGAGAAGTCA GCTGGCGCCA AAAACGCCCA GGGGAGTCTA CGGCCTCGCA AGAGCCGATA GTGCTTCCCG AAATACAGAT CCCTTTAAAA GCTAAGGCAG AAGACGTCCG GCTGCAAAAT ATTTCCTTAA CTCCCCTTGG ATCGTCCCCT GTGGTTATTA ATACTATTGT CTTCAAGGGG AATTTCGATG GCCAAGCCTT ACAAGTAGGC GAATTGGGGG TTTCTGCTCC CCAAGGAGAG GTGCAGGTGA GTGGAGAGAT GGCTTTCCAG GAGGCTTATC CCATGGCGTT CGCCCTGGCT TGGGGGGCAC CGGTTCCCGA GTTGGGGAAG GTCACTGGCA TTGGTAGGGT GAAAGGAGAT CTTCGCCAGT TGACTCTACA CCAAACCGTT CAAGCTCCCT TCCATCTGCA ATTCCGAAGC AGGCTTTTTG AGCTATTGGA ACAGCCTCGC TGGGTTGCGG CGCTGGAGGT TCCCGGGGTC GAATTGCAGC ATTTATCTGC CCAGTGGCCA GCAGTACGTT TTGGGTTAGA CCTTAAGGGC GCAGGGAGCT TGGAACAGTT TAAGGTACGA GCCTCCTATC AGATTCAGGA GGCTCAAACA GGCGAAGTGA GGGGCAGTCT CTCGGTGGAA CAGCTTGCCA TGGGGCATTG GCTCCTGGAC CGGCTGACCT TGCGGCAAGT TGAGGGGCCT GCTCGGTTAG CGCTCCGGGG GGAGGTGATG ATGGCAGAAG ACCAGCCTCG TATGAGCCTT GCGGGCCAGT GGCAAGATAT GGCTTGGCCC CTGAGAGGGA CGGCTCAAGT TTCCAGCAAC CGGGGCCAAT TGACCTTGGA AGGAACGCCA GCCGCCTACC GGTTACAATT GAACAGCGCT CTTGCCGGTC AGGATATTCC CACCAGCGAG TGGCATTTGA TGGGTACCGG CGATACCACC CAGTTTGAGT TAGAGAAGTT ACGGGGACAA TTGTTGGATG GGGTTTTGAG TGGTTCGGGC AATTTCCGCT GGACTCCCGC CTTGGCCTGG GATGTGCGAG TAGACGGAGA AGCGCTTAAT CTGTCGAAGG AATGGCCAGA ATGGCCAGGA GTATTGTCCT TTAGCAGTGA TACTAACGGT GTTTTAGAGG AGGACGCCCA GGATATCACG CTTGATCTCC ACGCCCTTTC TGGCTCCCTA CGGGGCTATC CAGTAGCGGC CCAGGGAAGA GTTCAACGAC AAGATAATAC CTGGCGTATT GCCGATTTAA AGTTACGCTC GGGAGATTCC CGGCTTTCTT TGGGGGGAAC GGTCAATGAG CGGCTAGCGC TAAAGTGGCA TCTCTCTTCC CCGGATCTTT CCCAACTGCT GCCGGAGGCT CAAGGGGATT TGTTAGTAAA GGGGCATGCG AAAGGGCCGC TTACAGGCCC GGAACTTACC TTTCGACTCC AGGGGAAAGC TCTGGCCTAT CAGGATTATC AAGTGGAATC GGTGATGGCT AATGTGGATG TGGACTTACA GGGGAAGCAG 
TCATCGCAGG TGCGGATCGA TGCCTCGGAT TTCACTTTGG CAGACCAGAC TCTCCGCTCT GTGGCTATTG AGGGGGGCGG CACGCCATTG CACCATAAAC TGAGTCTGGC AGTGAAGGCG CCGGAGCGTT CCCTGGACCT TGGATTCCAA GGCTCCTGGA AAGAGGAGGT TTGGCAGGGA GAAATTACCA AGACCGAGCT TACGGACTCG CTCATGGGTC ACTGGGAGGC AGTGAGCGCC ACTTCCCTTA CTCTGAGTCG CAGCAATATT GATCTCGCTC CTTGGTGCTG GCGGCAGCAA TCTGCCCAGC TCTGCCTAGG TGGCAGTTGG CAGGAAGAAA GTTTTTGGCG AGGAAGTTTC AAGCTAGAGG ATTTCCCGTT AGCAATGCTA GGGCCGCTTT TACCGGAAAA AACAGCACTG GAAGGGGTGA TTGGGGGCGA GGTGCAGGCA CAGGGAGAGG CTCACCAGTT AGTCCAGGCC CGGATGCAGC TTGCGGCTTC CGGGGTTCAA TTAACCCAAG TGACACCTGA GGGGCAGTCT CTGCGCTTTC CCTATCAAGA CATGCAGGCC AGGCTTAATT TGGAAGATAG GGGAGGGAAA GCAGGTTTTG AGCTACTTTC AGCCGATCCT GGCACAGCGC CAGTTAGGGC CTCTCTGCGC CTGCCTTCTG CTCCTTTGGA TCTAACTGCC TTGGGGCAAT TGCCTTTAGA TGGCCAGATC TCAATGGCTT TTAGGGATCT CGCTTTTTTG GAAACGCTGA TACCAGAATT GGAAGCGGTT CAGGGACAAT TGCGGGCAGA TCTGACCTTG GGAGGACAGG TTGCCGCACC TCAATTACTG GGAGAAGTTG TGCTTCAAGA AGGAAGCGCC CAGGTCGTTC CCCTGGGGTT AAAGTTGATA AAGATCCGGT TGCGGGCAGA GGCAAGCGAG CAAGATAGGA TCGTTTTCAC GGGCGGGGTA CACTCGGGGG AGGGAGAGTT AGCTGTCAAT GGCCAGGTTC GTCTGGAACC TGAGGCGGGT TGGCCCGCTA AGGTGACGGT GACCGGAGAA CGTTTTGAAG CCATGGGGAC CTCGGATATC AGGGTATTGA TCTCACCTCA GTTGCAGATC ACGAAGGCAG AAGAAGCGAT TCGCGTGGAA GGGGAAGTTG TAATACCAGA GGCCACTTTA GTGATCAAGG ATATTGAGAG CAGAGGGGGG GTGCCAGTTT CCCAAGACGT AGTGATCATA TCCCAGGAAA AGGAAACTGA AAAAAAGGCT GTGCCCATTT ATGCCCGAGT CAGAATTATT TTGGGCGATG ATATTTCAGT GCGGGCCTTT GGTTTTAAAG GGGGAATAAC CGGGAGCCTA TTAGTGACGG AAACTCCCGG AAAGGCTACA CGGGGAAGCG GTGAACTCCA GATCGTTAAG GGTGAATATA AAGCCTATGG ACAGCAGTTA AATATTCGGC AGGGTCAGGT GGTTTTTGCC GGACCTATTG ATGATCCCCG GCTGAGCGTA GAAGCAGTAC GTGAGGTTGA TAATGGTAAT ATAGTCGTTG GAGCGCGCAT CCGGGGGGCT GCCAGTGAAC CAGTGCTCAC TTTATTTTCT GAGCCGTCGA TGGATGAGAG CAATATCCTG GCCTATTTGA TCCTAGGGCG GCCTTTGGCG GGAGCTTCTG GGGGTGACGG CGAATTATTG ACTAAAGCTG CGACTTCTCT TGGCTTGTCC GGTGGCACCC TCCTTGCTAA ACGGCTTGGA AAAATCTTTG GTTTGGAAGA TGTGGGGATT GAATCCGCTG ATAACGGTAA TGGGAATGGG GATACCCAAA 
GTGAGATGTT GATGCTGGGC AAGCAGCTCT CACCCAGTCT TTACATTGGT TATGGAATCG GGTTATTTGA GCGTTTTAGC TCTTTTCGAA TGCGCTATAT TTTGAGCAAA AATTGGAGCG TACAAGCCGA AACGGGCCTT GAAACCGGCG CCGATTTATT TTATAGCCTA GAGCGGTGA
|
Protein sequence | MKLLRSIGAF ILLLLVLLVG GLAYLLLTEV GTRQLLAQVA QVIPGELETQ QVEGTLGEAL TLTGLRYRTP DFTLEVGYFH FAWRPAALLG ATFWVEQLHL GEVSWRQKRP GESTASQEPI VLPEIQIPLK AKAEDVRLQN ISLTPLGSSP VVINTIVFKG NFDGQALQVG ELGVSAPQGE VQVSGEMAFQ EAYPMAFALA WGAPVPELGK VTGIGRVKGD LRQLTLHQTV QAPFHLQFRS RLFELLEQPR WVAALEVPGV ELQHLSAQWP AVRFGLDLKG AGSLEQFKVR ASYQIQEAQT GEVRGSLSVE QLAMGHWLLD RLTLRQVEGP ARLALRGEVM MAEDQPRMSL AGQWQDMAWP LRGTAQVSSN RGQLTLEGTP AAYRLQLNSA LAGQDIPTSE WHLMGTGDTT QFELEKLRGQ LLDGVLSGSG NFRWTPALAW DVRVDGEALN LSKEWPEWPG VLSFSSDTNG VLEEDAQDIT LDLHALSGSL RGYPVAAQGR VQRQDNTWRI ADLKLRSGDS RLSLGGTVNE RLALKWHLSS PDLSQLLPEA QGDLLVKGHA KGPLTGPELT FRLQGKALAY QDYQVESVMA NVDVDLQGKQ SSQVRIDASD FTLADQTLRS VAIEGGGTPL HHKLSLAVKA PERSLDLGFQ GSWKEEVWQG EITKTELTDS LMGHWEAVSA TSLTLSRSNI DLAPWCWRQQ SAQLCLGGSW QEESFWRGSF KLEDFPLAML GPLLPEKTAL EGVIGGEVQA QGEAHQLVQA RMQLAASGVQ LTQVTPEGQS LRFPYQDMQA RLNLEDRGGK AGFELLSADP GTAPVRASLR LPSAPLDLTA LGQLPLDGQI SMAFRDLAFL ETLIPELEAV QGQLRADLTL GGQVAAPQLL GEVVLQEGSA QVVPLGLKLI KIRLRAEASE QDRIVFTGGV HSGEGELAVN GQVRLEPEAG WPAKVTVTGE RFEAMGTSDI RVLISPQLQI TKAEEAIRVE GEVVIPEATL VIKDIESRGG VPVSQDVVII SQEKETEKKA VPIYARVRII LGDDISVRAF GFKGGITGSL LVTETPGKAT RGSGELQIVK GEYKAYGQQL NIRQGQVVFA GPIDDPRLSV EAVREVDNGN IVVGARIRGA ASEPVLTLFS EPSMDESNIL AYLILGRPLA GASGGDGELL TKAATSLGLS GGTLLAKRLG KIFGLEDVGI ESADNGNGNG DTQSEMLMLG KQLSPSLYIG YGIGLFERFS SFRMRYILSK NWSVQAETGL ETGADLFYSL ER
|
| |