Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0550 |
Symbol | |
ID | 3706742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 592101 |
End bp | 594401 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637737058 |
Product | hypothetical protein |
Protein accession | YP_342600 |
Protein GI | 77164075 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA CATTCTGTTT ACTTGCCATT GTTGCGTTTC CCGCCATGGC CGAAGTCACC GTGACTGAAC CGACGAAATT GACATCCATT GAAAGCCAAA TTTCGGCGCA AGAGTCCAAA AGCCTGGACC TCAAGGCGCG CTCAAGCGCA CTGGAAGCCG AGGAGCACCT CGCCGCCGCC ATTGACGGCT TTGTGTACGC GAGAGTGAAG CGCACGACGG GGAATTTTGT CGTCAAGGGG AAAACCTATC ACTCTATTTC TGCTACCGAC CGGCTGATTG ATGTCACCTT TCAAGGCACA ACCGGGGACT TTTTGACCGG CCCCGGCGAT TTGGTCTGGC GGAAGGGCGA TGGCACGGAA GTTATCCTCC ACCATTGCCC GGAAGTCGCG CTAGGTGAGA ACACATGTAT TGTGATTGAC CCGCGAGTGA GCTTCGACGG CAAGCAAGTG GCCTATACGG TGCTGGAGGG TCGGTATGGG ACCAACCCTG ATGTGAAAAT GCAGCATGGG ATTGATGCGC AGAAGTCCTC CCTTCATTTC GTCAATCTCG AAACGATGGA AAAGACCCAA TGGGCAGCCG TGCAAGGGGT ATTTGATATG GCCCCACAAT GGTTGCCCGA TGGCACCCTG ATGTTTACCT CCAACCGGGA CCATATCAAA GCAGGAAGCA TCAGCGGCTT TAGCCCGGAA GGGGAAGTCT TGCAGTTGTG GGTGGCGGAC AGAAATGGCA GCCACGCCCG TAACGTCAGT CCGGACATTC TCGCCGATGC GCTCCATCCC ATCCTTCATC CTTCTGGTAA GGTGCTTTTT TCAAGCTGGA AGCGGGACTT GGAAAGCGTT CAGAACAAAA CCCTGGATAA TCTGTGGTAT ATCGCCGAAA CCCGGCAAGA TGGTTCGGAG CATAATGCTA TCTGGGGCGC CCATGCGCGG TTTCTCCCTA ACCCAGACAA TATCACCATT AAAGCCCTTC ATTTTACGGG CGTGCGTAGC AATGGGGACG TGTGTACGAC GAACTATTAT CGCCAAAATA ATAAAGGCGG CGGTAACATT CAGTGCTTTA CCTATCCCAC CCCGAATCAT CCCGAGGGCA AATTCCCTTT CAAGGCCAAA GAGGGAAGAT ATTTAGGGTT GTGGGGGCAT TCCGGCGATC AGTTCAAAAC CGATGGGCTG GGGCGCACCC GCGATCCCAT GGGGATGCCT AAAGGGGGGC TGGTGTTTTC CGCCAGCCCA AAAGGCGTGT GCCATTTCTT CCATAAAATT GCCCAGCTAC CGCTCAATGA TAGCGGCTGT GATTTCGGTA TCTATGTACA AGGCACCGTG CCAGAGAAAA GCCTGGAAAG CCGTATCCTT ATCGTCAATA GCCCCGACTG GCACGAGTTC CAACCTCAGC CCGTGATGCC CTACGAGGAT ATTTACGGCA TTAAGAAGCC GCCAGTGGTA AAGATGGACC ATCCCAACCC GGACGGCAAA TGCATCCTGC GCTCGGCTAG CAAGCGGATG CAAGTAGACA ATGAACAGGG CTATCGCGGC CCCGGCAACG CTAATCCGTG TTATGAACAA GGCTGTTCCA TGGCGGGCGT GGATAAGCAA GCGCTCATGA CAAGCATCAA GTTCTGGCGG CCCGTTCCGT TCAATTCACG GGTTTTGCAT CAGGATTTTG ATGGATTTGG TCCATTCAGG ATGGAAGTGC TCGGCACGGT TCCATTACGG TCTGATGGTT CATTTACCGC TGAAGTGCCT TGCGATACCC CGTTCTATAT GGCTGGAGTC GATAGCGAAG GCAGAACGCT CGCCAGAGAC AAGATCGCCA TGTCTCTGCG GCCCGGTGAA ACCCGTACCT GCTCGGGGTG CCACAACCAC GACGACGACA ACCCCCCTCA GTTCAAGGAC TCAGAGGCGG CCAAAGTGGC CCCCACCCCA GTCCCCGCAA GCGGAACGCT GTTGAATTGG ACCACGGATG TCTGGCCCCT GTTGCGTGAC AATTGCGATG AAAACTTTCC CGAGTTTAAC CTTTCAGCCT CCACCGAGGA AAAAGCTTTT AATAACTTTA CCCATTACGC CAAAGGGGCC GCAAAGCCAC CGTGGCGGAG CTACTTTATC AACTGGTTGT TTGCCCGCGA GAGTCTCCTG TACTGGCGCA GCGCTGGAGA ACGTTTGGAT GGACGCACCG ATGCCAGCCG CACCGACGAT ATCGACTACG GCACCGTAGC ACCCAAGCAA GCCTGCCTTA ACGCCGCGCA GCTTAAAATC CTTGCAGACT GGATTGAAAG CGGTGCTTAT CGGAAAAATA ATAAGTGGTG A
|
Protein sequence | MKKTFCLLAI VAFPAMAEVT VTEPTKLTSI ESQISAQESK SLDLKARSSA LEAEEHLAAA IDGFVYARVK RTTGNFVVKG KTYHSISATD RLIDVTFQGT TGDFLTGPGD LVWRKGDGTE VILHHCPEVA LGENTCIVID PRVSFDGKQV AYTVLEGRYG TNPDVKMQHG IDAQKSSLHF VNLETMEKTQ WAAVQGVFDM APQWLPDGTL MFTSNRDHIK AGSISGFSPE GEVLQLWVAD RNGSHARNVS PDILADALHP ILHPSGKVLF SSWKRDLESV QNKTLDNLWY IAETRQDGSE HNAIWGAHAR FLPNPDNITI KALHFTGVRS NGDVCTTNYY RQNNKGGGNI QCFTYPTPNH PEGKFPFKAK EGRYLGLWGH SGDQFKTDGL GRTRDPMGMP KGGLVFSASP KGVCHFFHKI AQLPLNDSGC DFGIYVQGTV PEKSLESRIL IVNSPDWHEF QPQPVMPYED IYGIKKPPVV KMDHPNPDGK CILRSASKRM QVDNEQGYRG PGNANPCYEQ GCSMAGVDKQ ALMTSIKFWR PVPFNSRVLH QDFDGFGPFR MEVLGTVPLR SDGSFTAEVP CDTPFYMAGV DSEGRTLARD KIAMSLRPGE TRTCSGCHNH DDDNPPQFKD SEAAKVAPTP VPASGTLLNW TTDVWPLLRD NCDENFPEFN LSASTEEKAF NNFTHYAKGA AKPPWRSYFI NWLFARESLL YWRSAGERLD GRTDASRTDD IDYGTVAPKQ ACLNAAQLKI LADWIESGAY RKNNKW
|
| |