Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_1768 |
Symbol | |
ID | 3704785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 1990694 |
End bp | 1992943 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637738251 |
Product | extracellular solute-binding protein |
Protein accession | YP_343770 |
Protein GI | 77165245 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.520524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTATA ATGGATATTT CATGAGCATG TTAGGAGGCC CCTTGGTATT TTCTCTCGTG CGCTGGTTGG GAGTCTTCGC TTTGCCGTGG TTGACGGCTT GTAGCGGAGA AGTATTGAAC AGTCCTTATC CTGCCGCCGG CAAGGTCCAA AATGTCGCCT ATTCAAGCTT CAATTTACGC CCTAAGACCC TGGACCCGGC TCGTTCCTAT AGTGCTAATG AAATAGTCTT TACTGGCCAG ATTTATGAGC CTCCCCTGCA ATATCATTAT CTCTTGCGCC CCTACAGCCT GGAGCCTCTG ACCGCTCAAG CCATGCCCCA GGTGACTTAC GTGGACGCCG CCGGCAATCC TCTTCCCCCA GAAGCCCCGT TTAGAGAGGT AGCCTATAGC ATTTATGAAA TTCAGATTCA GCCAGGCATT CATTATCAAC CCCATCCCGC ATTTGCCAAG GATGAAACGG GCCGGTTTCT CTATCATGAG CTAAGCCCTG GAGAGTTGGC TGGAATCTAT AAGCTTAGCG ATTTTCCCCA TAGGGGTAGC CGGGAATTGG TAGCGGCCGA TTATGTTTAC CAAATCAAAC GCCTGGCTTC CCCCTGGGTG CATTCCCCCA TTTTGGGCCT GATGAGCCGT TATATCGTGG GAATGAAGAC CTATACCCAA ACTTTGGTGG CGGCTCAAGA GAAGGACAAG GGAAACTATT TAGATCTTCG CGCTTACCCT CTTCGGGGAG CGGAAGTGGT GGACCGCTAT ACTTATCGCT TGACCATCGA AGGTAAATAT CCCCAACTGC GCTACTGGCT GGCCATGCCT TTTTTTGCTC CGGTGCCCTG GGAGGCGGAT CGGTTTTATG CTCAGTCAGG CATGGCGGAA CGCAATTTGA ACCTTGATTG GTATCCTGTC GGCACGGGTC CCTATATGCT CACGGAAAAT GACCCTAACC GCCGCATGGT GCTGGAGCGT AATCCGAACT TCCATGGCGA GACTTATCCT GCTCAGGGGA TGCCAGGCGA TAAAGCGGCT GGTCTCCTGG TGGACGGGGG CGAGTCCTTG CCTTTCATCG ATCGGGTCGT GTTTAGCCTG GAAAAGGAGA GTATCCCCTA TTGGAATAAA TTCTTGCAAG GCTACTACGA TACGTCCGCA GTCACCTCGG ATAGTTTTGA TCAGGCTTTA CGTATTGCTG GAGGAGGAGA AGAGCTGACT TTGACTGAGG AGATGAAAAC CAAGGGGATC AAGCTGGTTA CGGCTATCGG GACTTCCATC TCTTATCTGG GTTTTAATAT GCTGGACCCG ATAGTCGGGG GCGATAGCGA GCGGGCACGT AAACTGCGCC AGGCCATTTC CATTGCCATT GACTATGAGG AGTTTATTTC CATTTTTGCT AATAGCCAGG GGATTGCCGC CCAGGGACCT TTACCGCCCG GGATTTTTGG CCACCAAAGT GGTAAAGAGG GTATTAATCC CCATGTTTAT AACTGGAAGA ACGGACAGCC TCGCCGCAAG TCTCTCCAGA CAGCCCGCCG GCTCTTGATT GAGGCCGGTT ATCCGAATGG CCGGGACGCC GAGAGCGGCA AACCTCTTTT ATTATATTTC GATACCACGG GTAAGGGCCC GGACAGTGCG TCTTTAGTAA GTTGGATGCG GAAGCAATTC CAAAAGTTGA ATATCCAATT AGTTGTGCGC GAGACTGATT ACAACCGCTT TCAAGATAAA ATGCGTCAGG GAAATGCCCA GATTTTTCAA TGGGGATGGA CTGCTGATTA TCCCGATCCA GAAAACTTTC TCTTTTTGCT CTATGGGCCG GAGGGCAAAG TTCGCCATGG GGGGGAGAAT GCGGCCAACT ACAGCAATCC TGAGTTTAAC CGGCTTTTTC AAGAAATGAA AAGCATGGAA AATGGCCCGG AGCGCCTGAC CAAGATTTGG AAGATGGTGG CTATTGTTCG CCGGGATGCC CCTTGGGTAT GGGGGTTTCA TCCTAAGGAG GTGAGCTTGC TCCATGCCTG GAATTTCAAT GTCCAGCCTA ATTTAATGGC TAATAATACC CTCAAATACC GCCGCATTGA TCCTCAACTG CGGGCGCGGC TGCGGAAGGA ATGGAATCGT CCCTTGCTCT GGCCCCTGGG GGCACTGTTA GCCGTCCTTG TTCTGGGAGC AGCGCCTGCA GTGGTCACCT ACTGGCGCAA AGAGTATCGG CCGGGCTGGG CAGTGGTGCC AGGAGAAGGC AGAGGAAGTC AGAGAAAAGT GCCTGGATAA
|
Protein sequence | MNYNGYFMSM LGGPLVFSLV RWLGVFALPW LTACSGEVLN SPYPAAGKVQ NVAYSSFNLR PKTLDPARSY SANEIVFTGQ IYEPPLQYHY LLRPYSLEPL TAQAMPQVTY VDAAGNPLPP EAPFREVAYS IYEIQIQPGI HYQPHPAFAK DETGRFLYHE LSPGELAGIY KLSDFPHRGS RELVAADYVY QIKRLASPWV HSPILGLMSR YIVGMKTYTQ TLVAAQEKDK GNYLDLRAYP LRGAEVVDRY TYRLTIEGKY PQLRYWLAMP FFAPVPWEAD RFYAQSGMAE RNLNLDWYPV GTGPYMLTEN DPNRRMVLER NPNFHGETYP AQGMPGDKAA GLLVDGGESL PFIDRVVFSL EKESIPYWNK FLQGYYDTSA VTSDSFDQAL RIAGGGEELT LTEEMKTKGI KLVTAIGTSI SYLGFNMLDP IVGGDSERAR KLRQAISIAI DYEEFISIFA NSQGIAAQGP LPPGIFGHQS GKEGINPHVY NWKNGQPRRK SLQTARRLLI EAGYPNGRDA ESGKPLLLYF DTTGKGPDSA SLVSWMRKQF QKLNIQLVVR ETDYNRFQDK MRQGNAQIFQ WGWTADYPDP ENFLFLLYGP EGKVRHGGEN AANYSNPEFN RLFQEMKSME NGPERLTKIW KMVAIVRRDA PWVWGFHPKE VSLLHAWNFN VQPNLMANNT LKYRRIDPQL RARLRKEWNR PLLWPLGALL AVLVLGAAPA VVTYWRKEYR PGWAVVPGEG RGSQRKVPG
|
| |