Gene Noc_1768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1768 
Symbol 
ID3704785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1990694 
End bp1992943 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content52% 
IMG OID637738251 
Productextracellular solute-binding protein 
Protein accessionYP_343770 
Protein GI77165245 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.520524 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTATA ATGGATATTT CATGAGCATG TTAGGAGGCC CCTTGGTATT TTCTCTCGTG 
CGCTGGTTGG GAGTCTTCGC TTTGCCGTGG TTGACGGCTT GTAGCGGAGA AGTATTGAAC
AGTCCTTATC CTGCCGCCGG CAAGGTCCAA AATGTCGCCT ATTCAAGCTT CAATTTACGC
CCTAAGACCC TGGACCCGGC TCGTTCCTAT AGTGCTAATG AAATAGTCTT TACTGGCCAG
ATTTATGAGC CTCCCCTGCA ATATCATTAT CTCTTGCGCC CCTACAGCCT GGAGCCTCTG
ACCGCTCAAG CCATGCCCCA GGTGACTTAC GTGGACGCCG CCGGCAATCC TCTTCCCCCA
GAAGCCCCGT TTAGAGAGGT AGCCTATAGC ATTTATGAAA TTCAGATTCA GCCAGGCATT
CATTATCAAC CCCATCCCGC ATTTGCCAAG GATGAAACGG GCCGGTTTCT CTATCATGAG
CTAAGCCCTG GAGAGTTGGC TGGAATCTAT AAGCTTAGCG ATTTTCCCCA TAGGGGTAGC
CGGGAATTGG TAGCGGCCGA TTATGTTTAC CAAATCAAAC GCCTGGCTTC CCCCTGGGTG
CATTCCCCCA TTTTGGGCCT GATGAGCCGT TATATCGTGG GAATGAAGAC CTATACCCAA
ACTTTGGTGG CGGCTCAAGA GAAGGACAAG GGAAACTATT TAGATCTTCG CGCTTACCCT
CTTCGGGGAG CGGAAGTGGT GGACCGCTAT ACTTATCGCT TGACCATCGA AGGTAAATAT
CCCCAACTGC GCTACTGGCT GGCCATGCCT TTTTTTGCTC CGGTGCCCTG GGAGGCGGAT
CGGTTTTATG CTCAGTCAGG CATGGCGGAA CGCAATTTGA ACCTTGATTG GTATCCTGTC
GGCACGGGTC CCTATATGCT CACGGAAAAT GACCCTAACC GCCGCATGGT GCTGGAGCGT
AATCCGAACT TCCATGGCGA GACTTATCCT GCTCAGGGGA TGCCAGGCGA TAAAGCGGCT
GGTCTCCTGG TGGACGGGGG CGAGTCCTTG CCTTTCATCG ATCGGGTCGT GTTTAGCCTG
GAAAAGGAGA GTATCCCCTA TTGGAATAAA TTCTTGCAAG GCTACTACGA TACGTCCGCA
GTCACCTCGG ATAGTTTTGA TCAGGCTTTA CGTATTGCTG GAGGAGGAGA AGAGCTGACT
TTGACTGAGG AGATGAAAAC CAAGGGGATC AAGCTGGTTA CGGCTATCGG GACTTCCATC
TCTTATCTGG GTTTTAATAT GCTGGACCCG ATAGTCGGGG GCGATAGCGA GCGGGCACGT
AAACTGCGCC AGGCCATTTC CATTGCCATT GACTATGAGG AGTTTATTTC CATTTTTGCT
AATAGCCAGG GGATTGCCGC CCAGGGACCT TTACCGCCCG GGATTTTTGG CCACCAAAGT
GGTAAAGAGG GTATTAATCC CCATGTTTAT AACTGGAAGA ACGGACAGCC TCGCCGCAAG
TCTCTCCAGA CAGCCCGCCG GCTCTTGATT GAGGCCGGTT ATCCGAATGG CCGGGACGCC
GAGAGCGGCA AACCTCTTTT ATTATATTTC GATACCACGG GTAAGGGCCC GGACAGTGCG
TCTTTAGTAA GTTGGATGCG GAAGCAATTC CAAAAGTTGA ATATCCAATT AGTTGTGCGC
GAGACTGATT ACAACCGCTT TCAAGATAAA ATGCGTCAGG GAAATGCCCA GATTTTTCAA
TGGGGATGGA CTGCTGATTA TCCCGATCCA GAAAACTTTC TCTTTTTGCT CTATGGGCCG
GAGGGCAAAG TTCGCCATGG GGGGGAGAAT GCGGCCAACT ACAGCAATCC TGAGTTTAAC
CGGCTTTTTC AAGAAATGAA AAGCATGGAA AATGGCCCGG AGCGCCTGAC CAAGATTTGG
AAGATGGTGG CTATTGTTCG CCGGGATGCC CCTTGGGTAT GGGGGTTTCA TCCTAAGGAG
GTGAGCTTGC TCCATGCCTG GAATTTCAAT GTCCAGCCTA ATTTAATGGC TAATAATACC
CTCAAATACC GCCGCATTGA TCCTCAACTG CGGGCGCGGC TGCGGAAGGA ATGGAATCGT
CCCTTGCTCT GGCCCCTGGG GGCACTGTTA GCCGTCCTTG TTCTGGGAGC AGCGCCTGCA
GTGGTCACCT ACTGGCGCAA AGAGTATCGG CCGGGCTGGG CAGTGGTGCC AGGAGAAGGC
AGAGGAAGTC AGAGAAAAGT GCCTGGATAA
 
Protein sequence
MNYNGYFMSM LGGPLVFSLV RWLGVFALPW LTACSGEVLN SPYPAAGKVQ NVAYSSFNLR 
PKTLDPARSY SANEIVFTGQ IYEPPLQYHY LLRPYSLEPL TAQAMPQVTY VDAAGNPLPP
EAPFREVAYS IYEIQIQPGI HYQPHPAFAK DETGRFLYHE LSPGELAGIY KLSDFPHRGS
RELVAADYVY QIKRLASPWV HSPILGLMSR YIVGMKTYTQ TLVAAQEKDK GNYLDLRAYP
LRGAEVVDRY TYRLTIEGKY PQLRYWLAMP FFAPVPWEAD RFYAQSGMAE RNLNLDWYPV
GTGPYMLTEN DPNRRMVLER NPNFHGETYP AQGMPGDKAA GLLVDGGESL PFIDRVVFSL
EKESIPYWNK FLQGYYDTSA VTSDSFDQAL RIAGGGEELT LTEEMKTKGI KLVTAIGTSI
SYLGFNMLDP IVGGDSERAR KLRQAISIAI DYEEFISIFA NSQGIAAQGP LPPGIFGHQS
GKEGINPHVY NWKNGQPRRK SLQTARRLLI EAGYPNGRDA ESGKPLLLYF DTTGKGPDSA
SLVSWMRKQF QKLNIQLVVR ETDYNRFQDK MRQGNAQIFQ WGWTADYPDP ENFLFLLYGP
EGKVRHGGEN AANYSNPEFN RLFQEMKSME NGPERLTKIW KMVAIVRRDA PWVWGFHPKE
VSLLHAWNFN VQPNLMANNT LKYRRIDPQL RARLRKEWNR PLLWPLGALL AVLVLGAAPA
VVTYWRKEYR PGWAVVPGEG RGSQRKVPG