Gene Noc_0208 details

Gene Information

Locus tag: Noc_0208
Symbol:
ID: 3706243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 230782
End bp: 232821
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 52%
IMG OID: 637736724
Product: oligopeptidase A
Protein accession: YP_342268
Protein GI: 77163743
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
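
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 230782
end_bp = 232821
gene_length_bp = 2040
protein_length_aa = 679


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons should hold if the record is internally consistent.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.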


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATC CACTTCTTGA ATTTGCGGGC CTACCTCCAT TTTCGAAAAT TCAGCCAGCC 
CATGTGGAGC CTGCGATTGA TTGTCTGCTG GCAGAAGGGC GGGCCTTGAT TGAACAACTG
CTTACTCGCC ATACTGTCTA CACTTGGGAT AATCTAGCTC AGCCGCTAGA AGACTTGCGG
GAGCATCTTG ACCGAGTGTG GTCTCCTGTG TCCCATATGA ATGCGGTGGT TAATAGCGAT
GGATTACGGC GGGCTTACAA TGCCTGTTTG CCTAAACTTA GCGAGTTTGC CACTGAACTC
GGGCAAAACG AAAACTTGTA CCGGGCCTTT CAATCGATTG CTGAAGGCGA CGAGTATCCG
AGCCTGAATG TTCCCCAGAA AAAAATTATT GTCAATGCCC TGCGCGATTT TCGGCTTTCA
GGAGTGACTC TGCCATCGGA AAAGAAAGCT CGCTTTAAAG CCATCCAGCA GCAGTTAGCC
AGTTTAAACG CCAAATTTGA GGAAAATCTG CTGGATTCGA CCCAAGCTTG GCGTAAGCAC
CTTGCGGATG AGACAATCCT GGCGGGACTA CCCGAGGGAG CACGGGCCCA AGCGCGCCAA
GCAGCGGAAC AAGCGGGCCT GGAGGGTTGG TTGTTGACCT TGGAAGCACC TTCCTATGTT
GCGGTCACGA CCTACGCCGA TGACCGGGAA CTACGGGAAG AAATTTACAC CGCCTTTGTC
ACTCGCGGTT CGGATCAGGG CCCCCATGGA GGGCGCTGGG ATAATACTCA AGTCATGGAG
GAAATCTTGG CCTTGCGCCA TGAAGCGGCA CAGTTGCTTG GTTTTGCCAA TCATGCTGAA
TGCTCCCTAG CTACCAAAAT GGCCGGAAAT CCGCAGCAAG TATTAGATTT TCTTAATGAT
CTAGCCGTCC GCTCCAAACG AGTGGCCGAG CAGGATTTAG CTGAGGTCCG ATCCTTTGCC
CAGGCACACT ATGGTATTGA GGATCTCCAG GCGTGGGATG TGGCGTATTA TGGAGAAAAA
TTGCGGCAGC ACAAATATGC CATTTCCCAG GAGGAACTCA AGCCTTATTT CCCCGTCTGG
CGGGTGCTGG AAGGGCTATT TACTATCGTG AATCGACTCT ATGGCCTGGA AATTCAGGAA
CGAAAGGATG TGGATACTTG GCATCCCCAG GTGCGTTTTT TCGATATTTT CGATGACAGT
GGCGAACTGC GCGGGCAGTT CTATCTGGAT CTCTATGCGC GCAGCAACAA GCGAGGGGGA
GCATGGATGG CCGATTGTCT GTCCCGCAAG CGCCAGGGAA GTCAATTACA AATTCCAGTG
GCCTATTTGA CTTGTAACTT GACACCACCG GTAGATGATA AGCCAGCCCT GTTCACCCAT
AATGAGGTGA TCACGTTGTT TCACGAGTTT GGCCACGGTT TACATCACTT GCTCACCAAA
ATTGATTATC CCAGCGTGGC GGGAATTAGC GGTGTGTTCT GGGATGCAGT GGAATTGCCT
AGCCAGTTTA TGGAGAATTG GTGCTGGCAA CAAGAAGCGT TGGCCCTCAT TGCCTGCCAT
TTTGAAACCC ACGAACCTCT CCCGGAAAAA TTATTTGAGC GCATGCTAGC GGCCAAGAAT
TTCCTCTCGG GGATGATGAT AGTGCGCCAG CTTGAGTTTG CCTTGTTCGA TTTTCGCCTG
CACTTGGAGT ATGACCCCGC GAAGGGCGCC CGGGTTGATG AGCTGCTGCA GGAGGCGCGA
GAGCAGGTTG CCGTTGTCAA GCCGCCGTCC TTTAACCGTT TCGCCCATAG CTTCAGTCAT
ATCTTCGCCG GTGGTTATGC CGCTGGTTAC TATAGTTACA AGTGGGCCGA GGTGCTATCA
GCAGATGCCT TTTCCCGGTT TGAGGAAGAA GGTATTTTTG ATCGGCAGGC AGGCAGGGCT
TTCATGAGCA GCATCTTGGA GCAGGGAGGC AGCCGTGATC CCCTGGAACT ATTTATCGAA
TTTCGGGGCC GAGAGCCCGT CATTGACGCC TTGCTCCGCC ACAGTGGCAT TGCCGCCTGA
 
Protein sequence
MSNPLLEFAG LPPFSKIQPA HVEPAIDCLL AEGRALIEQL LTRHTVYTWD NLAQPLEDLR 
EHLDRVWSPV SHMNAVVNSD GLRRAYNACL PKLSEFATEL GQNENLYRAF QSIAEGDEYP
SLNVPQKKII VNALRDFRLS GVTLPSEKKA RFKAIQQQLA SLNAKFEENL LDSTQAWRKH
LADETILAGL PEGARAQARQ AAEQAGLEGW LLTLEAPSYV AVTTYADDRE LREEIYTAFV
TRGSDQGPHG GRWDNTQVME EILALRHEAA QLLGFANHAE CSLATKMAGN PQQVLDFLND
LAVRSKRVAE QDLAEVRSFA QAHYGIEDLQ AWDVAYYGEK LRQHKYAISQ EELKPYFPVW
RVLEGLFTIV NRLYGLEIQE RKDVDTWHPQ VRFFDIFDDS GELRGQFYLD LYARSNKRGG
AWMADCLSRK RQGSQLQIPV AYLTCNLTPP VDDKPALFTH NEVITLFHEF GHGLHHLLTK
IDYPSVAGIS GVFWDAVELP SQFMENWCWQ QEALALIACH FETHEPLPEK LFERMLAAKN
FLSGMMIVRQ LEFALFDFRL HLEYDPAKGA RVDELLQEAR EQVAVVKPPS FNRFAHSFSH
IFAGGYAAGY YSYKWAEVLS ADAFSRFEEE GIFDRQAGRA FMSSILEQGG SRDPLELFIE
FRGREPVIDA LLRHSGIAA