Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2076 |
Symbol | |
ID | 3705247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2384753 |
End bp | 2387704 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637738551 |
Product | peptidase M16-like |
Protein accession | YP_344066 |
Protein GI | 77165541 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAATA CCGCCATCAT CAATCCCAAA ACACGTTCTT CAACCCATCC GGCATTTGAC AGGATACGCA GTCAACCGAT TGATTCTCTT AACCTTACTG TTGAGGAATA CCGCCACCGA AAGACCGGCG CGAAACATTT CCATCTCGCC ACGGATAACC CAGAAAATGT ATTCTTGGTG GCTTTTCCCA CAGTCCCCAC GGATTCCACA GGGGTAGCTC ATATTCTGGA GCATACTGTC CTGTGCGGGA GCAGAAATTA TCCGGTGCGC GACCCTTTCT TTATGATGCT GCGGCGCTCC CTCAATACTT TCATGAATGC CTTTACCAGC GCCGACTGGA CGGCCTATCC CTTTGCCAGC AAAAATAAAA AAGATTTCAG CAACCTTCTA AAAATCTACC TGGACGCCGC CTTTTTCGCC CGCCTCCATC CCCTAGACTT TGCCCAGGAG GGGCACCGGG TGGAGTTCGA AAACCCCACT GATCCCGAGA CCGATTTAGT ATTTAAAGGC GTGGTATTCA ATGAAATGAA GGGGGCCATG AGTTCGCCGG TGGCGACCCT TTGGCAAACC CTCTCCAGTC ATCTGTTCCC TACGACGACC TATCACTATA ACAGTGGGGG CGATCCAGAA CGCATTCCCG ACCTCAGCCA TGAACAACTC AAGAGTTTCT ACCAAACCCA TTACCATCCC TCCAATGCGG TGTTCATGAC TTTTGGCGAC ATCCCAGCTC AGGAACACCA CCAAGCGTTC GAATCCCAAG CTCTCTCCGA GTTTGATCGG CTTGAGATGA AGCTTAACGT GGGGGATGAA AAACGCTACT CCGCGCCGCT GCGAGTAGAA GAAAGCTATG CCCTGGAAAC CGAGGACGCG GCCAATAAAA CCCACATTGT ACTGGGCTGG TTGCTCGGCC GAAGCACCGA CCTAGAGGAG CAACTCAAGG CCCATCTGCT TTCCGGCGTA TTACTGGATA ATAGCGCCTC CCCCTTGCGC CATGCCCTGG AGACTTGCGG CCTGGGCGCC GCCCCTTCTC CCCTGTGCGG CCTGGAGGAT AACAACCGTG AAATGAGTTT TATCTGCGGC CTTGAGGGGA CTCAGCCGGA GCATGCCGAG GCGCTGGAAC AACGGGTGCT AGAAGTCCTG CGAGAAGTGG CCGAACAAGG CGTTCCCCAG GAACAAGTTG AAGCCGTGCT GCACCAATTG GAACTGCATC AACGGGAGAT CGGTGGCGAT GGAATGCCCT ATGGCCTTCA GCTGATACTC GAAGGGCTAT CGAGCGCTAT TCACAATGGC GACCCCGTAG CTCTACTCAA TCTCGATCCG GTGCTGGAAA AACTGCGCCA GGAGATCAAA GACCCTGGTT TCATTAAGAG TTTGGTACAG GAGAATCTTC TTGGCAACCT TCACCGGGTA CGCTTGACCC TCAAGCCTGA TCCTAGCCTC GGCGCCCGCC GCGCTAAAGC GGAAAAAGCC CGCCTCGCGG CCTTGAAGGC AGCCATGGAC GAGGAGCAAA AAGCAGCCGT GGTAAAACTG GCTGCCGAGC TTGCGGCCCG CCAACAACAG CCAGATGACC CGGATTTTCT GCCCAAGGTG GGGATCGAAG ACATTCCCGC TACCCTTTCC ATTCCCCAGG GTATTCCGGA AACGGCTGGT AACCTACCGG CCACCTTTTT TGCTCAGGGC ACGAATGGCT TGGCATATCA ACAGATCGTC ATTGACATGC CCCACCTGGA AGATGAACTG CTGGAGGTAT TACCCCATTA TACCGCCTGT CTCACGGAAT TGGGGGTCGG CAACCGGGAT TACCGTCAAA CCCAAGCTTG GCAAGATAGC ATCAGTGGTG GCATCAACGC CAGCACCACC TTGCGGGGCC AAATAGACAA TGTCCAACAG GTAAATGGCC ATTTTGTCCT GTCCAGTAAA GCCCTCGCTG CTAACCATGC GCAACTCACC GAGCTCCTGC AAACCACACT GGGGGAGGTG CGATTTGATG AACTGGACCA CCTCCGGGAA GTGATTGCCC AGCGCCGGGC CGAGTGGGAA GATCAGATCA CCGGCAGTGG CCATGCCCTC GCCATGGCAG CGGCGGCCAG CGGGATGAGT CCAACCGCCG CCCTGACCCA TCGCCTGACC GGGCTGGCGG GAATTTCCTT GCTGCAACAA CTGGATGAAA GTCTCGACAG TAAAGCCGCC CGCCAAGCGC TAGCCGATAA ATTCCGCCAT ATCCACGATC GCCTCCTCGC CGCTCCCCGT CAGTGGTTAC TCATCGGCGA ACAAGAATAT CGCTCAGAAT TTTTAGCGGC CTTGAGTCAG CGCGGATCTT CCAATTCAGA AACCGGAACG AAGTTTACCC CTTTGCGCCT GCCAGAAGTG CGCGCTTCCG TGGGCCAAGC CTGGACCACG AGCACGCAAG TTAATTTCTG TGCTAAGGCT TATCCTACCG TGCCTGTCGG CCATTCCGAT GCGGCAGCCC TTACGGTACT GGGAGGATTC CTCCGCAATA ACTACCTTCA CCGGGCCATT CGCGAACAGG GTGGCGCCTA TGGCGGCGGC GCCGGGCAAG ATTCGGACAG CGCGGCTTTT CGATTCTTTT CCTATCGGGA CCCACGTCTC GCCGAGACCT TGGAGGATTT TGACCGATCA GTACAATGGT TGCTGGAAAA CGACCATGAA TGGCGCCTTG TGGAAGAAGC GATTCTCGGA GTCATCAGCG CCATCGACAA ACCCAAATCA CCTTCTGGTG ACGCCAAGAG CGCCTTCTAT AACAGCCTTT ACGGCCGCAC CCCTGAGCAG CGCCGCCGCT TCCGGAGCCA AATACTGGAG GTGCGTCTTG AAGATCTTAA ACGGGTTGCT GAAAACTACC TCAAGCCGGA GAATGCCAGC ATTGCCGTGC TTACCAATGC TACCCAGCTT GAACAGCTGG CTGGGCTGGA GTTGGTCACC TATAAAGTAT GA
|
Protein sequence | MNNTAIINPK TRSSTHPAFD RIRSQPIDSL NLTVEEYRHR KTGAKHFHLA TDNPENVFLV AFPTVPTDST GVAHILEHTV LCGSRNYPVR DPFFMMLRRS LNTFMNAFTS ADWTAYPFAS KNKKDFSNLL KIYLDAAFFA RLHPLDFAQE GHRVEFENPT DPETDLVFKG VVFNEMKGAM SSPVATLWQT LSSHLFPTTT YHYNSGGDPE RIPDLSHEQL KSFYQTHYHP SNAVFMTFGD IPAQEHHQAF ESQALSEFDR LEMKLNVGDE KRYSAPLRVE ESYALETEDA ANKTHIVLGW LLGRSTDLEE QLKAHLLSGV LLDNSASPLR HALETCGLGA APSPLCGLED NNREMSFICG LEGTQPEHAE ALEQRVLEVL REVAEQGVPQ EQVEAVLHQL ELHQREIGGD GMPYGLQLIL EGLSSAIHNG DPVALLNLDP VLEKLRQEIK DPGFIKSLVQ ENLLGNLHRV RLTLKPDPSL GARRAKAEKA RLAALKAAMD EEQKAAVVKL AAELAARQQQ PDDPDFLPKV GIEDIPATLS IPQGIPETAG NLPATFFAQG TNGLAYQQIV IDMPHLEDEL LEVLPHYTAC LTELGVGNRD YRQTQAWQDS ISGGINASTT LRGQIDNVQQ VNGHFVLSSK ALAANHAQLT ELLQTTLGEV RFDELDHLRE VIAQRRAEWE DQITGSGHAL AMAAAASGMS PTAALTHRLT GLAGISLLQQ LDESLDSKAA RQALADKFRH IHDRLLAAPR QWLLIGEQEY RSEFLAALSQ RGSSNSETGT KFTPLRLPEV RASVGQAWTT STQVNFCAKA YPTVPVGHSD AAALTVLGGF LRNNYLHRAI REQGGAYGGG AGQDSDSAAF RFFSYRDPRL AETLEDFDRS VQWLLENDHE WRLVEEAILG VISAIDKPKS PSGDAKSAFY NSLYGRTPEQ RRRFRSQILE VRLEDLKRVA ENYLKPENAS IAVLTNATQL EQLAGLELVT YKV
|
| |