Gene Noc_2076 details


Gene Information

Locus tag: Noc_2076
Symbol:
ID: 3705247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2384753
End bp: 2387704
Gene Length: 2952 bp
Protein Length: 983 aa
Translation table: 11
GC content: 55%
IMG OID: 637738551
Product: peptidase M16-like
Protein accession: YP_344066
Protein GI: 77165541
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
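
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are inclusive (the record does not state this) and that the CDS ends in a single stop codon; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2384753, 2387704)
print(length, protein_length(length))  # 2952 983
```

Under those assumptions the coordinates reproduce the listed 2952 bp gene length and 983 aa protein length.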


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA CCGCCATCAT CAATCCCAAA ACACGTTCTT CAACCCATCC GGCATTTGAC 
AGGATACGCA GTCAACCGAT TGATTCTCTT AACCTTACTG TTGAGGAATA CCGCCACCGA
AAGACCGGCG CGAAACATTT CCATCTCGCC ACGGATAACC CAGAAAATGT ATTCTTGGTG
GCTTTTCCCA CAGTCCCCAC GGATTCCACA GGGGTAGCTC ATATTCTGGA GCATACTGTC
CTGTGCGGGA GCAGAAATTA TCCGGTGCGC GACCCTTTCT TTATGATGCT GCGGCGCTCC
CTCAATACTT TCATGAATGC CTTTACCAGC GCCGACTGGA CGGCCTATCC CTTTGCCAGC
AAAAATAAAA AAGATTTCAG CAACCTTCTA AAAATCTACC TGGACGCCGC CTTTTTCGCC
CGCCTCCATC CCCTAGACTT TGCCCAGGAG GGGCACCGGG TGGAGTTCGA AAACCCCACT
GATCCCGAGA CCGATTTAGT ATTTAAAGGC GTGGTATTCA ATGAAATGAA GGGGGCCATG
AGTTCGCCGG TGGCGACCCT TTGGCAAACC CTCTCCAGTC ATCTGTTCCC TACGACGACC
TATCACTATA ACAGTGGGGG CGATCCAGAA CGCATTCCCG ACCTCAGCCA TGAACAACTC
AAGAGTTTCT ACCAAACCCA TTACCATCCC TCCAATGCGG TGTTCATGAC TTTTGGCGAC
ATCCCAGCTC AGGAACACCA CCAAGCGTTC GAATCCCAAG CTCTCTCCGA GTTTGATCGG
CTTGAGATGA AGCTTAACGT GGGGGATGAA AAACGCTACT CCGCGCCGCT GCGAGTAGAA
GAAAGCTATG CCCTGGAAAC CGAGGACGCG GCCAATAAAA CCCACATTGT ACTGGGCTGG
TTGCTCGGCC GAAGCACCGA CCTAGAGGAG CAACTCAAGG CCCATCTGCT TTCCGGCGTA
TTACTGGATA ATAGCGCCTC CCCCTTGCGC CATGCCCTGG AGACTTGCGG CCTGGGCGCC
GCCCCTTCTC CCCTGTGCGG CCTGGAGGAT AACAACCGTG AAATGAGTTT TATCTGCGGC
CTTGAGGGGA CTCAGCCGGA GCATGCCGAG GCGCTGGAAC AACGGGTGCT AGAAGTCCTG
CGAGAAGTGG CCGAACAAGG CGTTCCCCAG GAACAAGTTG AAGCCGTGCT GCACCAATTG
GAACTGCATC AACGGGAGAT CGGTGGCGAT GGAATGCCCT ATGGCCTTCA GCTGATACTC
GAAGGGCTAT CGAGCGCTAT TCACAATGGC GACCCCGTAG CTCTACTCAA TCTCGATCCG
GTGCTGGAAA AACTGCGCCA GGAGATCAAA GACCCTGGTT TCATTAAGAG TTTGGTACAG
GAGAATCTTC TTGGCAACCT TCACCGGGTA CGCTTGACCC TCAAGCCTGA TCCTAGCCTC
GGCGCCCGCC GCGCTAAAGC GGAAAAAGCC CGCCTCGCGG CCTTGAAGGC AGCCATGGAC
GAGGAGCAAA AAGCAGCCGT GGTAAAACTG GCTGCCGAGC TTGCGGCCCG CCAACAACAG
CCAGATGACC CGGATTTTCT GCCCAAGGTG GGGATCGAAG ACATTCCCGC TACCCTTTCC
ATTCCCCAGG GTATTCCGGA AACGGCTGGT AACCTACCGG CCACCTTTTT TGCTCAGGGC
ACGAATGGCT TGGCATATCA ACAGATCGTC ATTGACATGC CCCACCTGGA AGATGAACTG
CTGGAGGTAT TACCCCATTA TACCGCCTGT CTCACGGAAT TGGGGGTCGG CAACCGGGAT
TACCGTCAAA CCCAAGCTTG GCAAGATAGC ATCAGTGGTG GCATCAACGC CAGCACCACC
TTGCGGGGCC AAATAGACAA TGTCCAACAG GTAAATGGCC ATTTTGTCCT GTCCAGTAAA
GCCCTCGCTG CTAACCATGC GCAACTCACC GAGCTCCTGC AAACCACACT GGGGGAGGTG
CGATTTGATG AACTGGACCA CCTCCGGGAA GTGATTGCCC AGCGCCGGGC CGAGTGGGAA
GATCAGATCA CCGGCAGTGG CCATGCCCTC GCCATGGCAG CGGCGGCCAG CGGGATGAGT
CCAACCGCCG CCCTGACCCA TCGCCTGACC GGGCTGGCGG GAATTTCCTT GCTGCAACAA
CTGGATGAAA GTCTCGACAG TAAAGCCGCC CGCCAAGCGC TAGCCGATAA ATTCCGCCAT
ATCCACGATC GCCTCCTCGC CGCTCCCCGT CAGTGGTTAC TCATCGGCGA ACAAGAATAT
CGCTCAGAAT TTTTAGCGGC CTTGAGTCAG CGCGGATCTT CCAATTCAGA AACCGGAACG
AAGTTTACCC CTTTGCGCCT GCCAGAAGTG CGCGCTTCCG TGGGCCAAGC CTGGACCACG
AGCACGCAAG TTAATTTCTG TGCTAAGGCT TATCCTACCG TGCCTGTCGG CCATTCCGAT
GCGGCAGCCC TTACGGTACT GGGAGGATTC CTCCGCAATA ACTACCTTCA CCGGGCCATT
CGCGAACAGG GTGGCGCCTA TGGCGGCGGC GCCGGGCAAG ATTCGGACAG CGCGGCTTTT
CGATTCTTTT CCTATCGGGA CCCACGTCTC GCCGAGACCT TGGAGGATTT TGACCGATCA
GTACAATGGT TGCTGGAAAA CGACCATGAA TGGCGCCTTG TGGAAGAAGC GATTCTCGGA
GTCATCAGCG CCATCGACAA ACCCAAATCA CCTTCTGGTG ACGCCAAGAG CGCCTTCTAT
AACAGCCTTT ACGGCCGCAC CCCTGAGCAG CGCCGCCGCT TCCGGAGCCA AATACTGGAG
GTGCGTCTTG AAGATCTTAA ACGGGTTGCT GAAAACTACC TCAAGCCGGA GAATGCCAGC
ATTGCCGTGC TTACCAATGC TACCCAGCTT GAACAGCTGG CTGGGCTGGA GTTGGTCACC
TATAAAGTAT GA
 
Protein sequence
MNNTAIINPK TRSSTHPAFD RIRSQPIDSL NLTVEEYRHR KTGAKHFHLA TDNPENVFLV 
AFPTVPTDST GVAHILEHTV LCGSRNYPVR DPFFMMLRRS LNTFMNAFTS ADWTAYPFAS
KNKKDFSNLL KIYLDAAFFA RLHPLDFAQE GHRVEFENPT DPETDLVFKG VVFNEMKGAM
SSPVATLWQT LSSHLFPTTT YHYNSGGDPE RIPDLSHEQL KSFYQTHYHP SNAVFMTFGD
IPAQEHHQAF ESQALSEFDR LEMKLNVGDE KRYSAPLRVE ESYALETEDA ANKTHIVLGW
LLGRSTDLEE QLKAHLLSGV LLDNSASPLR HALETCGLGA APSPLCGLED NNREMSFICG
LEGTQPEHAE ALEQRVLEVL REVAEQGVPQ EQVEAVLHQL ELHQREIGGD GMPYGLQLIL
EGLSSAIHNG DPVALLNLDP VLEKLRQEIK DPGFIKSLVQ ENLLGNLHRV RLTLKPDPSL
GARRAKAEKA RLAALKAAMD EEQKAAVVKL AAELAARQQQ PDDPDFLPKV GIEDIPATLS
IPQGIPETAG NLPATFFAQG TNGLAYQQIV IDMPHLEDEL LEVLPHYTAC LTELGVGNRD
YRQTQAWQDS ISGGINASTT LRGQIDNVQQ VNGHFVLSSK ALAANHAQLT ELLQTTLGEV
RFDELDHLRE VIAQRRAEWE DQITGSGHAL AMAAAASGMS PTAALTHRLT GLAGISLLQQ
LDESLDSKAA RQALADKFRH IHDRLLAAPR QWLLIGEQEY RSEFLAALSQ RGSSNSETGT
KFTPLRLPEV RASVGQAWTT STQVNFCAKA YPTVPVGHSD AAALTVLGGF LRNNYLHRAI
REQGGAYGGG AGQDSDSAAF RFFSYRDPRL AETLEDFDRS VQWLLENDHE WRLVEEAILG
VISAIDKPKS PSGDAKSAFY NSLYGRTPEQ RRRFRSQILE VRLEDLKRVA ENYLKPENAS
IAVLTNATQL EQLAGLELVT YKV
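
Both sequences are printed in 10-character blocks. As a rough sketch of how they relate, the code below strips that grouping and translates the first and last codons of the gene sequence. It assumes the amino-acid assignments of translation table 11 are the same as the standard genetic code (the two differ in permitted start codons); the helper names are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon table, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


head = clean_sequence("ATGAATAATA CCGCCATCAT CAATCCCAAA")  # first 30 nt
tail = clean_sequence("TATAAAGTAT GA")                     # final line
print(translate(head))  # MNNTAIINPK
print(translate(tail))  # YKV
```

The final line can be translated on its own because the 49 preceding lines hold 60 nt each, so it begins on a codon boundary. The two results match the first block and the last three residues of the protein sequence.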