Gene Information |
Locus tag | Noc_2053 |
Symbol | |
ID | 3705029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2361833 |
End bp | 2363671 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637738528 |
Product | glycoside hydrolase 15-like protein |
Protein accession | YP_344043 |
Protein GI | 77165518 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
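The length fields above follow from the coordinates. Assuming inclusive 1-based coordinates (an assumption about this record's convention, though it is consistent with the listed values), End bp minus Start bp plus one gives the gene length, and one third of that minus the terminal stop codon gives the protein length. A minimal Python sketch of the check; the function names are illustrative and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS ends with exactly one stop codon, which is not
    counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2361833, 2363671)
print(length, protein_length(length))  # 1839 612
```

Both results agree with the Gene Length and Protein Length rows of this record.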
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.440439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAAAC ACAATTACCC TGCCATTAGT GATTATGGCT ATATTTCCGA TTGCCATTCC TCCGCCCTGA TCTCGAAATC CGGTTCCATC GACTGGTGCT GCATGCCACG GGTGGATTCC CGCAGCTGTT TTGGCCGTCT TCTGGGCTGG GAGCAAGGCG GGTACTGTCA AATCGCTCCG CCAGAACCCC ATGAAGTATC CCGCCGTTAC CTCCCTCAGA CGCTGATTCT GGAAACCACC TTTCGAACGA GCGAGGGAGA AGCCCGTCTC CTGGATTGTT TTACGCTGCG GGAAGGGGGG AAACAGCACC CTCACCGGCA GATCCTCAGG GTACTTGAAG GGCTAAAAGG GCAGGTGAGC TTCCGCGTCG ATATTGCGCC ACGCTTCGAT TACGGCGCTA TCAAACCTTG GATTCAGCGG CGCCATAATA ATCATTCTGG CGATTATTAT ATTGTGATAG GAGGCAGTGA TGGGTTACTC ATCTCCAACG ATTTTCGCCT GGAAATGAAG GATCGCCATA ATCTTCAAGG CGCTTGCCAT ATTAAAGAAG GGCAACGAGT TCATCTCTCC CTTTTATACC GGCGCCCGGA AAGTCTTGAT GAAGGCTGGG CTAATATCCC TACTATCGAA ACGCTGGACC AGCGCTTGGA AGAAACTATC AAATGGTGGC ATGCCTGGTT CTCCCAGGGC GAATTTAACG GCCCCCATGC TGAACAAGCG CAGCGTTCGG CCCTTGTCCT CAAGGGCTTA TGCAATGCGC CTACGGGGGC TATCGCGGCG GCCTCCACAA CATCTCTTCC GGAAGCGCCC GGCGGGGAGC GGAACTGGGA TTATCGTTTT ACTTGGATTC GGGACTCTAC CTTTACGGTC AGATCACTGG CGGATCTTGG ATATATCAAG GAGGCAGATG GTTTTCGCCG TTTTATCGAG CGGACTGCAG CTGGATGTGC GGACGAAGTC CAGATTTTAT TTGGTGTGGG GGGAGAACGG CGACTGCATG AATTTGAGAT TAAAGAATTG CCAGGATACC GGGGAGCAAA GCCAGTGCGC CAAGGCAATG CGGCGGAAAA ACAAATCCAA CTAGATGTTT ATGGAGAATT ATTGGAGTTA GCCTGGCGCT GGCGCCAGCG GGGGCAAACC CCAGACGAAG ATTATTGGGA ATTCCTAGCG GGCCTTGTGA ATGCAGCGGG TGAGCGTTGG AAAGAGCCAG ATCAAGGTCT TTGGGAGATG CGCGGTGAAC CCCGTCATTT CGTCCACTCC AAGGTCATGT GTTGGGCGGC CTTAGATCGA GGGATCAAAC TGGCTGCAGA CCTTGATAAT CATGCGCCTC TTGAGTGGTG GAAGCAGGAA CGGAAAGCGG TCCGCCAAGC AGTGGAAGAG AAGGGCTATG ATTTCCAGCG CGGTATTTTT ATTCAGGCCT TTGATCATGT TGAGATGGAC GCGGGTTTAT TGTTATTGCC CGTGGTGGGA TTCGTGGATT ATCAGGACGA ACGCATGATA CGGACCACAA ACGCCGTATG GCGGGACTTG GAACAAGAAG GTCTGCTGCG CCGCTATAGA GCGGAAAGTC ACGATGATGG CCTGCAGGGC AAGGAAGGCG TGTTTCTGGC TTGCTCCTTT TGGCTGGCGG AATGCCTGGC TTACCAAGGC CGCCTGGAAG AGGCGCGCGA GGTGTTCACG CAGGCAGCGG CTACCGGCAA TGATCTTGGC CTTTATTCAG AGGAATACGA TACCGAAAAA AAGGAGATGT TGGGCAACTT TCCCCAAGGT TTGACTCACC TTTCCCTGAT TGCCGCCGCG 
GTAGCCCTGT CAAAGGTGGC AGAAGTGGGA GGGAACTAA
|
Protein sequence | MDKHNYPAIS DYGYISDCHS SALISKSGSI DWCCMPRVDS RSCFGRLLGW EQGGYCQIAP PEPHEVSRRY LPQTLILETT FRTSEGEARL LDCFTLREGG KQHPHRQILR VLEGLKGQVS FRVDIAPRFD YGAIKPWIQR RHNNHSGDYY IVIGGSDGLL ISNDFRLEMK DRHNLQGACH IKEGQRVHLS LLYRRPESLD EGWANIPTIE TLDQRLEETI KWWHAWFSQG EFNGPHAEQA QRSALVLKGL CNAPTGAIAA ASTTSLPEAP GGERNWDYRF TWIRDSTFTV RSLADLGYIK EADGFRRFIE RTAAGCADEV QILFGVGGER RLHEFEIKEL PGYRGAKPVR QGNAAEKQIQ LDVYGELLEL AWRWRQRGQT PDEDYWEFLA GLVNAAGERW KEPDQGLWEM RGEPRHFVHS KVMCWAALDR GIKLAADLDN HAPLEWWKQE RKAVRQAVEE KGYDFQRGIF IQAFDHVEMD AGLLLLPVVG FVDYQDERMI RTTNAVWRDL EQEGLLRRYR AESHDDGLQG KEGVFLACSF WLAECLAYQG RLEEAREVFT QAAATGNDLG LYSEEYDTEK KEMLGNFPQG LTHLSLIAAA VALSKVAEVG GN
|
| |
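The protein sequence can be recovered from the gene sequence by codon-wise translation. The sketch below is a hedged illustration rather than the tool that produced this record: it builds the standard codon table, whose amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The helper names `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGGATAAAC ACAATTACCC"))  # MDKHNY
```

The output matches the first six residues of the protein sequence (MDKHNY). Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content rows to be checked in the same way.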