Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2281 |
Symbol | |
ID | 3705073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2631275 |
End bp | 2633167 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637738760 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_344269 |
Protein GI | 77165744 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCAAAGC GTGAAATCGG CCCTTTTTTG CAGGCCCAAA AGCGACGGGC CGTGGTCATT GTGCATGATG TCACTGCCGT CTGTTTGGCC TGGGTGTTAA GTTACCTTAT TCGTTACAAT TTATCGCCTT ATTGGGTGGA CTGGGAGACT TGCCTCAGCA CCTTGCCGGT GATCGTGGTG GTCCAAAGCA TCGTCTTGAT TTGGATAGGG CTTTATCGGG GCTTGTGGCG CTTTGCGAGC ATTCCTGATT TATTAAACAT CATTCGCGCC GTAGGTTACG GAGCCCTTTT AGTTGCGCTA ACGCTTTTTC TTATCAACCG CCTCAACGGC GTACCCCGCA GTGCATTGCT GCTTTATCCT ATCTTTTCTT TACTGCTGCT GGCAGGTCCC CGGCTTGCCT ACCGGGTGTG GAAAGATCGC CGTCTTACCC TGGCTGCAAG TCCTCATCGA AAACGAGTCT TAATTTTGGG AGCTGGCCGT GCTGGTGATT TGCTGGTCCG TGATATGCTT GTGGAGGGGC ACTACCTGCC GGTGGCTTTT CTAGATGATC GGCCTGATCT GATAGGCCGG CAAGTGCGAG GAATTCCAGT AGTGGGGTTG TTAGCGCAAT TGCCTGCCAA GGTGGGTGAG ATGGCTGCAG ATGTGGTGGT GATTGCGATG CCTTCTGCTA ATAACCAGCA GATGCGCCGT GCTGTTTCGC TCTGTGAGCA GGCCCAAGTA CCTTATCGGA CTTTACCCCG TCTGCAAGAT TTGGTTGCGG GTCGGGCGAG CATCAAGGAG CTTCGAGAAG TTGCCATTGA CGATCTATTG GGGCGGGACG AAATATTTTT GGATTGGGAT TATATCCGCC GAGACTTGGA AGCTAAATCC CTGTTGGTCA GTGGCGGAGG AGGATCGATC GGATCCGAAC TTTGCCGGCA ATTAGGGCGC CTTGGGCCCG CTTCCCTTAT TGTCATCGAT AATAGCGAAT ATAAGCTCTA TCGCCTCGAA CAGGAGTTGC GGCAGAAACT CCCGGCTCTC AATCTCCACG CTTATTTATG TGATGTGCGT GATGAAGCTG GTGTGAAGGA GATCCTAGCG CGTCATCTGC CAGAGGTGGT GTTCCATGCA GCAGCTTATA AACATGTACC CCTCCTAGAG AGGCAAGCAC GGCAGGCTGT GCGAAACAAT ATTTTTGGAA CCCGCTGTTT AGCCGAGGCG GCAAGGCAAG CAGGCTGTGG AACTTTCGTC CTGATTTCAA CCGACAAAGC GGTTAATCCA ACGAGTGTGA TGGGCACCAG CAAGCGAGTT GCGGAGCTTC TCTGCCAAGA TCTTAACCGA CACAGCCAAA CTCGGTTTAT CGTCGTGCGT TTTGGCAATG TGCTTGGATC AGCAGGGAGC GTGGTGCCGT TATTCCAGGC CCAAATTGCC GCCGGTGGGC CGATCACGGT GACCCATCCC GAAATGACCC GCTATTTCAT GACTATTCCC GAGGCATGCC ACTTGATTAC CCAAGCCGCT ATGATAGGCA AAGGAGGGGA GATATTCGTC CTGGATATGG GCGAGTCGGT TAAAATTATC GAACTTGCTG AGCAAATGAT CCGCCTTTCT GGCCAGGAAC CGGCAGTAGA AATCATTTAT ACGGGGCTGA GGCCGGGGGA AAAACTCCAT GAGGAATTAT TTTATGCCCA TGAATCTCCT GTACCTACCC AATCGCCGAA GATATTGCTA GCACGGCATG GTGCTACCGA CTGGAGACGC TTGGAAAGCG GACTGGATGA GTTAGAGAAA GCCTGTGGGC TGGGAGAGGA ATGGCAGGTG AAGCAGGTGT TAGCGCGCCT ATTGCCGGAG GTGAGGGCGG AGGCTAAATT GCAACAGGTG GGTTCTAAAG TAATTGCAAT CAACAGGAGT TAA
|
Protein sequence | MPKREIGPFL QAQKRRAVVI VHDVTAVCLA WVLSYLIRYN LSPYWVDWET CLSTLPVIVV VQSIVLIWIG LYRGLWRFAS IPDLLNIIRA VGYGALLVAL TLFLINRLNG VPRSALLLYP IFSLLLLAGP RLAYRVWKDR RLTLAASPHR KRVLILGAGR AGDLLVRDML VEGHYLPVAF LDDRPDLIGR QVRGIPVVGL LAQLPAKVGE MAADVVVIAM PSANNQQMRR AVSLCEQAQV PYRTLPRLQD LVAGRASIKE LREVAIDDLL GRDEIFLDWD YIRRDLEAKS LLVSGGGGSI GSELCRQLGR LGPASLIVID NSEYKLYRLE QELRQKLPAL NLHAYLCDVR DEAGVKEILA RHLPEVVFHA AAYKHVPLLE RQARQAVRNN IFGTRCLAEA ARQAGCGTFV LISTDKAVNP TSVMGTSKRV AELLCQDLNR HSQTRFIVVR FGNVLGSAGS VVPLFQAQIA AGGPITVTHP EMTRYFMTIP EACHLITQAA MIGKGGEIFV LDMGESVKII ELAEQMIRLS GQEPAVEIIY TGLRPGEKLH EELFYAHESP VPTQSPKILL ARHGATDWRR LESGLDELEK ACGLGEEWQV KQVLARLLPE VRAEAKLQQV GSKVIAINRS
|
| |