Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2178 |
Symbol | |
ID | 3705214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 2515685 |
End bp | 2519455 |
Gene Length | 3771 bp |
Protein Length | 1256 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637738654 |
Product | glycosyl transferase family protein |
Protein accession | YP_344168 |
Protein GI | 77165643 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGTA GCGCCCCATC CGGCAAGTCA ACACCGGTTT CCTCTCTGCC CTTAAAAGAG CATTTATATA TTCGGGAATT TGATCCCCAG GGGGCGGATT CCCTGGCTAA AATTGCCCGT CTTATCCGAC CCCATACCCA GGTATTGGAT TTGGGTACAG GGCCTGGAGT TTTGGGGAAA TATTTATCCA CAGCTCTGGG CTGTGTTGTG GATGGCGTGG AAATGAGTGG AGACCAGGCT CGACTTGCAA AACCCTTTTA TCGGTATTTA CGGATAGCCG ATTTGGAGAC AGCGCAGCTA GCGGCGCTTT TTCCGGACCA GGGGGCGGGA AGCGAGACAG AATATTCAAT AGATACTAGC AGCACGGACC CTAAGAAAGA TACGCATCAC CGCTATGATT ATATTGTCTG CGCCGATGTT CTTGAGCATC TGAAAAATCC TGGCGCCGTG GCTTCCCAAC TGCCCGCCTT GTTAAAGCCC CAGGGCCGGG TTTTGTTGTC TATTCCCAAT ATCGCCCATG CGGGAGTGAT TGCCGAGCTG CTGGCGGGGG AGTTTCGCTA TCGTCCAGAG GGCTTGCTGG ATTCGACTCA TTTGCGGTTT TTTACCCGTA AGTCTTTACT GGAATTTTTG AACTGCCATG GCTTAGTTCC CCTTTCGGTA GAGGGGATAC CCTGTGATAT CAGGGCGAGC GAATTTCGAG GCTACTATGT GGAGACATTA CCGCCTGCTA TTTACCGTTT ATTGCAAGCC TATCCCGATG CTTTAACCTA TCAATTTATC GTGGAAGCCA GGCCAGGCGC CCAGGCTGCA AAAAAGCTGG CAGCAGACCC CGTGGTGCCT GAATTTCACT TTGCTTGCCG ACTTTATTGG CGCTTGGGAA CGGCTGGATA TCAGGAAGAA AATAGTAGCT ATGTCCTGGG TTGCATCGGT AAGGAGCACC AGAGGATTCG GTTTTCTATT CCTCCCCTGC CCGAAATCCC CACAGGGATA CGGATTGCTC CCGCCGAGCG CCCAGGGTTC ATGCAAATCC ACCAAATTGC CCTCTACGAT AAGGAGAGGC AGAATATTTG GCGATGGCCA GGGGATATTG CCCATTTGCC GGCGGTTAAC ACCTATCAGA TGGAATTTTC CCATAGCTGG AGCGCCCCTT TGGGCGTCAA TGCGGTTTTA ATGGGTAAGG ACCCCTCTTT TGAATTGTCT CTTGAAGAAT CTGTATTGGC CTCTCTGCAA GCAGGTGGAG GGTTAGAGTT GCAAATTTCA TGGCCCTTGT CCGCTGATTT CATGGCGCTA ACGCAACGCC TGGAGCAAAA GGATCGGGAG CTACAAGCGC AGGAGGAATT ACTGCGGGAA AAAGATCGCC TCCTGGTTCA TCGCGGACGC CAATTGGAAG AAAAAGAGCG GTTGTTGGAA GCCAGCCATC AGGATTTACA AGGGCTGCAT GAAAAACTGG CGGAGCAGCA GCGGGAGCTA ACCGCCCATG AACAGCAGCT AGCGGAATCC AATGCCCTAG CCAATTATTT AAAAGCCCGG CTGGCCCATC AGGAGAGTTG GCGCGGTTGG ATGCGGCGTC CTTTCCGGCC CTTGAAGCGC TGGCATCTAA AACGGTTCGA GGCTAGAACA GCTCACTCCC CGTGCATTGA TATCATCATT CCCGTTTATA ATGCCTATGA GTATCTACGG GACTGCTTAG AGCGCCTGCG CCTCTGTACT CAGGAGCCTT ACCGGCTGGT GCTTATCGAC GATGCCTCAA CGGACAGCCG TATTCAGACT CTATTTGAGG AGCTTGAGGC GGCGGGGGAT GAAGATATTC TGCTGCTGCG TAACGAATAT AACCAGGGGT TTGTAGCTAC CGCCAACCGG GGAATGTCAC TCGGCGCTAA CGATGTGGTG CTGTTGAATT CCGATACCTT AGTAACTAGG AATTGGTTGG AAAAACTTAA ACGTTGCGCC GCCTCGGACC CAAAAATAGG TACCATTACT CCTTTTACCA ATAATGGGGA GATTTGTTCT TTCCCGGAAT TCTGCCGGGA AAATCCTTTG CCCGACGATC CGGAATTGCT CAACCAGGCG CTGGATCATC TGGATCTCGC CATCTATCCG GATATTCCCA CCGGAGTTGG CTTTTGCCTC TATATCCGCC GGGCGCTTAT CCATCAGGTG GGTTTATTCG ATGAAGATGC CTTTGGTCGA GGGTACGGAG AAGAAAATGA CTTATGCCTG CGGGCAGCGC AAGCCGGTTT CCGGAATGTG CTCTGTAGCG ATGCCTATGT AGCCCATGTG GGAGGCTGCT CCTTTGGCCA GGAAAAAAGC GCCATTGGGG AAAAGCAAAT GGCGGTGCTA CTCAATAAAC ACCCCACTTA CCTGGAGCAG GTCGATCGCT TTATAAAACA AGATCCCCTT AAACCCCTTC GGCAATTAAT TCAAGGCCAG TTAGAAAAGG CCGTTCCTTC AAGAAAGCCC GCCATCTTGC ATGTGATGCA TGGCCATGGC GGACCCGCCG TCTCCCAGGG TCGGGGTGGA GGGATAGGCA CTTATATCGA GAATTTAACG GCCCGCCTTG CCGGTGAATT CCGACATTAT GGGTTGATTG CCCTAGAGCG GGAATGGACC CTGCAAGAGC TTTCTCCTGG GGAGGAGAAT CAAAGCTATC GTTTCCAGCG CCAGGATAAT GAAACTTGGC CTGCCTTTCT GGAAGGGATT TGTGCTTGGT TAGACGTTAG ACTGTGCCAT ATTCATCAAA TTGCCGATTG CCGCGATGGC TTGCTAGAAG CTTTTGCCGG GGTGCGGATT CCCTATGGGG TCAGCATCCA TGATTTTTTA CTGGCCTGCC CAACGGTAAA CCTTTTGGAT GGCAAGGCGC GTTACTGTCA TGCCGTAACC AACACTGACC AGTGCCAACA ATGTCTTGAT GACCAGCTCT CCTTTGCCCA TATTGATATT GGGAAATGGC GGCGGCGGCA TGGGGATTTT TTAGCCAAGG CCGCTTTCGT CCTTGCCCCT TCAGCCTGGG CCCGGCGCAC CTTTAACAAG TATTTTCCTG GGGTTCCGGT AACTTTAATC CCTAATTTTC AGCAGCCCCC GTTATTTGGG CAAAGGGGAG GGAATATCCG TGGCTTCTTG CTTCCGCAAG ATAGTATCAA GTCGATCGGG GTGCTAGGGG CTATCGGGCC GGTGAAAGGT GCTAGACAGT TGGAACAGTT AGTGGAAAGA ACCCGGGAGC GGCAATTGCC CTTGCGCTGG GTGGTCATTG GCTACACGGA TCGGCAGGGA GATCCCCCTG TGCCTTACCA GAGCGAGGAT CAGATAGTAA CGCTCCATGG CCCTTATAGG CAGGCGGATC TGCCAGCCCT GCTGGATCAT TACGCTATCT CTTTGGTGGT CTTTCCTTCG GCGGGTCCGG AAACTTTTTC TTATACGCTT TCGGAGGCTT GGGCGGCTGG CCGGCCGGTG TTGGTGCCTC CTATTGGGGC GCTGGAAGAA CGGGTGGCGG ATATCGGGGC TGGCTGGATC ATGGAGGATT GGCAAGATAT GGATAAGATT TTGGACCAGG TAATGGCTTT GGTTTATCCG GAAGCGGCGG AATCCCTGCT GCTAATTCAG GAGTGTGTGG AACAGGCCAA TCGCCAGCAG GCGGATCAAT CGTGTTCTTC AAGCCTTCTG ATTGCTGAAG CTTACCGGCG TTCTTTCGCC TCTTTTTCGC CATCTGAGTT AAGGGACCTG AGCTCCTGGC GCATCTATGA GGCGGCTTGT CAGGGGCGGG ATGGGGATTG A
|
Protein sequence | MNSSAPSGKS TPVSSLPLKE HLYIREFDPQ GADSLAKIAR LIRPHTQVLD LGTGPGVLGK YLSTALGCVV DGVEMSGDQA RLAKPFYRYL RIADLETAQL AALFPDQGAG SETEYSIDTS STDPKKDTHH RYDYIVCADV LEHLKNPGAV ASQLPALLKP QGRVLLSIPN IAHAGVIAEL LAGEFRYRPE GLLDSTHLRF FTRKSLLEFL NCHGLVPLSV EGIPCDIRAS EFRGYYVETL PPAIYRLLQA YPDALTYQFI VEARPGAQAA KKLAADPVVP EFHFACRLYW RLGTAGYQEE NSSYVLGCIG KEHQRIRFSI PPLPEIPTGI RIAPAERPGF MQIHQIALYD KERQNIWRWP GDIAHLPAVN TYQMEFSHSW SAPLGVNAVL MGKDPSFELS LEESVLASLQ AGGGLELQIS WPLSADFMAL TQRLEQKDRE LQAQEELLRE KDRLLVHRGR QLEEKERLLE ASHQDLQGLH EKLAEQQREL TAHEQQLAES NALANYLKAR LAHQESWRGW MRRPFRPLKR WHLKRFEART AHSPCIDIII PVYNAYEYLR DCLERLRLCT QEPYRLVLID DASTDSRIQT LFEELEAAGD EDILLLRNEY NQGFVATANR GMSLGANDVV LLNSDTLVTR NWLEKLKRCA ASDPKIGTIT PFTNNGEICS FPEFCRENPL PDDPELLNQA LDHLDLAIYP DIPTGVGFCL YIRRALIHQV GLFDEDAFGR GYGEENDLCL RAAQAGFRNV LCSDAYVAHV GGCSFGQEKS AIGEKQMAVL LNKHPTYLEQ VDRFIKQDPL KPLRQLIQGQ LEKAVPSRKP AILHVMHGHG GPAVSQGRGG GIGTYIENLT ARLAGEFRHY GLIALEREWT LQELSPGEEN QSYRFQRQDN ETWPAFLEGI CAWLDVRLCH IHQIADCRDG LLEAFAGVRI PYGVSIHDFL LACPTVNLLD GKARYCHAVT NTDQCQQCLD DQLSFAHIDI GKWRRRHGDF LAKAAFVLAP SAWARRTFNK YFPGVPVTLI PNFQQPPLFG QRGGNIRGFL LPQDSIKSIG VLGAIGPVKG ARQLEQLVER TRERQLPLRW VVIGYTDRQG DPPVPYQSED QIVTLHGPYR QADLPALLDH YAISLVVFPS AGPETFSYTL SEAWAAGRPV LVPPIGALEE RVADIGAGWI MEDWQDMDKI LDQVMALVYP EAAESLLLIQ ECVEQANRQQ ADQSCSSSLL IAEAYRRSFA SFSPSELRDL SSWRIYEAAC QGRDGD
|
| |