Gene Noc_2178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2178 
Symbol 
ID3705214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2515685 
End bp2519455 
Gene Length3771 bp 
Protein Length1256 aa 
Translation table11 
GC content53% 
IMG OID637738654 
Productglycosyl transferase family protein 
Protein accessionYP_344168 
Protein GI77165643 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGTA GCGCCCCATC CGGCAAGTCA ACACCGGTTT CCTCTCTGCC CTTAAAAGAG 
CATTTATATA TTCGGGAATT TGATCCCCAG GGGGCGGATT CCCTGGCTAA AATTGCCCGT
CTTATCCGAC CCCATACCCA GGTATTGGAT TTGGGTACAG GGCCTGGAGT TTTGGGGAAA
TATTTATCCA CAGCTCTGGG CTGTGTTGTG GATGGCGTGG AAATGAGTGG AGACCAGGCT
CGACTTGCAA AACCCTTTTA TCGGTATTTA CGGATAGCCG ATTTGGAGAC AGCGCAGCTA
GCGGCGCTTT TTCCGGACCA GGGGGCGGGA AGCGAGACAG AATATTCAAT AGATACTAGC
AGCACGGACC CTAAGAAAGA TACGCATCAC CGCTATGATT ATATTGTCTG CGCCGATGTT
CTTGAGCATC TGAAAAATCC TGGCGCCGTG GCTTCCCAAC TGCCCGCCTT GTTAAAGCCC
CAGGGCCGGG TTTTGTTGTC TATTCCCAAT ATCGCCCATG CGGGAGTGAT TGCCGAGCTG
CTGGCGGGGG AGTTTCGCTA TCGTCCAGAG GGCTTGCTGG ATTCGACTCA TTTGCGGTTT
TTTACCCGTA AGTCTTTACT GGAATTTTTG AACTGCCATG GCTTAGTTCC CCTTTCGGTA
GAGGGGATAC CCTGTGATAT CAGGGCGAGC GAATTTCGAG GCTACTATGT GGAGACATTA
CCGCCTGCTA TTTACCGTTT ATTGCAAGCC TATCCCGATG CTTTAACCTA TCAATTTATC
GTGGAAGCCA GGCCAGGCGC CCAGGCTGCA AAAAAGCTGG CAGCAGACCC CGTGGTGCCT
GAATTTCACT TTGCTTGCCG ACTTTATTGG CGCTTGGGAA CGGCTGGATA TCAGGAAGAA
AATAGTAGCT ATGTCCTGGG TTGCATCGGT AAGGAGCACC AGAGGATTCG GTTTTCTATT
CCTCCCCTGC CCGAAATCCC CACAGGGATA CGGATTGCTC CCGCCGAGCG CCCAGGGTTC
ATGCAAATCC ACCAAATTGC CCTCTACGAT AAGGAGAGGC AGAATATTTG GCGATGGCCA
GGGGATATTG CCCATTTGCC GGCGGTTAAC ACCTATCAGA TGGAATTTTC CCATAGCTGG
AGCGCCCCTT TGGGCGTCAA TGCGGTTTTA ATGGGTAAGG ACCCCTCTTT TGAATTGTCT
CTTGAAGAAT CTGTATTGGC CTCTCTGCAA GCAGGTGGAG GGTTAGAGTT GCAAATTTCA
TGGCCCTTGT CCGCTGATTT CATGGCGCTA ACGCAACGCC TGGAGCAAAA GGATCGGGAG
CTACAAGCGC AGGAGGAATT ACTGCGGGAA AAAGATCGCC TCCTGGTTCA TCGCGGACGC
CAATTGGAAG AAAAAGAGCG GTTGTTGGAA GCCAGCCATC AGGATTTACA AGGGCTGCAT
GAAAAACTGG CGGAGCAGCA GCGGGAGCTA ACCGCCCATG AACAGCAGCT AGCGGAATCC
AATGCCCTAG CCAATTATTT AAAAGCCCGG CTGGCCCATC AGGAGAGTTG GCGCGGTTGG
ATGCGGCGTC CTTTCCGGCC CTTGAAGCGC TGGCATCTAA AACGGTTCGA GGCTAGAACA
GCTCACTCCC CGTGCATTGA TATCATCATT CCCGTTTATA ATGCCTATGA GTATCTACGG
GACTGCTTAG AGCGCCTGCG CCTCTGTACT CAGGAGCCTT ACCGGCTGGT GCTTATCGAC
GATGCCTCAA CGGACAGCCG TATTCAGACT CTATTTGAGG AGCTTGAGGC GGCGGGGGAT
GAAGATATTC TGCTGCTGCG TAACGAATAT AACCAGGGGT TTGTAGCTAC CGCCAACCGG
GGAATGTCAC TCGGCGCTAA CGATGTGGTG CTGTTGAATT CCGATACCTT AGTAACTAGG
AATTGGTTGG AAAAACTTAA ACGTTGCGCC GCCTCGGACC CAAAAATAGG TACCATTACT
CCTTTTACCA ATAATGGGGA GATTTGTTCT TTCCCGGAAT TCTGCCGGGA AAATCCTTTG
CCCGACGATC CGGAATTGCT CAACCAGGCG CTGGATCATC TGGATCTCGC CATCTATCCG
GATATTCCCA CCGGAGTTGG CTTTTGCCTC TATATCCGCC GGGCGCTTAT CCATCAGGTG
GGTTTATTCG ATGAAGATGC CTTTGGTCGA GGGTACGGAG AAGAAAATGA CTTATGCCTG
CGGGCAGCGC AAGCCGGTTT CCGGAATGTG CTCTGTAGCG ATGCCTATGT AGCCCATGTG
GGAGGCTGCT CCTTTGGCCA GGAAAAAAGC GCCATTGGGG AAAAGCAAAT GGCGGTGCTA
CTCAATAAAC ACCCCACTTA CCTGGAGCAG GTCGATCGCT TTATAAAACA AGATCCCCTT
AAACCCCTTC GGCAATTAAT TCAAGGCCAG TTAGAAAAGG CCGTTCCTTC AAGAAAGCCC
GCCATCTTGC ATGTGATGCA TGGCCATGGC GGACCCGCCG TCTCCCAGGG TCGGGGTGGA
GGGATAGGCA CTTATATCGA GAATTTAACG GCCCGCCTTG CCGGTGAATT CCGACATTAT
GGGTTGATTG CCCTAGAGCG GGAATGGACC CTGCAAGAGC TTTCTCCTGG GGAGGAGAAT
CAAAGCTATC GTTTCCAGCG CCAGGATAAT GAAACTTGGC CTGCCTTTCT GGAAGGGATT
TGTGCTTGGT TAGACGTTAG ACTGTGCCAT ATTCATCAAA TTGCCGATTG CCGCGATGGC
TTGCTAGAAG CTTTTGCCGG GGTGCGGATT CCCTATGGGG TCAGCATCCA TGATTTTTTA
CTGGCCTGCC CAACGGTAAA CCTTTTGGAT GGCAAGGCGC GTTACTGTCA TGCCGTAACC
AACACTGACC AGTGCCAACA ATGTCTTGAT GACCAGCTCT CCTTTGCCCA TATTGATATT
GGGAAATGGC GGCGGCGGCA TGGGGATTTT TTAGCCAAGG CCGCTTTCGT CCTTGCCCCT
TCAGCCTGGG CCCGGCGCAC CTTTAACAAG TATTTTCCTG GGGTTCCGGT AACTTTAATC
CCTAATTTTC AGCAGCCCCC GTTATTTGGG CAAAGGGGAG GGAATATCCG TGGCTTCTTG
CTTCCGCAAG ATAGTATCAA GTCGATCGGG GTGCTAGGGG CTATCGGGCC GGTGAAAGGT
GCTAGACAGT TGGAACAGTT AGTGGAAAGA ACCCGGGAGC GGCAATTGCC CTTGCGCTGG
GTGGTCATTG GCTACACGGA TCGGCAGGGA GATCCCCCTG TGCCTTACCA GAGCGAGGAT
CAGATAGTAA CGCTCCATGG CCCTTATAGG CAGGCGGATC TGCCAGCCCT GCTGGATCAT
TACGCTATCT CTTTGGTGGT CTTTCCTTCG GCGGGTCCGG AAACTTTTTC TTATACGCTT
TCGGAGGCTT GGGCGGCTGG CCGGCCGGTG TTGGTGCCTC CTATTGGGGC GCTGGAAGAA
CGGGTGGCGG ATATCGGGGC TGGCTGGATC ATGGAGGATT GGCAAGATAT GGATAAGATT
TTGGACCAGG TAATGGCTTT GGTTTATCCG GAAGCGGCGG AATCCCTGCT GCTAATTCAG
GAGTGTGTGG AACAGGCCAA TCGCCAGCAG GCGGATCAAT CGTGTTCTTC AAGCCTTCTG
ATTGCTGAAG CTTACCGGCG TTCTTTCGCC TCTTTTTCGC CATCTGAGTT AAGGGACCTG
AGCTCCTGGC GCATCTATGA GGCGGCTTGT CAGGGGCGGG ATGGGGATTG A
 
Protein sequence
MNSSAPSGKS TPVSSLPLKE HLYIREFDPQ GADSLAKIAR LIRPHTQVLD LGTGPGVLGK 
YLSTALGCVV DGVEMSGDQA RLAKPFYRYL RIADLETAQL AALFPDQGAG SETEYSIDTS
STDPKKDTHH RYDYIVCADV LEHLKNPGAV ASQLPALLKP QGRVLLSIPN IAHAGVIAEL
LAGEFRYRPE GLLDSTHLRF FTRKSLLEFL NCHGLVPLSV EGIPCDIRAS EFRGYYVETL
PPAIYRLLQA YPDALTYQFI VEARPGAQAA KKLAADPVVP EFHFACRLYW RLGTAGYQEE
NSSYVLGCIG KEHQRIRFSI PPLPEIPTGI RIAPAERPGF MQIHQIALYD KERQNIWRWP
GDIAHLPAVN TYQMEFSHSW SAPLGVNAVL MGKDPSFELS LEESVLASLQ AGGGLELQIS
WPLSADFMAL TQRLEQKDRE LQAQEELLRE KDRLLVHRGR QLEEKERLLE ASHQDLQGLH
EKLAEQQREL TAHEQQLAES NALANYLKAR LAHQESWRGW MRRPFRPLKR WHLKRFEART
AHSPCIDIII PVYNAYEYLR DCLERLRLCT QEPYRLVLID DASTDSRIQT LFEELEAAGD
EDILLLRNEY NQGFVATANR GMSLGANDVV LLNSDTLVTR NWLEKLKRCA ASDPKIGTIT
PFTNNGEICS FPEFCRENPL PDDPELLNQA LDHLDLAIYP DIPTGVGFCL YIRRALIHQV
GLFDEDAFGR GYGEENDLCL RAAQAGFRNV LCSDAYVAHV GGCSFGQEKS AIGEKQMAVL
LNKHPTYLEQ VDRFIKQDPL KPLRQLIQGQ LEKAVPSRKP AILHVMHGHG GPAVSQGRGG
GIGTYIENLT ARLAGEFRHY GLIALEREWT LQELSPGEEN QSYRFQRQDN ETWPAFLEGI
CAWLDVRLCH IHQIADCRDG LLEAFAGVRI PYGVSIHDFL LACPTVNLLD GKARYCHAVT
NTDQCQQCLD DQLSFAHIDI GKWRRRHGDF LAKAAFVLAP SAWARRTFNK YFPGVPVTLI
PNFQQPPLFG QRGGNIRGFL LPQDSIKSIG VLGAIGPVKG ARQLEQLVER TRERQLPLRW
VVIGYTDRQG DPPVPYQSED QIVTLHGPYR QADLPALLDH YAISLVVFPS AGPETFSYTL
SEAWAAGRPV LVPPIGALEE RVADIGAGWI MEDWQDMDKI LDQVMALVYP EAAESLLLIQ
ECVEQANRQQ ADQSCSSSLL IAEAYRRSFA SFSPSELRDL SSWRIYEAAC QGRDGD