Gene Noc_1508 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1508 
Symbol 
ID3705844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1672305 
End bp1673813 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content59% 
IMG OID637737994 
Productundecaprenyl-phosphate galactosephosphotransferase 
Protein accessionYP_343523 
Protein GI77164998 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.2856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTAG TACCGTTAGT ATTAAAAGAA AGAATAATAG AACCAGAAGG AGAAAGGCCA 
AAAAATGGCA ACTTTACTCC AGCGGTATAT GAGGTACTTT CATGCGAAAA CAAGACACGG
CCCTATCTAA CGGGCGCCGT ACTGCTAGCT GTGGACGTAG GCGCCGTGGG GGTGGCCTTT
GCGGCCGCCG TGGCCCTGCG CTACAGCTTC GGGTCTGGTT TGAATTTCTC CACTTATTGG
GAATTGTCGC CGCTTCTGCT GCTATTTTCG ATCTTCTATG CTGGCGCCAA GCTCTATTCC
GGCACAATGT TTCCCCCGCC AGAAGAACTG CGGCGGCTCA CCTATGCAAC CAGCGCGGTG
TTTCTGACGC TGGCTGTCTT CGCACTGCTG ACCCAAACCG GTATGCGGTA CTCCCGGGGG
GTATTTTTTC TGGCCTGGGG CTTGGCGCTG GTCGCGGTGC CGCTGGCCCG GGCGGCGATA
CGTGCCCTTT ATGTCAGCCG TCCCTGGTGG GGTAGAGGGG TGTTGATTCT AGGGGCGGGA
TATACCTGCG CGCACCTGGT CCGCACGCTC CAGGAGAACC CTCAACTGGG ATTGAAACCC
ATAGCTCTGC TCGACGACGA TCCGGTGAAA CAGAAGCGTG AGCTGCATGG GGTTCCGGTC
GTCGGTGCCT GCGCCTTGGC GCCGAAGCTT GCCCGACGTC TACGCCTCAG CTACGCGGCT
GTGGCCATGC CCGGTGTGTC ACGCGCCCGG CTTGTAGAAT TACTGGGACA GCACGGCTGG
CCGTTCCGGC GCACGCTATT AATTCCCGAC CTATTCGGCT TTTCCTCCCT GTGGGTTACC
TCCCGTGACC TGGGCGGAAT TCTGGGCCTG GAGCTGCGCG AACAGCTCCT GCTGCCAATC
CCCATGCTGG TCAAACGGAC CTTGGACTTG GTTCTCGCCC TGGTGGGTGG GCTGTTCATC
CTGCCGCTCC TTGGGTTGAT CGCTCTTGCC ATCAAGCTAG ATTCCAGGGG ACCGGTGTTC
TACCGCTCGG AGCGCATGGG CCGTGATGGC CACCGCTTCG TGGCACTCAA ATTTCGTTCC
ATGCGCGGTG ATGGTGAAGC GCTGTTGCGG GAATTATTAC AGCGTGATCC GGAAAAGCGG
AAAGAATACG AGCAGTATCA CAAGCTCACC AGCGACCCCC GCGTAACGCC AGTGGGACGC
CTGCTCCGCG CCTGGAGCCT CGATGAACTC CCTCAGCTTT GGAATGTATT GAAGGGCGAT
ATGAGCCTTG TTGGCCCGCG CGCCTATTTG GAGCGCGAGC GGCCAGACAT GGGAGAAAAA
TCGAATCTCA TCTTGAAGGT GAGACCCGGT ATCACGGGCC TGTGGCAAGT CAGCGGCCGC
AACGAGCGCA CCTTCGGCGA GCGGGTGGAT ATGGACGTCT ACTACGCCCG CAACTGGTCC
GTTTGGCTCG ACTTTTGGAT TCTTGCCCGG ACAGCCACGG CGGTTTTGCA GGGGAAGGGG
GCGTACTGA
 
Protein sequence
MRLVPLVLKE RIIEPEGERP KNGNFTPAVY EVLSCENKTR PYLTGAVLLA VDVGAVGVAF 
AAAVALRYSF GSGLNFSTYW ELSPLLLLFS IFYAGAKLYS GTMFPPPEEL RRLTYATSAV
FLTLAVFALL TQTGMRYSRG VFFLAWGLAL VAVPLARAAI RALYVSRPWW GRGVLILGAG
YTCAHLVRTL QENPQLGLKP IALLDDDPVK QKRELHGVPV VGACALAPKL ARRLRLSYAA
VAMPGVSRAR LVELLGQHGW PFRRTLLIPD LFGFSSLWVT SRDLGGILGL ELREQLLLPI
PMLVKRTLDL VLALVGGLFI LPLLGLIALA IKLDSRGPVF YRSERMGRDG HRFVALKFRS
MRGDGEALLR ELLQRDPEKR KEYEQYHKLT SDPRVTPVGR LLRAWSLDEL PQLWNVLKGD
MSLVGPRAYL ERERPDMGEK SNLILKVRPG ITGLWQVSGR NERTFGERVD MDVYYARNWS
VWLDFWILAR TATAVLQGKG AY