Gene Noc_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3072 
Symbol 
ID3705650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3464842 
End bp3466203 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content48% 
IMG OID637739546 
ProductUDP-N-acetylglucosamine pyrophosphorylase 
Protein accessionYP_345043 
Protein GI77166518 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCTGTTA GCGTCATTAT CCTAGCTGCG GGGCAAGGAA CCCGCATGCA TTCAACTTTA 
CCCAAGGTGC TTCACCAACT TGCAGGCCGG CCGCTGCTGA GCCATGTGAT CGCAACAGCG
CGCCAGCTAA ATCCAGCGCA AATAATTGTG GTTTATGGCC ATGGTGGGGA AACGGTGCCT
GAGGCTTTTA GAGCGGCTGA TATTACTTGG GTGCGGCAAG AACTACAGTT AGGCACGGGT
CATGCGGCGC TCCAGACTTT GCCTTATATA AAGCCGGACA CAATGCTCTT GATTCTTTCT
GGCGATGTCC CTCTGGTTAA AATTGAGACT CTAAAAAGGC TGCTAGCTAT GGTAGATCAG
GGAGGGGTAG GTCTGCTTAC TGTTGAGCTG GCTAATCCTA ATGGTTATGG CCGGATTATT
CGAGATAGGG TTGGCGGGGT ATCTAAAATT ATTGAAGAAG CTGATGCTAG CCCAGAGCAA
CGCCGGATCA GAGAAGTTAA TACAGGCATT ACGGCTATTG AGGCTCGTTA CTTGAAGCAG
ATTGCCCCTA AGCTTAGCAA TAATAATGCC CAGGGGGAGT ATTATCTTAC CGATATTATT
GAGCACGCTG TCGCCAGCGG AGAGAAAGTA GTAGCAGTTT CAGCGGCAGA TTCTATAGAG
GTGATGGGTG TCAATGATCG CCAGCAGCTT GCCTATTTAG AGCGTTTTTA TCAGAAACGG
GAGGCCGCGC GTTTAATGGG GGAAGGAGTT AGCCTCAGCG ATCCCGATCG CTTTGACTTG
CGGGGTGAAT TGCTGGTGGG GAAGGATGTT TATATTGATA TTAATGTGAT TCTTGAAGGG
AGGGTGATTC TAGGTGATGG AGTAAAAATA GGCCCTCACT GCTATCTTCG CAACGCGGTG
CTTGGGGAAG GCGTTGAGGT ATTGGCTAAT TGCGTTATTG AAGAAGCCGC GATTGATGCT
TGTGCCCGAG TTGGCCCCTT TACTCGCATC CGTCCTGAAA CGAGGTTAGG TGAAGGAGTA
CATATTGGTA ATTTTGTGGA AATCAAAAAA TCGACTATCA ATAAAAATTC CAAAGTTAAC
CATTTGAGCT ATATTGGTGA TGCCACTATC GGCAAGAAAG TTAACATTGG AGCTGGTACG
ATTACCTGTA ATTATGATGG GGCAAATAAA CATCATACGC TCATTGAAGA TAATGTTTTT
ATTGGCTCAG ATACTCAATT GATAGCCCCC GTGAAAATTG GTGCGGGCGC AACTATTGGC
GCGGGGGCTA CCATTACCCA CGATGTGCCG CCGGGAGAAT TGACCCTAAG CAGGACGCCA
CAAAAATCAT GGCCTGGATG GAAGCGGCCT AGTAAAAAGT GA
 
Protein sequence
MAVSVIILAA GQGTRMHSTL PKVLHQLAGR PLLSHVIATA RQLNPAQIIV VYGHGGETVP 
EAFRAADITW VRQELQLGTG HAALQTLPYI KPDTMLLILS GDVPLVKIET LKRLLAMVDQ
GGVGLLTVEL ANPNGYGRII RDRVGGVSKI IEEADASPEQ RRIREVNTGI TAIEARYLKQ
IAPKLSNNNA QGEYYLTDII EHAVASGEKV VAVSAADSIE VMGVNDRQQL AYLERFYQKR
EAARLMGEGV SLSDPDRFDL RGELLVGKDV YIDINVILEG RVILGDGVKI GPHCYLRNAV
LGEGVEVLAN CVIEEAAIDA CARVGPFTRI RPETRLGEGV HIGNFVEIKK STINKNSKVN
HLSYIGDATI GKKVNIGAGT ITCNYDGANK HHTLIEDNVF IGSDTQLIAP VKIGAGATIG
AGATITHDVP PGELTLSRTP QKSWPGWKRP SKK