Gene Noc_2780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2780 
Symbol 
ID3705510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3156761 
End bp3158056 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content51% 
IMG OID637739256 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_344757 
Protein GI77166232 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.164739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAC TTTTGATTAA TGGTGGTATT TCTCTAAACG GAGAAATTCG CATCTCGGGT 
GCTAAGAATG CTGCCTTGCC TATACTCGCA GCAACTCTAC TTGCGAGCGA ACCCGTTAAA
ATCTGCAATA TCCCTCACTT GCATGATATC ACCACGACCA TGGAGCTGCT TGGACGCATG
GGAGCTCAGC TCATGGTGGA TGAGCACCTT AATATTGAGG TAGATACACG TAATCTGAAA
GAATTTTATG CCCCTTATGA GCTGGTCAAA ACGATGCGCG CCTCTATCCT GGTGTTAGGA
CCACTATTAG CCCGCTATGG AAGAGCAGAT GTATCCTTGC CCGGCGGTTG TGCTATTGGT
TCCCGGCCCG TCAACCTTCA TATCCATGGT TTACAGGCTA TGGGAGCAAC CATTACAGTG
GAAGAAGGGT ATATTTGCGC TCGCTCCCAG GGACGACTAA GAGGAACCCG ACTGTTTATG
GATCGCGTGT CGGTGACAGG AACCGAGAAT TTGATGATGG CGGCCACTTT AGCTGAAGGG
ACGACCTTTA TAGAAAATGC CGCTCGTGAG CCAGAGGTGG TGGATTTGGC GCACTGCCTT
AACCAGATGG GGGCTAGAAT TAGCGGCATG GGTAGCGATA CGTTGGTCAT TGAGGGAGTT
GATTCCCTTG GTGGTGCCTC TCATACGGTG CTTCCCGATC GTATTGAGAC GGGTACCTAT
CTAGTGGCTG GAGCCTTGAC GGGTGGACGA GTGAAGCTCA AAAACACCAG CCCCGGAAGT
TTGGAAGCCG TATTGTTGAA GCTAGAGGAA GCCGGTGCTG AAATCAATAC GGGGAAGGAT
TGGATTGTTT TAGATATGAA AGGGCGCCGG CCCCGTGCAG TGGATATTCG CACCGCCCCC
TATCCCGCTT TTCCTACCGA TATGCAGGCT CAGTTTACTA CCTTGAACAT TGTCGCAGAG
GGTAGTGGCA CAATTACCGA AACCGTTTTT GAAAATCGAT TTATGCATGT CCAGGAGCTA
CAGCGGATGG GAGCGGTTAT TCGGCTGGAG GGTAATACTG CTTTCACCAA CGGGGTGGAA
ACGCTGACGG GTGCTCCGGT GATGGCAACT GATCTACGTG CTTCGGCCAG CTTGGTTTTG
GCAGGACTGG TCGCTAAGGG GGTCACTGCA GTGGATCGTA TTTACCATGT CGACCGCGGC
TATGAGTGCA TTGAGGAAAA ACTGCAACAA TTGGGAGCCA AGATTCGCCG CGTTTCGAGC
TATACTCCCG GCAAGATCTA TGCTGCTTAT GGTTGA
 
Protein sequence
MDKLLINGGI SLNGEIRISG AKNAALPILA ATLLASEPVK ICNIPHLHDI TTTMELLGRM 
GAQLMVDEHL NIEVDTRNLK EFYAPYELVK TMRASILVLG PLLARYGRAD VSLPGGCAIG
SRPVNLHIHG LQAMGATITV EEGYICARSQ GRLRGTRLFM DRVSVTGTEN LMMAATLAEG
TTFIENAARE PEVVDLAHCL NQMGARISGM GSDTLVIEGV DSLGGASHTV LPDRIETGTY
LVAGALTGGR VKLKNTSPGS LEAVLLKLEE AGAEINTGKD WIVLDMKGRR PRAVDIRTAP
YPAFPTDMQA QFTTLNIVAE GSGTITETVF ENRFMHVQEL QRMGAVIRLE GNTAFTNGVE
TLTGAPVMAT DLRASASLVL AGLVAKGVTA VDRIYHVDRG YECIEEKLQQ LGAKIRRVSS
YTPGKIYAAY G