Gene Noc_2676 details


Gene Information

Locus tag: Noc_2676
Symbol: lpxK
ID: 3704432
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3032253
End bp: 3033269
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 53%
IMG OID: 637739157
Product: tetraacyldisaccharide 4'-kinase
Protein accession: YP_344659
Protein GI: 77166134
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1663] Tetraacyldisaccharide-1-P 4'-kinase
TIGRFAM ID: [TIGR00682] tetraacyldisaccharide 4'-kinase
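
The start, end, gene length, and protein length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both assumptions fit the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3032253, 3033269)
print(length)                  # 1017
print(protein_length(length))  # 338
```
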


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.270569
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGGATT ATAGCTCTTT AATTCTTCGC TATTGGTATA GTAATCAGCC TTCCCGCTGG 
TTGCTCACCC CCCTGAGTGG TCTCTTCCAA CTAGCAGTCA AAATACGCCA ATGGGCCTAT
ACCCAAGGCC TGTTCCATAC CCACATACTC CCTCTGCCGG TGCTGGTCAT TGGCAACCTG
ACCTTGGGAG GGACAGGAAA AACGCCGTTA GTTATCTGGC TAGCCCAATT TCTCAGGCAG
CATGGCTATC GACCAGGACT TATCAGTCGT GGCTATGGAG GCCACGCTCA AAACTACCCT
CAACAGGTCT ATCCTGACAG TGATCCACAT TTAGTCGGCG ATGAGGCCGT CCTCCTGGCC
CGGCGCACCG GCTGCCCGCT TGTGGTAGGA CCTGATCGAG TTGCTGCCTC CCACGCTCTT
TTGGCTCACT CCGATTGCAA TGTCCTGCTT TCAGATGATG GACTGCAACA TTATGCCCTA
GGTCGGGATA TCGAAATCCT CGTAGTAGAT GGCGTACGCC GTTTTGGCAA CGCTCACTGT
CTGCCCGCCG GTCCTTTAAG AGAACCATTA AGCCGCCTTC GAACCGTGGA CTTAGTGGTC
ACTAACGGGA TGCCTCAAGG AGGCGAGTTC GCCATGTATT CACAGCTTCA AGATGCTCGC
CATATCAAAG ACGGTACCCT CCGCCCGCTA AAGAAATTCC GCCGCACTAA GCTCCATGGC
GTCGCGGGAA TTGGAAACCC AGAACGCTTC TTTAGCCAAC TGCGGGCCCT AGAGCTTACT
ATCCAGCCAC ACCCTTTCCC CGATCACTAT GGTTTCCAAT CTGAAGACTT GGCTTTTGCA
GATCAGCAGC CGGTGCTCAT GACCGAGAAA GATGCGGTAA AATGTATCCG TTTTGCCCGC
GATAACTATT GGTACGTTCC TATGGATGTT TCTCTGCCAG CATCCTTTGG CGCTCAAGTG
CTCAGCCTCC TGCAACAGGC CGCCAAAAAA AAGCTAAACA TAGAGACAAC AGGATGA
 
Protein sequence
MSDYSSLILR YWYSNQPSRW LLTPLSGLFQ LAVKIRQWAY TQGLFHTHIL PLPVLVIGNL 
TLGGTGKTPL VIWLAQFLRQ HGYRPGLISR GYGGHAQNYP QQVYPDSDPH LVGDEAVLLA
RRTGCPLVVG PDRVAASHAL LAHSDCNVLL SDDGLQHYAL GRDIEILVVD GVRRFGNAHC
LPAGPLREPL SRLRTVDLVV TNGMPQGGEF AMYSQLQDAR HIKDGTLRPL KKFRRTKLHG
VAGIGNPERF FSQLRALELT IQPHPFPDHY GFQSEDLAFA DQQPVLMTEK DAVKCIRFAR
DNYWYVPMDV SLPASFGAQV LSLLQQAAKK KLNIETTG
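
Both sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the grouping, compute GC content, and translate the gene sequence for comparison with the listed protein. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in its permitted start codons, and this sketch does not handle alternative starts (the gene here begins with ATG).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11 uses the same.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "ATGTCGGATT ATAGCTCTTT AATTCTTCGC TATTGGTATA GTAATCAGCC TTCCCGCTGG"
print(translate(clean(first_line)))  # MSDYSSLILRYWYSNQPSRW
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence through `clean` and then `gc_percent` or `translate` allows the same comparison against the GC content and protein length fields.
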