Gene Noc_1743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1743 
Symbol 
ID3705004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1958361 
End bp1960283 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content52% 
IMG OID637738226 
Product1-deoxy-D-xylulose-5-phosphate synthase 
Protein accessionYP_343745 
Protein GI77165220 
COG category[H] Coenzyme transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1154] Deoxyxylulose-5-phosphate synthase 
TIGRFAM ID[TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTCTG TAACTAGCTA TCCTCTATTA GAACAGATTG ATTCCCCGGA GCGTTTGCGC 
CGTTTGCCTG AGTCGGATCT AGAGACTCTC GCCGAGGAAT TGCGCGATTT TCTCCTTCAC
TCCGTCGCTC GCAGTGGCGG ACACTTGGCT GCAGGTTTGG GGACTATCGA ATTGACGATC
GCCCTACACT ATATTTTTGC CACTCCGGAA GATCGCCTGG TATGGGATGT AGGGCATCAA
GCCTATCCCC ACAAAGTGCT CACAGGACGA CGGGAACGGT TGGGAACTAT CCGTCAAGCA
GGCGGTTTGG CGCCCTTCCC CAGTCGTCAC GAGAGTCCTT ACGATACTTT TGGCGTGGGT
CATTCTAGCA CTTCGATTAG TGCCGCTCTC GGTATGGCCA TTGCCGCTAA TGAGAAGGGG
GAGAAGCGCA AAACAGTGGC CATTATCGGC GATGGCGGAA TGACGGCAGG AATGGCTTAT
GAGGCGTTGG ATCATGCCGG TGCCCTGGGG GCTGATTTAC TTGTGATCCT GAATGATAAC
GAGATGTCCA TTTCTCCTAA TGTGGGCGCA ATTTCTAGTT ATTTGACACG GTTATTAAGT
GGGCGGGTTT ACTCAACGGT GCGGGAAGGT AGCAAAAAGG TGCTTGAACG TATGCCGCCA
CCTATGTGGG AATTAGCGCG CCGCACGGAA GAACATGTGA AAGGGATGGT AGCTCCGGGG
ACTTTGTTTG AAGAGATGGG CTTTAATTAT TTCGGCCCTA TCGATGGGCA TGATTTGAGT
TCGTTGATTC GTACCTTACG GAATTTACAT AAGTTAACTG GCCCCCGTCT GTTGCATATC
GTCACCTGTA AGGGTAAAGG TTATACGCTA GCGGAGGAAA ATCCGGTCAC CTATCATGGG
GTAACCCCGT TTGATCCTAA GGTTGGCATC CAGCAAGGGC CCCAAAAACC ATCATCCGCA
ATGAGCTACA CTCAAGTCTT CAGCCAGTGG TTGTGTGATA TGGCAGCCCA GGATGGACTC
TTGGTAGGCA TTACTCCCGC TATGCGGGAG GGGTCGGGTC TGGTGAAATT TTCTGAATGT
TTTCCGGAAC GTTACTTTGA TGTGGCTATT GCCGAGCAGC ACAGTGTGAC TTTAGCTGCC
GGGATGGCAT GCGATGGGTT AAAACCGGTG GTTGCGATTT ACTCCACTTT TCTACAGCGG
GCCTATGATC AGCTGATTCA TGATGTTGCT CTGCAAAACC TGCCAGTGCT TTTTGCCATA
GATCGAGCTG GGGTGGTAGG GCCGGATGGC CCTACCCATG CGGGTAGCTT TGATTTGACC
TATCTTCGCT GCATCCCTAA CCTGGTAGTG ATGGCTCCGG CAGATGAAAA TGAGTGCCGG
CAGATGCTTT ATACGGGTTT CCTGCTTAAC CAACCGGCAG CAGTCCGTTA TCCTCGTGGG
AAAGGACCAG GGGTAGCCGT TGAAGCAAGC ATGACAGCAC TGCCGCTGGG TAAGGCTGAG
CTTAAGCGGA AAGGTCGGGG TATTGCTATC CTTGCTTTTG GTGCCACGGT GGCGCCCGCC
CTTGAAGCAG CGGAAAAGCT GGATGCCACG GTGGTGAATA TGCGCTTTGT TAAACCCTTG
GATGAAGATT TGGTCCTGGA AATGGCGATG AACCATGAAT TGCTGGTGAC TGTAGAGGAT
AATGTTATTG CGGGCGGCGC GGGGAGCGCT GTCAGCGAAT GCTTGGCTTA TCATGGGGTT
TCAGTGCCTT TACTCCTGCA TGGTTTACCT GATAATTTTT TAGAACATGG CTCCCGTGAG
GCGCTCTTGG AGCAGTGTCA TTTGAATGCT GAGGGCATTC TCCAGCGCGT GAAAACCTAC
CGTGCTCGGC TGCCTAAGTC CAAGGCTAGC GTGGTTTCCT CCGCAGCAGG TACCCATGGT
TAA
 
Protein sequence
MASVTSYPLL EQIDSPERLR RLPESDLETL AEELRDFLLH SVARSGGHLA AGLGTIELTI 
ALHYIFATPE DRLVWDVGHQ AYPHKVLTGR RERLGTIRQA GGLAPFPSRH ESPYDTFGVG
HSSTSISAAL GMAIAANEKG EKRKTVAIIG DGGMTAGMAY EALDHAGALG ADLLVILNDN
EMSISPNVGA ISSYLTRLLS GRVYSTVREG SKKVLERMPP PMWELARRTE EHVKGMVAPG
TLFEEMGFNY FGPIDGHDLS SLIRTLRNLH KLTGPRLLHI VTCKGKGYTL AEENPVTYHG
VTPFDPKVGI QQGPQKPSSA MSYTQVFSQW LCDMAAQDGL LVGITPAMRE GSGLVKFSEC
FPERYFDVAI AEQHSVTLAA GMACDGLKPV VAIYSTFLQR AYDQLIHDVA LQNLPVLFAI
DRAGVVGPDG PTHAGSFDLT YLRCIPNLVV MAPADENECR QMLYTGFLLN QPAAVRYPRG
KGPGVAVEAS MTALPLGKAE LKRKGRGIAI LAFGATVAPA LEAAEKLDAT VVNMRFVKPL
DEDLVLEMAM NHELLVTVED NVIAGGAGSA VSECLAYHGV SVPLLLHGLP DNFLEHGSRE
ALLEQCHLNA EGILQRVKTY RARLPKSKAS VVSSAAGTHG