Gene Noc_2100 details

Gene Information

Locus tag: Noc_2100
Symbol:
ID: 3704410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2413766
End bp: 2414926
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 59%
IMG OID: 637738575
Product: 8-amino-7-oxononanoate synthase
Protein accession: YP_344090
Protein GI: 162139858
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes
TIGRFAM ID: [TIGR00858] 8-amino-7-oxononanoate synthase
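
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values in this record. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2413766
end_bp = 2414926

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1161
print(protein_length)  # 386
```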


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
GTGCCCGATT TAAAAACGGC CCTTACGATG GAGCTGAGGC AGCGCCAGAC TCAGTCTCTT 
TACCGCTATC GGCGGGTATT GGAAGGTCCC CAGGGGGCGG AGCTCCAAAT GGATGGCCGC
CGAATTTTAG CCTTTTGCAG CAATGACTAT TTAGGTCTGG CGAATCACCC CGCAACCCGG
GCTGCCTTTA TGCAAGGAGT CCGAGAGTAC GGGGTTGGCA GTGGGGCGGC CCACCTAGTG
ACGGGCCATA GCCGTGCCCA TCATACGCTA GAGGAGGCTC TGGCGGCGTT TGTGGGACGG
CCCCGGGTGT TGCTGTTTTC TACCGGTTAT TCGGCCAATC TTGGGGTTAT CAGCGCCCTA
ATAGGGCGTC AAGACGCAGT TTTCGAGGAT CGCCTCAATC ATGCCTCCTT GCTAGATGGG
GGGCTGCTTG CGGGTGCCCG CTTTAAACGC TATCGGCACC GGGATTATCA GTCCCTCGAA
GCCGCTTTAA CTGCCACCAA GGCCCGCCGC AGATTGGTGG TGACGGATGG GGTTTTTAGC
ATGGATGGGG CGCTGGCTCC CTTGCCGGAC CTGGCTGCAG TTGCCGACCG TTTTGATGCT
TGGCTGATGG TGGATGATGC CCATGGTCTG GGCGTCTTGG GCGAAGAAGG GCGTGGCAGC
GTGGCCCATT TTGGGCTGGG AATGGCCCAG GCGCCTATTT TGGTGGGTAC CTTGGGTAAA
GCCTTGGGCA CCTTCGGGGC CTTTGTGGCC GGTGAGGAGG CCCTTATTGA AACCTTGATT
CAGCAAGCGC GGACCTACAT CTATACTACA GCCCCGCCCT CTGCGGTGGC CGTAGCGACC
CTGGCCAGTT TGCGGCTGGT TGAAACCGAA TCCTGGCGTC GGGATAAATT AACCCGCTTG
ATTGCCCAAT TTCGGCAAGG CGCCGCTCAG TTGGGGCTTC AGCTCGTGGA TTCCCCGACC
CCTATCCAGC CGTTGCTGGT GGGAGATGCT GGGGCTGCAG TTAAACTGAG CGAGCGCTTG
CTTGCGCAAG GGATACTGGT GACTGCCATC CGCCCGCCCA CGGTGCCAGA GGGAAGTGCC
CGCCTACGAA TTACTTTAAC GGCGGCTCAT TCCGAAGCCC AGGTAGCACG CTTGCTGGAG
TCGCTAGTTC AAGTTTTATG A
 
Protein sequence
MPDLKTALTM ELRQRQTQSL YRYRRVLEGP QGAELQMDGR RILAFCSNDY LGLANHPATR 
AAFMQGVREY GVGSGAAHLV TGHSRAHHTL EEALAAFVGR PRVLLFSTGY SANLGVISAL
IGRQDAVFED RLNHASLLDG GLLAGARFKR YRHRDYQSLE AALTATKARR RLVVTDGVFS
MDGALAPLPD LAAVADRFDA WLMVDDAHGL GVLGEEGRGS VAHFGLGMAQ APILVGTLGK
ALGTFGAFVA GEEALIETLI QQARTYIYTT APPSAVAVAT LASLRLVETE SWRRDKLTRL
IAQFRQGAAQ LGLQLVDSPT PIQPLLVGDA GAAVKLSERL LAQGILVTAI RPPTVPEGSA
RLRITLTAAH SEAQVARLLE SLVQVL
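
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the tool that produced this record. It assumes translation table 11 uses the standard codon assignments plus a set of alternative start codons that are read as methionine in the first position; the start codon set below reflects the NCBI listing for table 11 as best understood here. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers.

```python
# Codon table in the usual NCBI layout: bases ordered T, C, A, G
# at each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: start codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Drop the spaces and line breaks used to group bases, and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    protein = []
    for index, codon in enumerate(codons):
        # An alternative start codon such as GTG is read as M in position 1.
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
first_bases = "GTGCCCGATT TAAAAACGGC CCTTACGATG"
print(translate(first_bases))  # MPDLKTALTM
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` on the same input gives the value behind the GC content field.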