Gene Noc_A0008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_A0008 
Symbol 
ID3704296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007483 
Strand
Start bp5468 
End bp6664 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content52% 
IMG OID637736503 
Productendonuclease/exonuclease/phosphatase 
Protein accessionYP_342051 
Protein GI77163525 
COG category[R] General function prediction only 
COG ID[COG3568] Metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.203843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAAGC GGTTATTAAT CTTATTTTTT CTTTTTCTAC CCACTATCTC AGCGCAAGCG 
TTCACCATTG CAAGCTGGAA TACCAAACAT TTAGGGTGGG GAGACAAGAG GAACTGGAAC
GCAACCGCGG CGGTCGTAGC ACCCTACGAT TTTGTCGCCC TCCAGGAGGT AATGAGTAAA
ACGGCCGTCA ACCATCTGGT CCAGACGCTG AAGAAGCAGA CCGGAGTAAA ATGGTCCTCA
CTGGTTTCGG GAACCAGCGT AGGACGCTCG AAACGTTACC AGGAATTCTA TGCTTTCATC
TGGCGTGAAG AGGCTGTCGA TTATGTGGGC GGCGCTGTGG TCTATTTAGA CCCAGGCGAT
ATCTTTGCGC GGGAGCCTTT TGCAGCACGG TTTCAGACAG ATAATGGAAA GTATCGTTGG
ACTGCGGCCA CCGTACATGT GGTCTATGGA GATAGCCGGG ATGAGCGGCG CCGAGAAGCA
CAGCAGCTTG ATGAGTATGT AAACTGGCTA GAGGAAAACG TCGCTGAAGG AGATCCGGTG
GTTCTGATGG GCGACTTCAA CCTACCCCCG GATTCAGCGG GATTCCGGGA TCTGGCTAAA
GTACTTAAAC CCGCTATCCG GGAAGGGGCA ACGACTCTGT CCGCCAAAGA GGGCCGGTAC
GCCAATCTCT ACGATAATAT CTGGTACCGA CCGGATGCCT TGAAAATCCA GGAAGCCCGG
ATCGATCGTT TCCCTCAGCG TTTGGGAATT ACTCACAAGC TAGCTCGAAA AACCGTCAGT
GACCATGCTC CCGTGGTGAT TGTGCTTGGT GATCCGGTAT CCCCATCTCC AAAAGGAAAA
TTGAATGGCG CACAGACAAC CTCTTCAGCC GAGCGAAAGG CAACATTAGC AATTATTTGC
GTGCATCCCG ATGCGCCTGG AAACGATAAC AAAAATCTGG CCGGTGAATG GGTGGAGATA
CAGAATTCTG GCGCTCAGCA TCTGGATTTA ACCGGCTGGA TACTGGCGGA TGAAGCGGAC
CATAAGATTG CCTTACAAGG CAGCCTTAAT GCTGGCGGTA CCCTTCGGAT AGACTCCACC
GCAATAGGAC GCCCTATATG GAATAATTCG GGGGATACGG CGATTTTGCG TGATCCAGAG
GGGACTGTGG TCTCAACGCT GCGCTACCCC GGCGGGAGAA TTTGCGAAGA TCGCTAA
 
Protein sequence
MGKRLLILFF LFLPTISAQA FTIASWNTKH LGWGDKRNWN ATAAVVAPYD FVALQEVMSK 
TAVNHLVQTL KKQTGVKWSS LVSGTSVGRS KRYQEFYAFI WREEAVDYVG GAVVYLDPGD
IFAREPFAAR FQTDNGKYRW TAATVHVVYG DSRDERRREA QQLDEYVNWL EENVAEGDPV
VLMGDFNLPP DSAGFRDLAK VLKPAIREGA TTLSAKEGRY ANLYDNIWYR PDALKIQEAR
IDRFPQRLGI THKLARKTVS DHAPVVIVLG DPVSPSPKGK LNGAQTTSSA ERKATLAIIC
VHPDAPGNDN KNLAGEWVEI QNSGAQHLDL TGWILADEAD HKIALQGSLN AGGTLRIDST
AIGRPIWNNS GDTAILRDPE GTVVSTLRYP GGRICEDR