Gene Noc_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0740 
Symbol 
ID3707006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp802001 
End bp803323 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content46% 
IMG OID637737242 
Productsulfatase 
Protein accessionYP_342783 
Protein GI77164258 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCTGGC CTATTAATAG CTCATCTTTG GTTAGTGGAA GAGAGAAGCA GCCGCCTAAT 
GTTATCTTGA TTGTCGCTGA CGATATGGGT TATGGAGATG TGGGATGTTA TGGAAATCAG
CATATAAAAA CTCCAAATCT CGATGCTTTG GCAAAAAAGG GGGCGCGATT TACGGATTTT
CATAGTAACG GCCCGCTTTG CACGCCCACC CGGGCCGCGC TTTTGACCGG ATGTTATCAG
CAAAGAGTAG GTCTTCATAT TATCCCGAAG GATCAGCGCT ATGCCATGGC TAAAGCGATG
TCCCTGGAAG AAATTACTTT TGCAGAAGCG CTAAAATCGG TGGGTTATAG CACGGCACTG
GTAGGTAAAT GGCATTTGGG GGATCGTCCT GCTTTTTTGC CTCCTCGACA GGGTTTTGAT
GAGTATTTCG GGATTCCTTA CAGCCATGAT ATGCACCCCT GGCGTAAGTC GTTTCCGCCT
CTCCCCTTGA TGAGGGGTGA AGAGATTGTC GAGCTAAATC CTGATCTGGA TCACTTGACG
CAGTATTGTA CCGAGGAAGC CGTCAAATTT ATTAGCAAGA ATAAAGACCG CCCTTTCCTG
CTTTATATGC CTCATCCGAT GCCCCACCAG CCGGTGCATG TTTCCGAGAG ATTCGCAAAA
CGATTTTCCA AGGAACAACT AGCTGCTATT AAGGGAGAAG ATAAAAAATC CAGGAAATTT
CTTTACTCTG CCACTATTGA GGAAATTGAT TGGAGTGTCG GTGAGATTAT TAAGGCGGTG
AGAGCGTTAG GGATAGAAGA AAGTACTTTC GTTGCCTTTA CATCCGACAA TGGCCCAGCT
ATTGGTTCCG CGGGCCCATT GAGGGGGAAA AAAAGAGAAC TATGGGAGGG TGGGCATAGG
GTGCCTTTCA TTGCCTACTG GCAGGAAAAA ATCAGACCAG GTGTAGTGAT TGACGAAATC
GCAATGAGTA TGGATTTGTT TCCCACCATG GCGGCAATGG GGAGAGCGCC ATTGCCTAGA
AAAAAAATAG ATGGCGTTAA TTTACTGCCG TTGCTTTGTG AAGGCGATAA ACTTTCGGAA
AGGACGGTTT TTTGGCGCAG CAAGGGTAAA AAAGCAGCCC GTAAAGGGCC ATGGAAACTG
CTCATGCAGC CTACTAAGAA AAAAAGACCA ACAAGTATAG GTTTGTATCA TCTTAATAAC
GATCTTTCAG AACAACATAA TCTTGCTGAA ATTTACCCGG AGAAATTAAA AAGTTTACAG
CTTGAGTTTG CCGCTTGGGA AAAATATGTG GATGCTGGCA GGGCGCAGAA AGACGAATGG
TAA
 
Protein sequence
MPWPINSSSL VSGREKQPPN VILIVADDMG YGDVGCYGNQ HIKTPNLDAL AKKGARFTDF 
HSNGPLCTPT RAALLTGCYQ QRVGLHIIPK DQRYAMAKAM SLEEITFAEA LKSVGYSTAL
VGKWHLGDRP AFLPPRQGFD EYFGIPYSHD MHPWRKSFPP LPLMRGEEIV ELNPDLDHLT
QYCTEEAVKF ISKNKDRPFL LYMPHPMPHQ PVHVSERFAK RFSKEQLAAI KGEDKKSRKF
LYSATIEEID WSVGEIIKAV RALGIEESTF VAFTSDNGPA IGSAGPLRGK KRELWEGGHR
VPFIAYWQEK IRPGVVIDEI AMSMDLFPTM AAMGRAPLPR KKIDGVNLLP LLCEGDKLSE
RTVFWRSKGK KAARKGPWKL LMQPTKKKRP TSIGLYHLNN DLSEQHNLAE IYPEKLKSLQ
LEFAAWEKYV DAGRAQKDEW