Gene Noc_1015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1015 
Symbol 
ID3707276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1123160 
End bp1124926 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content57% 
IMG OID637737520 
Productsulfatase 
Protein accessionYP_343053 
Protein GI77164528 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGAC GAATGATCGC TTTGCTCGCC ATGCCGCTGC TGCTGGTTTT ATTCTCCCTC 
GGCAGAGCCT GGCTCAGTGA GCAGGGCAGC CTAGCTGCTA TGGGATGCAC TGGATGCCTT
GTGGTTCCAA CGGTGCAGCA AGATCTAGCC CTACTCGCCG CATTCCTCGC GGCAACGGCC
TTGTGGCTGG GATTGCCCCG CTATGTCGGA TGGCTTGCTT GGCTAATACA GGGCGGGTTA
GTGCTTATTA TCGTTGCCGA TCTAATCACC CTGGCGGAAT TTGGCATGCG CTTGAGTTGG
CGCGACGTGT TGAAATTCGG CGGGGAGTGG GGAGCAATAC AGGGATATGT AGGGACGAAA
GCAACAGCCA TCACTGGAAC CCTATGGCTT ATTGGTGTCG GCATTTTACT GAGCAGCTGG
GCGGGTCATC TGGTGGCGGC AAAACGCCTC GGTCAGGGTG GCCTGCAGAG AATCATCATG
GCCGCCGCCG TCTGTTTCTC CCTCTACGCT TTACCCACAA CGGCCTACCA TCCCCTGCCC
TGGCTTTATC GCAACGTGGT GGAAATCAAT CTGCCGAGCG GCGTTGATCG AGCTTATAGC
GAGGCATACC GGTCACGGGT GCTGGCCGCC TATTCCCCAC CACCCTTGCA CTGCATCCAG
GGCCGCGCCT TGAGGCGCAA TGTGGTGATA GTGATCATCG AGTCCTGGTC CTGGTACCAT
AGCCAGGACC GACTGGGGAT CATGAACGCC ACTCCCCAGC TTGATCGCCT TGCCCAACGA
GGAACCCTGT GGACGCAGTT TTTTTCCAAT GGCTTTACCA CCGATCATGG CTTGATCGCC
TTGCTGGGCG GCGTGGCTCC CCTACCCCCT GTTAACCGTT ACCACAGCCT TGAGGGCTAT
ACTGGCTTTG AAGAACTGCC GGACAGCCTG CCCCGGCGAC TCGCCGCGGA TGGCTACGAG
AGCTATTTTT TCACCACCGG CGACCTGGAA TTCATGGACA AGGGGAAATG GCTAAAGCGT
CTGGGATTCC ACAAGGTGGA AGGAGATGAC CATCCATTTT ACCAGGATGC CCAACGATTT
GCCTTCCATG CGGCCCATGA TGGGTGGTTG TATGATCGTT TTTTGCATTG GCTGGAGCAG
GAAGTGCCCC CCAAACGTCC TTATTTGGCG GTGCTGGAAA CCGTCACCAC CCATCCCCCC
TTTGTCGATC CCGAAACGGG CCGCCAAGAT GAGCTGACGG CATTTCGCTT CGCTGACGCT
CAGGCCGCTC GCTTTGTCGA ACACCTGGAT AAGCAGGGGT TTTTTGAGGA GGGCTTGCTG
ATTTTGACCA GCGATCAGCG GGCGTTAAGT CCCTTGCACA CGGCGGAGAT AAAAGCCTTC
GGACCCGCCG CGCCCGCGCT GTTGCCTTTG GTGGTATTAG GGGATTCTTT CGATAGCGGA
AAACAAGTCA CCACCGCCGC GCAAATGGCG GATATGCCAG CGTCTTTAGA TTATCTGTTG
ACTGATCGCG GTTGCCAAGA GGAAGGACGG GGCAACCTCT TTGCCCAACC ACCCCAGTCC
CCGCGCTGCA TCCTCCGTCC CCAAGGTAAT CAAAGGGATA TCGTGGATGC CTATTGTGGC
GACCAACACG CCCAAATCCA GCTTGAGGGG GATAAAACCC GTATACTTCG AGGCATCCTT
CCTCATGGGA AGACGCTCAT TGAGCAGATT AATGTCCAGC GGATTCGCGC AGGAGCGCGG
AAGGTTGAGT TCACCCATAT CCTATAA
 
Protein sequence
MRRRMIALLA MPLLLVLFSL GRAWLSEQGS LAAMGCTGCL VVPTVQQDLA LLAAFLAATA 
LWLGLPRYVG WLAWLIQGGL VLIIVADLIT LAEFGMRLSW RDVLKFGGEW GAIQGYVGTK
ATAITGTLWL IGVGILLSSW AGHLVAAKRL GQGGLQRIIM AAAVCFSLYA LPTTAYHPLP
WLYRNVVEIN LPSGVDRAYS EAYRSRVLAA YSPPPLHCIQ GRALRRNVVI VIIESWSWYH
SQDRLGIMNA TPQLDRLAQR GTLWTQFFSN GFTTDHGLIA LLGGVAPLPP VNRYHSLEGY
TGFEELPDSL PRRLAADGYE SYFFTTGDLE FMDKGKWLKR LGFHKVEGDD HPFYQDAQRF
AFHAAHDGWL YDRFLHWLEQ EVPPKRPYLA VLETVTTHPP FVDPETGRQD ELTAFRFADA
QAARFVEHLD KQGFFEEGLL ILTSDQRALS PLHTAEIKAF GPAAPALLPL VVLGDSFDSG
KQVTTAAQMA DMPASLDYLL TDRGCQEEGR GNLFAQPPQS PRCILRPQGN QRDIVDAYCG
DQHAQIQLEG DKTRILRGIL PHGKTLIEQI NVQRIRAGAR KVEFTHIL