Gene Noc_3069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_3069 
Symbol 
ID3704485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3459792 
End bp3461954 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content49% 
IMG OID637739543 
ProductHAD family hydrolase 
Protein accessionYP_345040 
Protein GI77166515 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0438] Glycosyltransferase
[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID[TIGR01482] Sucrose-phosphate phosphatase subfamily
[TIGR01484] HAD-superfamily hydrolase, subfamily IIB
[TIGR01485] sucrose-6F-phosphate phosphohydrolase
[TIGR02468] sucrose phosphate synthase/possible sucrose phosphate phosphatase, plant
[TIGR02471] sucrose phosphate synthase, sucrose phosphatase-like domain, bacterial
[TIGR02472] sucrose-phosphate synthase, putative, glycosyltransferase domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAGC CCGATGATGG CCTATATATT GTGCTTATCA GCCTCCATGG TCTTATTCGT 
GGCCATGAGC TGGAATTAGG CCGAGATGCT GATACTGGTG GCCAGACAAA ATACGCCATT
GAGCTGGCCC GAGCCCTTGC AGAAAATCCT CAAGTTGGCC GGGTTGATTT ATTAACGCGA
AAGGTTATCG ATCCTAAAGT AGGGCAAGAC TATAGCGAAC CGCTAGAATA TCTTGCCCCT
CGAGCTCAGA TTGTACGCTT ATCTTGTGGT CCGCGCCGCT ACCTGCGTAA AGAAGTATTA
TGGCCTTATC TAGGTAGTTT CGCCGATTAT GCGCTACAGC ATATTCGCCG CATCGGTCGC
CTCCCCGATA TCATTCACTC CCATTATGCG GATGCTGCTT ATGTAGGGGT AAGGCTGGCA
GGTTTACTAG GCGTACCGCT AGTGCATACG GGCCATTCCT TAGGCCGGGT AAAACGGCAC
CGGCTCCTTG AAGGGGGAAC CAAAGAGGAG AGTATTGAAA CTCGGTACAA TATGCGGCAA
CGGATTGAGG CTGAAGAACA AGTCCTTAGC ACAGCAGCTC TGGTGGTTGC AAGTACCCAG
CAGGAAGTGG ATGAACAGTA TGCCCTCTAT GATAACTATC ACCCAAAACG GATGGTAGTG
ATTCCTCCCG GCACCGATCT GGAGCGTTTT CACCCGCCCT CCCGTTTTTG GCGTAATGCA
CCGATTGAGC AGGAAATAAA TCGTTTTCTA TCCTACCCCC GTAAACCTCT TATCCTGGCT
TTATCCCGGC CAGATGCGCG GAAAAATATT TCTACTTTGA TCCGTGCCTA TGGAGAAAAC
CCGGCGCTAC GCCAAAAGGT TAATTTAGTA TTAATAGTGG GTAACCGGGA TGATATCGGT
ACCATGGAGA AAGGACCAAG GACGGTACTA AAAGAGATAT TATTATTAAT TGACCGCTAT
GATCTTTATG GCAGTATTGC CTATCCCAAG CATCATGAAG TTGATGATGT GCCAGATTTA
TACCGTTTAG CGGCGCGGTC GAAAGGCGTT TTTATTAATC CCGCACTTAC TGAACCCTTT
GGCCTCACAT TAATTGAGGC GGCAGCAAGT GGTTTACCTG TTATTGCTAC CCACGATGGG
GGCCCCCGGG AAATATTAGA GCACTGCAAG AACGGATGCC TCATTGATCC CCTGGATGCG
GATCGGATGG GCAAGGTACT GCTTGAGTCT CTTTCTGATC GCAACCGCTG GCACCGATGG
GCTAAAAACG GCCTTAAGGG TGCCCAGCAG TATTACTCCT GGCCAGGACA TGTCACTCAA
TATTTGCGCG AGGTGAGTAA AGTCATCCGG AAGGCGAAAA AACCCAGGCT CCAAGCGAAA
AAAAAGAGCC GCTTACCCAT TTCCGAAAAG GTTTTGGTCT GTGATATTGA CAATACCTTA
ACCGGAGATG GAGAGGGCCT TCGCAGCTTG TTTGAAAGCC TCAAGGAGGC AGGCGCTAAA
ATTGGTTTTG GTATTGCTAC CGGGCGGAAT TTTGCTAGTA CTCTCAAAGT GCTTAAGAAA
TGGGATATTC CTCTGCCTGA TCTTCTGATT ACGGGGGTAG GGTCTCAAAT TTTCTACGGG
CCAAATTTGG TGGAAGATCA AAGCTGGCAG CAACATATCC GTTACCGTTG GAAGCGGGAA
TCTATCCTCA AGGCGATGGC TGATATTCCT AATTTGCGTC TTCAGCCTTC CAGTGAACAG
TTACCTTGTA AGATCAGCTA TGATGTAGAT GTTAAAAAAG GGCTTGATAT CCCAGCCATT
GCCCGCCACC TGCGGCAGTT GGACTTGAGC GCCAATATTA TCTATTCCTA TCAGGCTTAT
CTGGATTTAT TGCCGGTGCG GGCTTCTAAG GGAAGCGCGG TTCGTTTTTT CTGCGACAAA
TGGGGTATTC CCCTAGAGCA CCTCCTTGTG GTGGGCGACT CCGGCAGTGA TAAAGAAATG
CTATCTGGGA ATACTTTAGG GGCGGTTGTG GGTAACTATA GCCCTGAACT TGAGTATCTG
CGTGAAGATT CCAGTATTTA TTTTGCTCAA GGCCATCATG CCTGGGGGAT TTTAGAGGCA
TTAGCACATT ATGGCTTTCT TGAGCAGGAG AAAGCGGTAG TGGCCAAGGA AGAGGCACTA
TGA
 
Protein sequence
MNQPDDGLYI VLISLHGLIR GHELELGRDA DTGGQTKYAI ELARALAENP QVGRVDLLTR 
KVIDPKVGQD YSEPLEYLAP RAQIVRLSCG PRRYLRKEVL WPYLGSFADY ALQHIRRIGR
LPDIIHSHYA DAAYVGVRLA GLLGVPLVHT GHSLGRVKRH RLLEGGTKEE SIETRYNMRQ
RIEAEEQVLS TAALVVASTQ QEVDEQYALY DNYHPKRMVV IPPGTDLERF HPPSRFWRNA
PIEQEINRFL SYPRKPLILA LSRPDARKNI STLIRAYGEN PALRQKVNLV LIVGNRDDIG
TMEKGPRTVL KEILLLIDRY DLYGSIAYPK HHEVDDVPDL YRLAARSKGV FINPALTEPF
GLTLIEAAAS GLPVIATHDG GPREILEHCK NGCLIDPLDA DRMGKVLLES LSDRNRWHRW
AKNGLKGAQQ YYSWPGHVTQ YLREVSKVIR KAKKPRLQAK KKSRLPISEK VLVCDIDNTL
TGDGEGLRSL FESLKEAGAK IGFGIATGRN FASTLKVLKK WDIPLPDLLI TGVGSQIFYG
PNLVEDQSWQ QHIRYRWKRE SILKAMADIP NLRLQPSSEQ LPCKISYDVD VKKGLDIPAI
ARHLRQLDLS ANIIYSYQAY LDLLPVRASK GSAVRFFCDK WGIPLEHLLV VGDSGSDKEM
LSGNTLGAVV GNYSPELEYL REDSSIYFAQ GHHAWGILEA LAHYGFLEQE KAVVAKEEAL