Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2718 |
Symbol | |
ID | 3704744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3084992 |
End bp | 3086107 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637739200 |
Product | HAD family hydrolase |
Protein accession | YP_344701 |
Protein GI | 77166176 |
COG category | [R] General function prediction only [S] Function unknown |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily [COG3100] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01484] HAD-superfamily hydrolase, subfamily IIB [TIGR01485] sucrose-6F-phosphate phosphohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAACAAA AAATTTTGCT TTGTTCTGAT CTGGATCGGA CCCTGCTTCC CAACGGCCAT CAAGCGGAGT CGCCCCAGGC GCGGCTTCGG CTGCAACGAT TGGCCCAGCG TCCTGGCATT ATTTTGGCCT ATGTCAGCGG TCGTCATAAG GCGCTCATTC AAAGCGCTAT TCGAGAATAT GATCTTCCTC TTCCTGACTT TGCCATTGGT GATGTGGGGA CGACTATCTA TCAAATTACG GATAATCAAT GGCATCCTTG GGAAGACTGG AGTAAAGAAA TCAGCCAGGA TTGGCAAGGA ATAAACCAGG CTGGGCTGGC AAAACTTTTT GCTGACATTA CTCCACTGCG GTTGCAAGAG CCGGAAAAGC AAAACCGCTA TAAGCTCAGC TATTATGCTC CACCGGAGCT AGATTGGGAG AATCTGATTC CCCAATTGGC CCAGCGGTTA CAAGCGCAGG GAATCCAGGC TTCTTTCATT TGGAGTGTAG ATGAAACTGC CCAGATCGGT TTGCTCGATA TCCTGCCAAA GCGGGCCAAC AAACTCCATG CCATTCGCTT TTTAATGGAG CGTCAGCATT TTGATAAAAG CCATACCGTT TTTGCGGGGG ATAGTGGCAA CGACCTGGAG GTGTTAGCTA GCGGTCTCCA AGCTATTCTG GTCCGCAATG CCCAGGAAGA AGTGCGCCAG GAAGCTCTCC GCCGCCTCCC GCCAGAGCAT AGTCAGCAGC TTTATCTCGC CCGGGGTGGC TTTATGGGGC TTAACGGTTA TTACAGCGCG GGAGTGCTAG AGGGCCTAGC CCATTTTTTT CCTGAAACCC GGGCATGGAT GGAAACAGGG AGAGAAGAGT CAGCGGAGGA AGAAACGGCA CAATCCTGCG CCATTTATCG AAGCTGTAAA AGAAATGATA GCTACTTATA TGTGGAATCT CAGGATGATT TTTCCCGCGT TCCTGGAAAA TTGTTGGAAA TGCTTGGAAA GCTAGAGTTT GTCATGAGAC TGGAGCTGCG TCCCGAGATC TCCCTGGCCC AGGCCAATAC CAGGGAAGTG ATGCAAATGC TCAGGGAGAA AGGCTATTTT TTGCAGTTAT CATCAAGGGA ATACAGGCGG TCTTAA
|
Protein sequence | MKQKILLCSD LDRTLLPNGH QAESPQARLR LQRLAQRPGI ILAYVSGRHK ALIQSAIREY DLPLPDFAIG DVGTTIYQIT DNQWHPWEDW SKEISQDWQG INQAGLAKLF ADITPLRLQE PEKQNRYKLS YYAPPELDWE NLIPQLAQRL QAQGIQASFI WSVDETAQIG LLDILPKRAN KLHAIRFLME RQHFDKSHTV FAGDSGNDLE VLASGLQAIL VRNAQEEVRQ EALRRLPPEH SQQLYLARGG FMGLNGYYSA GVLEGLAHFF PETRAWMETG REESAEEETA QSCAIYRSCK RNDSYLYVES QDDFSRVPGK LLEMLGKLEF VMRLELRPEI SLAQANTREV MQMLREKGYF LQLSSREYRR S
|
| |