Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2075 |
Symbol | |
ID | 3705246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2383915 |
End bp | 2384685 |
Gene Length | 771 bp |
Protein Length | 256 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637738550 |
Product | HAD family hydrolase |
Protein accession | YP_344065 |
Protein GI | 77165540 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATAT CACAACCGGC TTCCCTGCCT CCTTGCGAAT CACAATCTGT TTCCAATTTA GCCCTTATTA CCCTCGATCT AGATGAAACC GTCTGGCCTA GCAAAGCTGT ATTGAGAAAA GCTGAAGAAA CCCAATTTAA GTGGCTTCAA CAGCAAGCCC CATACCTGAC AGCCAAGCAC GATCTGGAGA GCCTGCGGAG CCATAGACGA TTTATTCGAG AACGGTATAC CGAGATTGCG TACGATCTGA CAGCCGTGCG CACCGCTTCC CTGCGCTTGC TGCTCGAGGA ATTTGGCTAT TCACCAGGCT TAGCGGAAGA GGCCATTGCT ATTTTTCTCG AAGCCAGGAA CTGGGTAACC CCCTACACGG ATGTCCCGCC CGTCCTTGAA AAACTAGCCC GTACTTACCG CCTTGCCTCG CTCACGAATG GCAATGCCGA TGTTCAATAC ACACCGTTAA AAGCTCATTT CCATTTTTCC CTGACCCCTG CTATAGCGGG GGCCGCCAAA CCCGCGCCGG ACATGTTTTA TCGAGCGTTG GAACAGGCAG GTGCTGAGCC CCATCAGGCC GTCCATGTAG GCGATCATCC AGAATGCGAC ATTATTGCCG CCCAGCAAGT AGGCATGCGC GCAGTCTGGA TTAACCGGCT AGAAACCCCC TGGCCAGCGG ATTTGCCACC CCCAGAGGCC ACCATCAAAA ACTTTCACGA ATTTGAACAG TGGCTTTTAC AGGAAACTAA AACCCAGAAG CCATCCGCAA ACTTGTTTTA A
|
Protein sequence | MAISQPASLP PCESQSVSNL ALITLDLDET VWPSKAVLRK AEETQFKWLQ QQAPYLTAKH DLESLRSHRR FIRERYTEIA YDLTAVRTAS LRLLLEEFGY SPGLAEEAIA IFLEARNWVT PYTDVPPVLE KLARTYRLAS LTNGNADVQY TPLKAHFHFS LTPAIAGAAK PAPDMFYRAL EQAGAEPHQA VHVGDHPECD IIAAQQVGMR AVWINRLETP WPADLPPPEA TIKNFHEFEQ WLLQETKTQK PSANLF
|
| |