Gene Noc_2075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2075 
Symbol 
ID3705246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2383915 
End bp2384685 
Gene Length771 bp 
Protein Length256 aa 
Translation table11 
GC content52% 
IMG OID637738550 
ProductHAD family hydrolase 
Protein accessionYP_344065 
Protein GI77165540 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01428] 2-haloalkanoic acid dehalogenase, type II
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATAT CACAACCGGC TTCCCTGCCT CCTTGCGAAT CACAATCTGT TTCCAATTTA 
GCCCTTATTA CCCTCGATCT AGATGAAACC GTCTGGCCTA GCAAAGCTGT ATTGAGAAAA
GCTGAAGAAA CCCAATTTAA GTGGCTTCAA CAGCAAGCCC CATACCTGAC AGCCAAGCAC
GATCTGGAGA GCCTGCGGAG CCATAGACGA TTTATTCGAG AACGGTATAC CGAGATTGCG
TACGATCTGA CAGCCGTGCG CACCGCTTCC CTGCGCTTGC TGCTCGAGGA ATTTGGCTAT
TCACCAGGCT TAGCGGAAGA GGCCATTGCT ATTTTTCTCG AAGCCAGGAA CTGGGTAACC
CCCTACACGG ATGTCCCGCC CGTCCTTGAA AAACTAGCCC GTACTTACCG CCTTGCCTCG
CTCACGAATG GCAATGCCGA TGTTCAATAC ACACCGTTAA AAGCTCATTT CCATTTTTCC
CTGACCCCTG CTATAGCGGG GGCCGCCAAA CCCGCGCCGG ACATGTTTTA TCGAGCGTTG
GAACAGGCAG GTGCTGAGCC CCATCAGGCC GTCCATGTAG GCGATCATCC AGAATGCGAC
ATTATTGCCG CCCAGCAAGT AGGCATGCGC GCAGTCTGGA TTAACCGGCT AGAAACCCCC
TGGCCAGCGG ATTTGCCACC CCCAGAGGCC ACCATCAAAA ACTTTCACGA ATTTGAACAG
TGGCTTTTAC AGGAAACTAA AACCCAGAAG CCATCCGCAA ACTTGTTTTA A
 
Protein sequence
MAISQPASLP PCESQSVSNL ALITLDLDET VWPSKAVLRK AEETQFKWLQ QQAPYLTAKH 
DLESLRSHRR FIRERYTEIA YDLTAVRTAS LRLLLEEFGY SPGLAEEAIA IFLEARNWVT
PYTDVPPVLE KLARTYRLAS LTNGNADVQY TPLKAHFHFS LTPAIAGAAK PAPDMFYRAL
EQAGAEPHQA VHVGDHPECD IIAAQQVGMR AVWINRLETP WPADLPPPEA TIKNFHEFEQ
WLLQETKTQK PSANLF