Gene Oant_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_1836 
Symbol 
ID5380335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009667 
Strand
Start bp1931532 
End bp1933076 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content57% 
IMG OID640834498 
Productsulfatase 
Protein accessionYP_001370381 
Protein GI153009166 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGAA AAAATGTCCT GCTTATCGTC GTTGATCAAT GGCGAGCAGA TTTTATCCCC 
CACCTGATGC GGGCGGAGGG TCGCGAACCC TTCCTCAAAA CTCCCAATCT TGATCGCTTG
TGCCGGGAAG GCTTGACCTT CCGCAACCAC GTCACAACCT GTGTGCCGTG TGGACCAGCA
AGGGCAAGCT TGCTTACTGG CCTTTACCTG ATGAACCATC GGGCGGTACA GAACACTGTT
CCGCTTGATC AGCGCCATTT GAACCTCGGC AAAGCCCTCC GCGCCATCGG CTATGATCCC
GCGCTCATTG GTTACACGAC GACGACGCCT GACCCGCGTT CGACGTCCCC AAGAGATCCG
CGTTTCACGG TTCTTGGCGA TATCATGGAC GGGTTTCGCT CAGTGGGCGC ATTCGAACCC
AATATGGACG GATATTTCGG CTGGGTGGCG CAGAACGGTT TTGAACTGCC GGAGAACCGG
GAAGATATCT GGCTGCCAGA AGGGGAGTAT TCCGTTCCCG GTGCTACCGA CAAGCCGTCG
CGTATCCCGA AGGAGTTCTC GGATTCCACA TTCTTCACGG AACGCGCGCT GACCTACCTT
AAGGGCAGGG ATGGCAAGCC ATTCTTTCTG CATCTAGGTT ACTACCGCCC GCACCCGCCA
TTCGTCGCCT CCGCGCCTTA TCATGCGATG TACAAGGCCG AAGATATGCC TGCGCCGGTT
CGCGCGGAAA GTCCGGATGC CGAAGCGGCA CAGCATCCGC TTATGAAGCA CTATATAGAT
CATATCAGGC GTGGTTCGTT TTTCCATGGG GCGGAAGGCT CCGGCGCAAC GCTGGACGAA
GGCGAGATTC GCCAGATGCG CGCCACCTAT TGCGGCCTGA TTACGGAAAT CGACGATTGT
CTGGGGCGGG TCTTCGCTTA CCTTGATGAA ACTGGTCAGT GGGACGACAC ACTAATCATC
TTCACCAGCG ACCATGGTGA GCAGCTCGGT GATCATCATC TGCTCGGCAA GATCGGCTAC
AACGACGAAA GTTTCCGTAT TCCTTTGGTT ATAAAGGATG CGGGGGAGAA CCGGCACGCT
GGCCAGATCG AAGATGGGTT TTCCGAAAGC ATCGATGTCA TGCCCACCAT CCTCGAATGG
CTCGGCGGGG AAACGCCACG CGCTTGCGAC GGACGTTCGC TGTTGCCATT TCTGGGTGAG
GGAAAACCCG CCGACTGGCG CACAGAATTG CATTACGAAT TCGACTTCCG CGACGTCTTC
TACGATCAGC CGCAGAACTC GGTACAGCTC TCCCAGGATG ATTGCAGCCT CTGTGTGATC
GAGGACGAGA ACTACAAGTA CGTGCATTTT GCGGCCCTGC CGCCGCTGTT CTTCGATTTG
AAGGCGGACC CGCACGAATT CAACAATCTG GCTGAAGACC CCGCTTATGC GGCTCTCGTT
CGCGACTACG CCCAGAAGGC TTTGTCGTGG CGACTGTCTC ATGCCGACCG GACACTGACC
CATTACAGAT CCGGCCCGCA AGGGCTCACA ACGCGCAACC ATTGA
 
Protein sequence
MTRKNVLLIV VDQWRADFIP HLMRAEGREP FLKTPNLDRL CREGLTFRNH VTTCVPCGPA 
RASLLTGLYL MNHRAVQNTV PLDQRHLNLG KALRAIGYDP ALIGYTTTTP DPRSTSPRDP
RFTVLGDIMD GFRSVGAFEP NMDGYFGWVA QNGFELPENR EDIWLPEGEY SVPGATDKPS
RIPKEFSDST FFTERALTYL KGRDGKPFFL HLGYYRPHPP FVASAPYHAM YKAEDMPAPV
RAESPDAEAA QHPLMKHYID HIRRGSFFHG AEGSGATLDE GEIRQMRATY CGLITEIDDC
LGRVFAYLDE TGQWDDTLII FTSDHGEQLG DHHLLGKIGY NDESFRIPLV IKDAGENRHA
GQIEDGFSES IDVMPTILEW LGGETPRACD GRSLLPFLGE GKPADWRTEL HYEFDFRDVF
YDQPQNSVQL SQDDCSLCVI EDENYKYVHF AALPPLFFDL KADPHEFNNL AEDPAYAALV
RDYAQKALSW RLSHADRTLT HYRSGPQGLT TRNH