Gene Oant_3323 details


Gene Information

Locus tag: Oant_3323
Symbol:
ID: 5382282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ochrobactrum anthropi ATCC 49188
Kingdom: Bacteria
Replicon accession: NC_009668
Strand:
Start bp: 656821
End bp: 658329
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 59%
IMG OID: 640836005
Product: sulfatase
Protein accession: YP_001371858
Protein GI: 153010644
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR03417] choline-sulfatase
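
The start, end, and length fields in this record agree with one another if Start bp and End bp are read as 1-based inclusive coordinates and the final codon is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(656821, 658329)
print(length)                     # 1509
print(protein_length_aa(length))  # 502
```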


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC CGAATATCCT CATCCTCATG GTCGATCAGC TGAACGGCAC GTTCTTTCCG 
GACGGCCCGG CCGAATTCCT GCATGTTCCC AATCTGAGAA AGCTTGCCGA ACATTCCGCC
CGTTTTGCCA ACTGCTACAC CGCGAGCCCC CTCTGTGCCC CGGCGCGTGC ATCGTTCATG
TCGGGCCAGT TGCCGTTTCG AACTGGCGTC TATGACAATG CTGCGGAGTT TTCGTCTGAA
ATCCCGACCT ATGCGCATCA TCTCCGCAAC GCTGGCTACC AGACGGCGCT TTCCGGCAAG
ATGCATTTCG TCGGTCCGGA TCAGCTGCAC GGTTTCGAGC AGCGGCTCAC CACCGACATC
TACCCGGCGG ATTTCGGCTG GACGCCAGAT TATCGCAAGC CCGGTGAACG TATCGACTGG
TGGTATCACA ATCTCGGTTC GATCACCGGA GCGGGTTCCG CCGAAATCAC CAACCAGCTC
GAATATGACG ATGAAGTTGC CTATCAGGCC GAAGCAAAGC TCTATGATCT GGCGCGCGGC
CACGACGAGC GGCCATGGTG TCTGACGGTC AGTTTCACAC ATCCGCATGA CCCTTACGTA
GCGCGCAAGC ACTTCTTCGA TCTCTATGCG GGCATTCCCG AACTTGATCC GAAAATCGGA
CCGCTGGCTG AAAGCGAGAT CGACCCGCAC ACCGAACGCC TGTTACGGGC CTGCAAGGCC
GAGGACTACG ATCTGACCCG CGAACAGGTT CGCACGGCAC GTCAGGCCTA TTTCGCCAAT
ATCTCCTATG TGGATGACAA GATCGGCGTC CTGCTCGATG TCGTCGAACG CTGCGGCATG
GCGGATGACA CGATTGTCGT CTTCACTTCC GATCACGGCG ATATGCTTGG CGAACGCGGC
CTGTGGTTCA AGATGAGCTT CTTCGATGGC TCCGCTCGCG TCCCGTTGAT GATTGCGGCC
CCGCAATTGC GGGCTGGCCG CGTGGATGCG CCCGTCTCGA CGCTCGACGT TCTGCCGACA
CTGGCCGACC TTGCCGGGAT CGACCTCAAG GGCATCATGC CGTGGGCCGA CGGCGTTTCA
CTGACAGATG TTGCCTCCGG TACAGCCGAA CGCGGTGCGG TTCCGGTCGA ATATGCAGCC
GAAGGCACCA TTGCGCCAAT GATCTCGCTG CGGGATGGCG ACTGGAAGCT CAACCTGTGC
CGCGCCGACC CGCCACAATT GCTCAACCTT GCCGACGATC CCGACGAGCT GCGCAATCTG
GCGGAACTGC CGGAATTCAA AACCGTTCTG GACGACCTTC TGGCCAAAGC TGAAAAACGC
TGGAATCTGG AACGCTACGA TGCCGACGTC CGGGCCAGTC AGGCTCGTCG CCACGTCGTC
TATCCGGCGT TGCGCAACGG CGCCTATTAT CCATGGGACT ACCAGCCGCT GCAAAAGGCG
TCCGAGCGCT ATATGCGCAA CCACATGGAC CTGAACGTAC TGGAAGAAAA CCAGCGCTTT
CCGCGCTGA
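
The gene sequence is laid out in space-separated blocks of ten bases, so the spacing has to be stripped before the sequence is measured. The sketch below shows one way a GC content figure could be recomputed from such text. It assumes GC content means the percentage of G and C bases over the full sequence length, and the helper names are hypothetical:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGAAAAAGC CGAATATCCT CATCCTCATG GTCGATCAGC TGAACGGCAC GTTCTTTCCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the whole gene sequence through `clean_sequence` first should give a value comparable to the GC content field, subject to how that field was rounded.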
 
Protein sequence
MKKPNILILM VDQLNGTFFP DGPAEFLHVP NLRKLAEHSA RFANCYTASP LCAPARASFM 
SGQLPFRTGV YDNAAEFSSE IPTYAHHLRN AGYQTALSGK MHFVGPDQLH GFEQRLTTDI
YPADFGWTPD YRKPGERIDW WYHNLGSITG AGSAEITNQL EYDDEVAYQA EAKLYDLARG
HDERPWCLTV SFTHPHDPYV ARKHFFDLYA GIPELDPKIG PLAESEIDPH TERLLRACKA
EDYDLTREQV RTARQAYFAN ISYVDDKIGV LLDVVERCGM ADDTIVVFTS DHGDMLGERG
LWFKMSFFDG SARVPLMIAA PQLRAGRVDA PVSTLDVLPT LADLAGIDLK GIMPWADGVS
LTDVASGTAE RGAVPVEYAA EGTIAPMISL RDGDWKLNLC RADPPQLLNL ADDPDELRNL
AELPEFKTVL DDLLAKAEKR WNLERYDADV RASQARRHVV YPALRNGAYY PWDYQPLQKA
SERYMRNHMD LNVLEENQRF PR