Gene Anae109_3235 details


Gene Information

Locus tag: Anae109_3235
Symbol:
ID: 5376542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter sp. Fw109-5
Kingdom: Bacteria
Replicon accession: NC_009675
Strand:
Start bp: 3789170
End bp: 3791533
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table: 11
GC content: 72%
IMG OID: 640844757
Product: sulfatase
Protein accession: YP_001380413
Protein GI: 153006088
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
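
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3789170
END_BP = 3791533
GENE_LENGTH_BP = 2364
PROTEIN_LENGTH_AA = 787


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2364
print(expected_protein_length(GENE_LENGTH_BP))  # 787
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.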


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCTG AGAAGAACGG CTTTCCGAGG CACGTGGAGC GCGAGGTCTA CCCGAAGCCG 
GAGCACCGCT TCGCCGGCGC GAAGATCGGC CTCACGCACG AGGACTCGCG CCCCGACTAC
CCGGCGCCCG AGCAGGCGCC GCCGCACGCG GCGAACGTGG TGATCGTGCT GCTCGACGAC
GCCGGCTGGG CGGTGTCGAG CGCGTACGGG GGGCTCTGCC GGATGCCGAC GGCGGAGCGG
CTCGCTCGCG AAGGGCTGCA GTACTGCGCG TTCCACACGA CGGCGCTGTG CGCGCCGACG
CGCGCGGCGC TGCTCACCGG CCGGAACCAC CACTCCGCCG CGACGGGTGT CGTGGCCGAG
ATGGCGACCG GCTATCCCGG CTACTCGGGG ATGATCCCGC GGAGCTGCGC GATGATCTCG
GAGATCCTGT CGCAGAACGG CTGGGCGACG GGCTGGTGGG GCAAGAACCA CAACGTGCCC
GATGGTCACA CGAGCGCGGC CGGGCCGTTC GACCACTGGC CGAGCCGGCG TGGCTTCGAC
TACTTCTACG GGTTCGTCGG CGGCGAGACG GATCAGTTCT ATCCCGCGCT GTATCGCGAC
ACGACGCCCG TCGCGCCGCC GAGGACACCG GAGGAGGGGT ATCACCTCAC CACGGACCTC
GCGGACGACT GCATCGCCTG GATGCGCCGC CAGAAGGCGA TCGCGCCGGA GCGACCGCTG
TTCGTGCACT TCGCGCCCGG CGCCGTCCAC GGCCCGCACC AGCCGCCGCT CGCGTGGCGC
GGCCGCAACG CGGGGCGGTT CGACATGGGC TGGGACCGCT GCCGCGAGCT CGTCCACGCG
CGCCAGCTCG AGCTCGGCGT CATCCCCCCC GCAACGCGCC TGACGCCGCG CCCCGCGGAG
CTGCCGGCCT GGGACTCCTT CGGTCCGGAG GAGCGGCGGC TCTTCGCGCG CCAGATGGAG
AACTTCGCCG ACTTCCAGGA GCACACCGAC TTCGAGGTCG GCCGCCTCGT CGAGGCGCTC
GAGGCGCTCG GCGAGCTCGA GAACACGCTC TTCCTCTACA TCCTCGGCGA CAACGGCTCG
AGCGCGGAGG GGAGCCTCCA CGGCACGATC AACGAGACGG CGTCGATGAG CGGCGTCGAG
CCGCCGCTCG CGCAGACCCT CGCGCGCATC GACGAGATCG GGCTCCCCGG GACCTGGCCG
CACTACGCCG TGGGCTGGGC GTGGGCGGGC GACACTCCGT ACCAGTGGGT GAAGCAGGTC
GCCTCGCACT TCGGCGGGAC GCGCAACGGC CTCGTCGTGA GCTGGCCCGC GTGGATCGCG
GATCGCGGCG CGAAGCGGTT CCAGTTCCAC CACGTCGTGG ACGTGGTGCC GACCCTGCTC
GAGGTGGCCG GGATCGCGGA GCCGGCGATG GTCGACGGCG TGACGCAGAA GCCGATCGAG
GGCGTCAGCA TGGCCTACAC GTTCGACCGG CTGAACGCGG ACGCGCCCAC CCGCAAGGAG
ACGCAGTACT TCGAGATGCT CGGCAATCGC GGCATGTACC GCGACGGCTG GTTCGCGGCC
TGCCGCCACG GACGGCTCCC GTGGGAGACG AGCGGCAGCG CCGACTTCGC CGAGGACCGC
TGGGAGCTGT ACGACCTGCG CGACGACTTC AGCCAGGCCG AGGACCTCGC GGCGCGCCAT
CCGGAGAAGC TGCGCGAGCT GCGGGACCTG TTCCTCGCCG AGGCCGCGAA GCACGGGGTG
CTGCCGCTCG ACGACCGCTT CGTGGAGCGG TCCGATCCGT CGCTGCGCCC CGGGTTCTTC
ACCGGGCGGA CCCGGCTGGT GCTCGATCCC GGCCTCGTGC GGCTGCCGGA GGGCAGCGCG
CCGCGGACGG CGAACGTGGA CCACGTCCTC ACGGTCATGG CCGAGCTCCC GGAGGGCGGC
GCGGAAGGCG TCCTCGCCTG CATGGGCGGG GACTGCTCTG GGTGGACGCT GTTCGTCGAC
GGCGGCCGGC TCCGCTACCA CTACAACCGC TTCGACTACG ATCGGTACGA CGTCGTCTCC
GACGCGCCGC TCCCGGCCGG CCGCGTGGAG CTGCGCCTCG AGTTCCGGTG CGACGATCCG
CGCAAGCGGG GAGGCGGCGC GACGGTGCGG CTCCTCTGCG ACGGCCGCGT CGTCGGCGAG
GGGCGGGTCG AGAAGCAGGT GACCGGCAGG TTCGGGGAGT GCTTCGACGT GGGGCAGGAC
TCGCTCTCGC CGGTGTGGGG CGGCTACCGC GATCGCCTCC CGTTCCGGTT CACGGGGATC
ATCCAGCGCG TCCACCTCGA GCTCGGCGAG GCGGCGGAGC CGACCGCGGC GGAGCGGCTC
GAGGAGCAGA TCCGGTTCGA CTGA
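
The GC content field of this record can be recomputed from the gene sequence. The sketch below assumes that GC content means the fraction of G and C bases over the whole gene sequence and that the reported percentage is rounded; `gc_fraction` is a hypothetical helper name. It strips the spaces and line breaks used in the ten-base block layout before counting.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    dna = "".join(seq.split()).upper()  # drop the block spacing and newlines
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return gc / len(dna)


# First ten-base block of the gene sequence: 5 G + 2 C out of 10 bases.
print(gc_fraction("ATGGCGGCTG"))  # 0.7
```

Passing the full gene sequence block to `gc_fraction` gives a value that can be compared against the reported GC content.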
 
Protein sequence
MAAEKNGFPR HVEREVYPKP EHRFAGAKIG LTHEDSRPDY PAPEQAPPHA ANVVIVLLDD 
AGWAVSSAYG GLCRMPTAER LAREGLQYCA FHTTALCAPT RAALLTGRNH HSAATGVVAE
MATGYPGYSG MIPRSCAMIS EILSQNGWAT GWWGKNHNVP DGHTSAAGPF DHWPSRRGFD
YFYGFVGGET DQFYPALYRD TTPVAPPRTP EEGYHLTTDL ADDCIAWMRR QKAIAPERPL
FVHFAPGAVH GPHQPPLAWR GRNAGRFDMG WDRCRELVHA RQLELGVIPP ATRLTPRPAE
LPAWDSFGPE ERRLFARQME NFADFQEHTD FEVGRLVEAL EALGELENTL FLYILGDNGS
SAEGSLHGTI NETASMSGVE PPLAQTLARI DEIGLPGTWP HYAVGWAWAG DTPYQWVKQV
ASHFGGTRNG LVVSWPAWIA DRGAKRFQFH HVVDVVPTLL EVAGIAEPAM VDGVTQKPIE
GVSMAYTFDR LNADAPTRKE TQYFEMLGNR GMYRDGWFAA CRHGRLPWET SGSADFAEDR
WELYDLRDDF SQAEDLAARH PEKLRELRDL FLAEAAKHGV LPLDDRFVER SDPSLRPGFF
TGRTRLVLDP GLVRLPEGSA PRTANVDHVL TVMAELPEGG AEGVLACMGG DCSGWTLFVD
GGRLRYHYNR FDYDRYDVVS DAPLPAGRVE LRLEFRCDDP RKRGGGATVR LLCDGRVVGE
GRVEKQVTGR FGECFDVGQD SLSPVWGGYR DRLPFRFTGI IQRVHLELGE AAEPTAAERL
EEQIRFD
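
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own method. It relies on NCBI translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in their permitted start codons) and ignores alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and newlines
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCGGCTG AGAAGAACGG CTTTCCGAGG"))  # MAAEKNGFPR
print(translate("ATCCGGTTCGACTGA"))                   # IRFD
```

Both fragments agree with the start (MAAEKNGFPR) and the end (IRFD) of the protein sequence listed above.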