Gene BBta_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3535 
Symbol 
ID5151559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3689431 
End bp3691710 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content68% 
IMG OID640558388 
Productputative arylsulfatase 
Protein accessionYP_001239534 
Protein GI148254949 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.441448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGG CAGCATTCGC CGGCGCCATC GGCAGAACCG TCGCGAGCTC CAAACCCTGG 
TGGCCCGCGC CGCCGCGGCC GCCGGCGGGC GCGCCGAACA TCCTTGTGGT GCTGTTCGAC
GATGTCGGAT TCTCCGATTT CGGCTGCTAC GGCTCGCCGA TCCGCACCCC TGCGATCGAC
GCGCTGGCCG CGCAGGGGCT GCGTTATTCC GGCTTCCACA CCACGGCGAT GTGCTCAACG
ACGCGGGCGG CGCTCCTGAC AGGCCGCAAT CATCATTCGG TCGGCGTCGG CTGTCTCGCC
AATTTCGACT CTGGGTATCC TGGTTATCGC GGCAAGATCG CGCGCGAGGC GGGGACCATC
GCCGAGATGC TGCGTGCGCA TGCCTATCGC AACTACATGG TCGGCAAGTG GCATGTGACG
CCGTTGACCG AGAGCGGCGC GACCGGGCCG TTCGACGGCT GGCCGCTCGG CCGTGGCTTC
GACCGCTTCT ACGGCTTCCT CGATGCCGAG ACCGATCAAT ATGCGCCGGA GCTCGTGCTC
GACAACACGC ATATCTCGCC GCCCGGCAGC TTCGCCGACG GCTATCATCT GACCGCCGAC
CTCATCGACC AGGCCATCCG CTTCATCGCC GACCACACGG CCGATCGGCC TGACCTGCCG
TGGCTGACCT GGCTCGCCCT CGGCGCCTGC CACGCGCCGC ACCAGGCGCC GCGCGACATC
ATCGAGAGCT ATGATGCGGT CTTCGCGCAT GGCTGGGACG TCGAGCGCGC GCAACGCCTG
GCGCGGCAGA AGGCGATGGG GCTGGTGCCG GAGACGACGG ACCTGCCGCC GCGCAATGAC
GGCGTGAAGG CCTGGGAGAC GCATTCGGAC GAGGAGCGGC GCGTCTTCAC CCGGCTGCAA
TCGGCCTTCG CCGGCATGCT CGATCATGCC GACCAGCATC TGGCGCGGTT GCTGGCGTTC
CTCGACAGGA CCGGCCAACG TGACAACACG CTGGTGATCG TGATGTCCGA CAATGGCGCC
AGCCAGGAGG GCGGACCGCT CGGCTTCGTC AACGCCATGG GACCGTTCAA CTTCAAGCCG
GAGCCGATCG CCGAGAAACT CGCCCGCATC GACGACATCG GCGGCCCCGA CACCCATTCG
AATTTCCCGC ATGGCTGGGC GATGGCGTCC AACACGCCGC TCCGGCGCTA CAAGCAGAAC
ACCCATGGCG GCGGCATCCG CGATCCCTTC ATCCTGTCAT GGCCGCAACG CATCGCCGCG
CAGGGCGAAT TGCGCCACCA ATTCGTCCAT GCCAGCGACC TGGTGCCGAC CCTGCTCGAC
CTGATCGGCA TCGCGGCGCC GGCCACGATC GCAGGCGTGC CGCAGATGCC GCTGGAGGGC
GTCAGCTTCG CGGCGTCGAT CGCGGATGCG ACGGCGCCAT CGAAGCCGGT GCCGCAATAT
TTCGAGATGT TCGGGCATCG CGGCCTCTGG CACGACGGCT GGAAGGCGGT CGCCTTTCAT
CCCCCGGGCA CGCCGTTCGA CAACGACAAA TGGGAGCTGT TTCATCTCGC CGAGGATTTC
TCCGAGACGC ATGATCTCGC CGCCGCCGAG CCGGAGCGGC TCGCGGCGCT GGTCAAGCTG
TGGTGGGAGC AGGCGGAGGC GCATCAGGTG CTGCCGCTCG ATGACCGCTT CGGCCCGCGT
TTCGCCGAGA ACGCCGCACG CTTCCACGGC GCCCGCACGC GCTTCGTGTT TCACGCCGGC
ATGGGCCATG TGCCGACCGA CGTCGCACCC GACGTGCGCA GCCGCGACTA CCTGATCGAG
GCCCATGTCG AGATCGGACC GGAGGGCGCC GAGGGCGTCC TGATCGCGCA TGGCGATGCC
ACATCGGGCT ACAGCCTCTA CGTCAAGGAC GGCCATCTCG TGCACGATCT CAACATCGGT
GGCCGGCACG AGATCGTCAG CTCGACTCGC GCGGTGCCGG CGGGAGCGCA TCGCCTCGGC
CTGCGGGTCG AGCGTCTCCG GCGCGAGAGC GAGCCTGCCA AGGGCGCGCG CACCGGCTTC
AGCCAATACA CCTTGCTGAT CGACGGCGAA GAGGTCGGCG CGCTGACGAC GCAGCTCGCC
TTTCACACGC TGATCTCATG GTCGGGACTC GACATCGGCC ATGACCGCGG CAGCCCGGTG
TCAGACTACA CGGCGCCGTT CGCGTTCACC GGCCGGCTGT CGCATGTGAC CGTTACAATG
CAGAACGAGC AGAGCCTGGA TGGCGACAGT GTGGGGCGGG CCGAGATGGC GCGGCAATAA
 
Protein sequence
MSEAAFAGAI GRTVASSKPW WPAPPRPPAG APNILVVLFD DVGFSDFGCY GSPIRTPAID 
ALAAQGLRYS GFHTTAMCST TRAALLTGRN HHSVGVGCLA NFDSGYPGYR GKIAREAGTI
AEMLRAHAYR NYMVGKWHVT PLTESGATGP FDGWPLGRGF DRFYGFLDAE TDQYAPELVL
DNTHISPPGS FADGYHLTAD LIDQAIRFIA DHTADRPDLP WLTWLALGAC HAPHQAPRDI
IESYDAVFAH GWDVERAQRL ARQKAMGLVP ETTDLPPRND GVKAWETHSD EERRVFTRLQ
SAFAGMLDHA DQHLARLLAF LDRTGQRDNT LVIVMSDNGA SQEGGPLGFV NAMGPFNFKP
EPIAEKLARI DDIGGPDTHS NFPHGWAMAS NTPLRRYKQN THGGGIRDPF ILSWPQRIAA
QGELRHQFVH ASDLVPTLLD LIGIAAPATI AGVPQMPLEG VSFAASIADA TAPSKPVPQY
FEMFGHRGLW HDGWKAVAFH PPGTPFDNDK WELFHLAEDF SETHDLAAAE PERLAALVKL
WWEQAEAHQV LPLDDRFGPR FAENAARFHG ARTRFVFHAG MGHVPTDVAP DVRSRDYLIE
AHVEIGPEGA EGVLIAHGDA TSGYSLYVKD GHLVHDLNIG GRHEIVSSTR AVPAGAHRLG
LRVERLRRES EPAKGARTGF SQYTLLIDGE EVGALTTQLA FHTLISWSGL DIGHDRGSPV
SDYTAPFAFT GRLSHVTVTM QNEQSLDGDS VGRAEMARQ