Gene BBta_3675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3675 
Symbol 
ID5154291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3838804 
End bp3840465 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content65% 
IMG OID640558515 
Productsulfate thiol esterase SoxB 
Protein accessionYP_001239661 
Protein GI148255076 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0941118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGGCGG CTGCCCTGAC GGGCGGCGCC AGCCTCGCGG GCACATCCCG CGCCTTTGCG 
CAACAGAAGC TGACCGAGAA GGAGCTTCTG GCGTTCGATC CGCTCGGAAA CGTCACCCTG
GTGCATGTCA CAGACATCCA CGGCCAGCTG ATGCCGCTTT ATTTCCGCGA GCCCTCGACC
AATCTCGGGG TCGGCGACGC CAAGGGGCAA CCGCCGCATG TCACCGGCAA GGAGTTTCTG
ACGCGGTTCG GCATTGCGCC CGGATCGTCG TCCGCCTATG CGCTGACCTC GGAGGATTTC
GAGGCGCTTG CCAAGACCTA CGGACGGATC GGCGGACTCG ATCGCGCCGC GACCGTCATC
AAGGCGATCC GCGCCGAGCG CGGCGACAAG GTCGCGTTGC TCGACGGCGG CGACACCTGG
CAGGGGTCCT GGTCGTCGCT GCAGACCCGC GGCCAGGACA TGATCGACTG CATGGCGTTG
CTGAAGCCGG ATGCGATGAC CGGGCATTGG GAGTTCACCT ACGGCACAGA GCGCGTCAAG
CAGGCGGTCG ACGGGCTCGG CTTTCCGTTC CTCGGCCTCA ACATCCGTGA CACCGAGTGG
AATGAGCCGG CGTTCGAGGC CTCGACGATG ATCGAGCGGG GCGGCGTCAA GATCGCAGTG
CTCGGCCAGG CCTTCCCGTA TACACCGGTG GCCAATCCGC GCTGGATGAT CCCGAACTGG
TCGTTCGGCG TGCGCGAGGA GGATGTCCAG ACCCAGGTCG ACAAGGCGCG CAAGGCCGGC
GCCGGGCTGG TGGTGCTGCT GTCGCACAAC GGCTTCGACG TCGACCGCAA ACTGGCCAGC
CGCGTCAAGG GTCTCGACGT CATTCTCACG GGCCACACCC ATGATGCGCT GCCCGAAGCG
GTCAAGGTCG GCAAGACGCT GCTGATCGCA TCCGGCTCCT CCGGCAAGTT CGTGTCGCGG
CTCGATCTCG ACGTCAAGGA TGGCGAGGTC AAGGCCTATC GCTACAGGCT GATCCCGCTG
TTCTCCGACG TGATCACGCC GGATACGGCG ATGGCAGCCA AGATCGCTGA GGTGCGCAAG
CCGTTCGCGG CCGATCTCGG CCGCGCGCTC GGACGCACCG AGACGCTGCT GTACCGCCGC
GGCAACTTCA ACGGCACGTT CGACGATCTG ATCTGTCAGG CTCTGCTGCA GGAACGCGAT
GCGGAGATTG CGCTGTCGCC GGGCTTCCGC TGGGGCACCA GCGTCATGCC GGGCCAGGAC
ATCACCTTCG AGGACGTCAC CAACGCCACC GCGATCACCT ATCCCGCGGT CTATCGCATG
GGCATGACCG GCACGCGCCT CAAGGAAATC ATCGAGGACG TGGCCGACAA TCTGTTCAAT
GTCGACCCCT ACTACCAGCA GGGCGGCGAC ATGGTCCGCA TCGGCGGCAT GTCCTACGCG
ATCGACGTCA ACAAGCCGCA AGGACAGCGC ATCTCCGACA TGCGCCTGAT CAAGACCGGC
GCGGTGATCG ATCCGGCGCG CGAGTATCAG GTCGCCGGCT GGGCCAGCGT CAACGAGGGT
ACGCAGGGCC CGCCGATCTG GGAGGTCGTG TCGGGCTATC TGCAGCGTCA GAAAACCGTT
CGTCTCGAGC CCAATCGGGC CGTCAAAGTC TCCGGAGTGT GA
 
Protein sequence
MAAAALTGGA SLAGTSRAFA QQKLTEKELL AFDPLGNVTL VHVTDIHGQL MPLYFREPST 
NLGVGDAKGQ PPHVTGKEFL TRFGIAPGSS SAYALTSEDF EALAKTYGRI GGLDRAATVI
KAIRAERGDK VALLDGGDTW QGSWSSLQTR GQDMIDCMAL LKPDAMTGHW EFTYGTERVK
QAVDGLGFPF LGLNIRDTEW NEPAFEASTM IERGGVKIAV LGQAFPYTPV ANPRWMIPNW
SFGVREEDVQ TQVDKARKAG AGLVVLLSHN GFDVDRKLAS RVKGLDVILT GHTHDALPEA
VKVGKTLLIA SGSSGKFVSR LDLDVKDGEV KAYRYRLIPL FSDVITPDTA MAAKIAEVRK
PFAADLGRAL GRTETLLYRR GNFNGTFDDL ICQALLQERD AEIALSPGFR WGTSVMPGQD
ITFEDVTNAT AITYPAVYRM GMTGTRLKEI IEDVADNLFN VDPYYQQGGD MVRIGGMSYA
IDVNKPQGQR ISDMRLIKTG AVIDPAREYQ VAGWASVNEG TQGPPIWEVV SGYLQRQKTV
RLEPNRAVKV SGV