Gene BBta_0692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_0692 
Symbol 
ID5151969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp709686 
End bp711437 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content67% 
IMG OID640555693 
Productsulfate thiol esterase SoxB 
Protein accessionYP_001236865 
Protein GI148252280 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGC GGCGCCGTGA TTTTCTTAGA GTGTCGGGTG CCGCTGCGCT GGCCGCGCCG 
GCGCTGCTGC GCAGCGTCCG CGCCGCCGAG ACCGTCAGCC TCTACGACGT CGAAAAGTTC
GGCAATGCGC GCATCCTGCA CATGACCGAC ACGCATGCGC AGCTCAAGCC GGTGTATTTC
CGCGAGCCCA GCGTCAATAT CGGCATCGGC GAGATGTGGG GACGGCCGCC GCATCTGGTC
GAGCGCGCCT TTCTCGAACG CTATGGCATC CGGCCCGACA GCGCCGAGGC CTATGCCTTC
ACCTCGTTCG AGTTCGAGAA ATATTCCGGC CGGTTCGGCC GCATGGGCGG CTTCGCGCAT
CTGAAGACCT TGATCGACAA GCTCCGCGCC GATGTCGGCG ACCGGCGCGC GCTGCTGCTC
GATGGCGGCG ACCTCTGGCA GGGCAGCGGG CTCGCCAACG CGATGCACGG CGCCGACATG
GTGAGCGCGG CCAACCTGCT CGGCATCGAC GCGATGACCG GACATTGGGA GTTCACCTAT
GGCGAGGAGG CGCTGCGCGC CAACCTCGCG CGCTTCAAGG GCGAGTTCCT GGCGCAGAAC
GTGTTCCTGA CCGAGGAGGC GGCGTTCAAC GACGCCAAGG CGTTCGATCC GGGATCGGGA
CGCGTCTTCA AAGCGTCCAT GATCAAGGAG ATCGGCGGCG CGCGCATCGC CGTCATCGGA
CAGGCCTTCC CCTACGTGCC GGTCGCGCAT CCGCGCCGCT TCACGCCGGA CTGGACCTTC
GGCATCCGCG AGGAGGAGTT GCAGAAACTG GTCGACGGAT TGCGCGGCGC CGACAAGGTC
GATGCCGTGA TCCTGCTGTC GCATAACGGC ATGGATGTCG ACCTCAAGCT GGCAGGCCGG
GTCACCGGCA TCGACGTGAT TCTCGGCGGC CACACCCATG ATGCCGTGCC GCAGCCGGTC
GCGGTCAGCA ACGCCAAGGG CACGACCTTG GTCACCAATG CCGGCTCGAA CGGCAAATTC
CTCGCGGTGC TCGATCTCGA CGTCGGCAAG GGCCGCGTCG CCGATGCGCG CTACCGGCTG
CTGCCGGTCT ATTCGGAGTT GCTGAAGCCC GACCCTGCCA TGCAGGCGCT GATCGACCAG
ACCCAGCAGC CGCAGGTCGC CGCCTGGAGC GAGAAGCTCG CGACCTCGGA CCGCTTGCTC
TATCGACGCG GCAATTTCGA GGGCCCGACC GACGACATGA TCTGCGAGGC GTTGCGCAGC
CAGCTCGATG CGGAGATCGC GCTGTCGCCC GGGTTCCGCT GGGGCACGAC GTTGATCGCC
GGCCAATCCA TCATGCTGGA AGACGTGCTG GCGCAGACGG CGATCAGCTA TCCCGAGACC
TATGTGCAGC GGCTGACCGG CGCGCAGATC AAGGACGTGC TGGAAGACGT CTGCGACAAC
CTGTTCAACG CCGACCCGTA TCTGCAGCAG GGCGGCGACA TGGTGCGGCT CGAAGGCCTG
TCCTACCGCT GCGCACCCGC CGAGGCGATC GGCCGGCGCA TCTCGGACCT CACGCTCGCC
AATGGCCGGG CGCTGGAGCC GGGCAAGACC TACAAGATTG CCGGCTGGGC GTCGATGACG
GCGCAGGACG GCAAGCCGGT ATGGGAGGTC GTCGCGACGC ATCTGCGCAG CGTCGGCCTG
ACCTCGGCGG GAACAAGCAC CGTCACGCTG TCCGGCGTGG ACGGCAATCC CGGTTTTGCA
GCGCCGTCCT GA
 
Protein sequence
MTMRRRDFLR VSGAAALAAP ALLRSVRAAE TVSLYDVEKF GNARILHMTD THAQLKPVYF 
REPSVNIGIG EMWGRPPHLV ERAFLERYGI RPDSAEAYAF TSFEFEKYSG RFGRMGGFAH
LKTLIDKLRA DVGDRRALLL DGGDLWQGSG LANAMHGADM VSAANLLGID AMTGHWEFTY
GEEALRANLA RFKGEFLAQN VFLTEEAAFN DAKAFDPGSG RVFKASMIKE IGGARIAVIG
QAFPYVPVAH PRRFTPDWTF GIREEELQKL VDGLRGADKV DAVILLSHNG MDVDLKLAGR
VTGIDVILGG HTHDAVPQPV AVSNAKGTTL VTNAGSNGKF LAVLDLDVGK GRVADARYRL
LPVYSELLKP DPAMQALIDQ TQQPQVAAWS EKLATSDRLL YRRGNFEGPT DDMICEALRS
QLDAEIALSP GFRWGTTLIA GQSIMLEDVL AQTAISYPET YVQRLTGAQI KDVLEDVCDN
LFNADPYLQQ GGDMVRLEGL SYRCAPAEAI GRRISDLTLA NGRALEPGKT YKIAGWASMT
AQDGKPVWEV VATHLRSVGL TSAGTSTVTL SGVDGNPGFA APS