Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_0692 |
Symbol | |
ID | 5151969 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 709686 |
End bp | 711437 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640555693 |
Product | sulfate thiol esterase SoxB |
Protein accession | YP_001236865 |
Protein GI | 148252280 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGC GGCGCCGTGA TTTTCTTAGA GTGTCGGGTG CCGCTGCGCT GGCCGCGCCG GCGCTGCTGC GCAGCGTCCG CGCCGCCGAG ACCGTCAGCC TCTACGACGT CGAAAAGTTC GGCAATGCGC GCATCCTGCA CATGACCGAC ACGCATGCGC AGCTCAAGCC GGTGTATTTC CGCGAGCCCA GCGTCAATAT CGGCATCGGC GAGATGTGGG GACGGCCGCC GCATCTGGTC GAGCGCGCCT TTCTCGAACG CTATGGCATC CGGCCCGACA GCGCCGAGGC CTATGCCTTC ACCTCGTTCG AGTTCGAGAA ATATTCCGGC CGGTTCGGCC GCATGGGCGG CTTCGCGCAT CTGAAGACCT TGATCGACAA GCTCCGCGCC GATGTCGGCG ACCGGCGCGC GCTGCTGCTC GATGGCGGCG ACCTCTGGCA GGGCAGCGGG CTCGCCAACG CGATGCACGG CGCCGACATG GTGAGCGCGG CCAACCTGCT CGGCATCGAC GCGATGACCG GACATTGGGA GTTCACCTAT GGCGAGGAGG CGCTGCGCGC CAACCTCGCG CGCTTCAAGG GCGAGTTCCT GGCGCAGAAC GTGTTCCTGA CCGAGGAGGC GGCGTTCAAC GACGCCAAGG CGTTCGATCC GGGATCGGGA CGCGTCTTCA AAGCGTCCAT GATCAAGGAG ATCGGCGGCG CGCGCATCGC CGTCATCGGA CAGGCCTTCC CCTACGTGCC GGTCGCGCAT CCGCGCCGCT TCACGCCGGA CTGGACCTTC GGCATCCGCG AGGAGGAGTT GCAGAAACTG GTCGACGGAT TGCGCGGCGC CGACAAGGTC GATGCCGTGA TCCTGCTGTC GCATAACGGC ATGGATGTCG ACCTCAAGCT GGCAGGCCGG GTCACCGGCA TCGACGTGAT TCTCGGCGGC CACACCCATG ATGCCGTGCC GCAGCCGGTC GCGGTCAGCA ACGCCAAGGG CACGACCTTG GTCACCAATG CCGGCTCGAA CGGCAAATTC CTCGCGGTGC TCGATCTCGA CGTCGGCAAG GGCCGCGTCG CCGATGCGCG CTACCGGCTG CTGCCGGTCT ATTCGGAGTT GCTGAAGCCC GACCCTGCCA TGCAGGCGCT GATCGACCAG ACCCAGCAGC CGCAGGTCGC CGCCTGGAGC GAGAAGCTCG CGACCTCGGA CCGCTTGCTC TATCGACGCG GCAATTTCGA GGGCCCGACC GACGACATGA TCTGCGAGGC GTTGCGCAGC CAGCTCGATG CGGAGATCGC GCTGTCGCCC GGGTTCCGCT GGGGCACGAC GTTGATCGCC GGCCAATCCA TCATGCTGGA AGACGTGCTG GCGCAGACGG CGATCAGCTA TCCCGAGACC TATGTGCAGC GGCTGACCGG CGCGCAGATC AAGGACGTGC TGGAAGACGT CTGCGACAAC CTGTTCAACG CCGACCCGTA TCTGCAGCAG GGCGGCGACA TGGTGCGGCT CGAAGGCCTG TCCTACCGCT GCGCACCCGC CGAGGCGATC GGCCGGCGCA TCTCGGACCT CACGCTCGCC AATGGCCGGG CGCTGGAGCC GGGCAAGACC TACAAGATTG CCGGCTGGGC GTCGATGACG GCGCAGGACG GCAAGCCGGT ATGGGAGGTC GTCGCGACGC ATCTGCGCAG CGTCGGCCTG ACCTCGGCGG GAACAAGCAC CGTCACGCTG TCCGGCGTGG ACGGCAATCC CGGTTTTGCA GCGCCGTCCT GA
|
Protein sequence | MTMRRRDFLR VSGAAALAAP ALLRSVRAAE TVSLYDVEKF GNARILHMTD THAQLKPVYF REPSVNIGIG EMWGRPPHLV ERAFLERYGI RPDSAEAYAF TSFEFEKYSG RFGRMGGFAH LKTLIDKLRA DVGDRRALLL DGGDLWQGSG LANAMHGADM VSAANLLGID AMTGHWEFTY GEEALRANLA RFKGEFLAQN VFLTEEAAFN DAKAFDPGSG RVFKASMIKE IGGARIAVIG QAFPYVPVAH PRRFTPDWTF GIREEELQKL VDGLRGADKV DAVILLSHNG MDVDLKLAGR VTGIDVILGG HTHDAVPQPV AVSNAKGTTL VTNAGSNGKF LAVLDLDVGK GRVADARYRL LPVYSELLKP DPAMQALIDQ TQQPQVAAWS EKLATSDRLL YRRGNFEGPT DDMICEALRS QLDAEIALSP GFRWGTTLIA GQSIMLEDVL AQTAISYPET YVQRLTGAQI KDVLEDVCDN LFNADPYLQQ GGDMVRLEGL SYRCAPAEAI GRRISDLTLA NGRALEPGKT YKIAGWASMT AQDGKPVWEV VATHLRSVGL TSAGTSTVTL SGVDGNPGFA APS
|
| |