Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_0599 |
Symbol | |
ID | 5148989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 602089 |
End bp | 604377 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640555606 |
Product | putative arylsulfatase |
Protein accession | YP_001236778 |
Protein GI | 148252193 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.217593 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGATG TTCGCAACGC CGGCGCGCCG CCGCCGCGTT TCGAGATCAA GGCGCCCTCA CAGGCTCCGA ACGTGCTGAT TGTCCTGATT GATGACATGG GCTTCGGCCA GTCGTCGGCA TTCGGTGGCC CAATTCACAT GCCGACAGTC GAGCAACTTG CGAGTGGCGG ATTGCGATAC AATCAGTTCC ACACGACGGC GTTGTGTTCA CCGACCCGAG CAGCCCTGCT TTCCGGCCGC AATCACCATG TGAACAATTT CGGCTCGATT GCCGAGACAG GCACCTCATT TCCGGGCCAG ACCGGTCAGC GTCCTAACAA CGTTGCGTCA GTCGCTGAGA TGCTGCGTCT GAACGGCTAC AGCACCGCCC ATTTCGGCAA AAATCACGAG ACGGCCGCCT GGGAGGTCAG TGCTTCGGGA CCGACGGATC GCTGGCCAAC CCGCCAGGGA TTCGACAAGT TCTATGGTTT CATGGGAGGC GAAACCAACC AGTGGGCCCC ACTGATCTAC GATGGCACGA CGCAGGTCGA GTTGCCAAAG GATCCTAACT ATCACTTCAT GACGGATATG ACGGACAAGG CCATAGCCTG GATGAAGTCG CAGAAATCGC TGACTCCTGA CAAGCCGTTC TTCATGTATT TCGCTCCGGG GGCGACCCAT GCGCCTCATC ACGTGCCCAA GCAGTGGATC GCGAAATATA AGGGCAAGTT CGACCAGGGA TGGGACGCAT TACGCGAAGA GACTCTGGCA CGGCAGATCA AGCTCGGCGT CGTTCCTGCG GGCACCAAGC TCGCACCTAA GCCGGACGCC ATTGCGGACT GGGCCAAGCT TAGCGGTGAC GAAAAAAAGC TGTTCACGAG GCAGATGGAG GTATTTGCCG GTTTCGCCGA ATATACCGAC ACCGAAATCG GACGCTTGGT CGACGCCATC AAAGCCACCG GCCAACTCGA CAATACGCTC ATCTTCTACA TCGTTGGCGA CAATGGCGCG AGTGCCGAGG GTGGCATGAG CGGAATGTTC AACGAGATGA CCTATTTTAA CGGTGTGCAG GAGACCGTTC AGGACGTGCT CAAGCATTAC GATGAGCTGG GCGGGCCAAA CACTTACAGC CACTACGCAG CCGGCTGGGC GGTTGCTGGC GACACGCCGT TCACCTGGAC CAAGCAGGTC GCGTCGAGCT ATGGTGGTAC CCGTAACGGA ATGGTGGTGT ATTGGCCGAA GGTGATCAGG GCCAAGAACG AGGTGCGGAC ACAATGGCAC CACGTCGTGG ACATCGCACC AACCATCCTG GAAGCAGCAA GTTTGCCGGA GCCTAAGAGT GTGAATGGCA CCATCCAGGT GCCGATCGAA GGCACCAGCA TGGTCTACTC GTTCGACGCC CCCAAGGCGG AGAGCACCCA TAAGACGCAG TACTTCGAGA TCTTCGGCAA CCGCGCAATC TATCATGACG GATGGCTCGC GGGCACGATC CATCGTGCGG CTTGGGAGAC CAAGCCTCGA CGTCCACTCG ATCAGGATGT CTGGGAACTG TACGATACCC GGTCGGACTT CAGCCTGGCG AACGACTTGG CGGCGAAGGA TCCTGACAAG CTGAAGGAGA TGCAAGACCT CTTCATGAAG GAGGCAGAAG CAAACTCTGT GTTGCCTCTC GACGACCGTA CCCTCGAACG GGCCAACGCG GCACTGGCTG GGCGACCGGA TTTGATGGCA GGCCGCACGA GTCTCACGGT GCACGAAGGC ATGGCGGGAA TGTCGGAGAA CGTCTTCATC AACATCAAGA ATCGGTCGCA CACGATCACG GCCGAAGTCG ACATTCCGAA GGGTGGCGCC AACGGCGTGA TCCTTGCACA AGCCGGGCGG TTCGGCGGCT GGAGCCTTTA TCTGAAGGAC GGCAAACCGA CCTACACCTA CAACTTCCTC GGCCTGGAAC GGTTCACAGT TAGCGCGAAG CAGGCAGTGC CCGCCGGAAG AGCAACAATC CGGTTCGAGT TCGCGTATGA CGGCGGTGGC GTCGGCAAGG GTGGCCTCGG CACTATCATC GTTAACGGTC GGAGCGTTGC GACCGGGCGG ATCGAGCGTA CTGAGTTTGG GGTTTTCTCG GCGGATGAAG GCGCTGATGT CGGTGCCGAC GAGGGGACGC CCGTCACCGA AAGTTACAAG GTCCCGTTCA AGTTCACCGG CAAGATAGCG AAGGTGACGA TCGATCTGCT CGACATGAAG AAGGCCGATA TTGAGGACGC GAAGCGGGCT CGAAAGGCCG CCTTGGTCAA GAAAGGCCTC TCCGATTGA
|
Protein sequence | MLDVRNAGAP PPRFEIKAPS QAPNVLIVLI DDMGFGQSSA FGGPIHMPTV EQLASGGLRY NQFHTTALCS PTRAALLSGR NHHVNNFGSI AETGTSFPGQ TGQRPNNVAS VAEMLRLNGY STAHFGKNHE TAAWEVSASG PTDRWPTRQG FDKFYGFMGG ETNQWAPLIY DGTTQVELPK DPNYHFMTDM TDKAIAWMKS QKSLTPDKPF FMYFAPGATH APHHVPKQWI AKYKGKFDQG WDALREETLA RQIKLGVVPA GTKLAPKPDA IADWAKLSGD EKKLFTRQME VFAGFAEYTD TEIGRLVDAI KATGQLDNTL IFYIVGDNGA SAEGGMSGMF NEMTYFNGVQ ETVQDVLKHY DELGGPNTYS HYAAGWAVAG DTPFTWTKQV ASSYGGTRNG MVVYWPKVIR AKNEVRTQWH HVVDIAPTIL EAASLPEPKS VNGTIQVPIE GTSMVYSFDA PKAESTHKTQ YFEIFGNRAI YHDGWLAGTI HRAAWETKPR RPLDQDVWEL YDTRSDFSLA NDLAAKDPDK LKEMQDLFMK EAEANSVLPL DDRTLERANA ALAGRPDLMA GRTSLTVHEG MAGMSENVFI NIKNRSHTIT AEVDIPKGGA NGVILAQAGR FGGWSLYLKD GKPTYTYNFL GLERFTVSAK QAVPAGRATI RFEFAYDGGG VGKGGLGTII VNGRSVATGR IERTEFGVFS ADEGADVGAD EGTPVTESYK VPFKFTGKIA KVTIDLLDMK KADIEDAKRA RKAALVKKGL SD
|
| |