Gene BBta_0599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_0599 
Symbol 
ID5148989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp602089 
End bp604377 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content59% 
IMG OID640555606 
Productputative arylsulfatase 
Protein accessionYP_001236778 
Protein GI148252193 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.217593 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGATG TTCGCAACGC CGGCGCGCCG CCGCCGCGTT TCGAGATCAA GGCGCCCTCA 
CAGGCTCCGA ACGTGCTGAT TGTCCTGATT GATGACATGG GCTTCGGCCA GTCGTCGGCA
TTCGGTGGCC CAATTCACAT GCCGACAGTC GAGCAACTTG CGAGTGGCGG ATTGCGATAC
AATCAGTTCC ACACGACGGC GTTGTGTTCA CCGACCCGAG CAGCCCTGCT TTCCGGCCGC
AATCACCATG TGAACAATTT CGGCTCGATT GCCGAGACAG GCACCTCATT TCCGGGCCAG
ACCGGTCAGC GTCCTAACAA CGTTGCGTCA GTCGCTGAGA TGCTGCGTCT GAACGGCTAC
AGCACCGCCC ATTTCGGCAA AAATCACGAG ACGGCCGCCT GGGAGGTCAG TGCTTCGGGA
CCGACGGATC GCTGGCCAAC CCGCCAGGGA TTCGACAAGT TCTATGGTTT CATGGGAGGC
GAAACCAACC AGTGGGCCCC ACTGATCTAC GATGGCACGA CGCAGGTCGA GTTGCCAAAG
GATCCTAACT ATCACTTCAT GACGGATATG ACGGACAAGG CCATAGCCTG GATGAAGTCG
CAGAAATCGC TGACTCCTGA CAAGCCGTTC TTCATGTATT TCGCTCCGGG GGCGACCCAT
GCGCCTCATC ACGTGCCCAA GCAGTGGATC GCGAAATATA AGGGCAAGTT CGACCAGGGA
TGGGACGCAT TACGCGAAGA GACTCTGGCA CGGCAGATCA AGCTCGGCGT CGTTCCTGCG
GGCACCAAGC TCGCACCTAA GCCGGACGCC ATTGCGGACT GGGCCAAGCT TAGCGGTGAC
GAAAAAAAGC TGTTCACGAG GCAGATGGAG GTATTTGCCG GTTTCGCCGA ATATACCGAC
ACCGAAATCG GACGCTTGGT CGACGCCATC AAAGCCACCG GCCAACTCGA CAATACGCTC
ATCTTCTACA TCGTTGGCGA CAATGGCGCG AGTGCCGAGG GTGGCATGAG CGGAATGTTC
AACGAGATGA CCTATTTTAA CGGTGTGCAG GAGACCGTTC AGGACGTGCT CAAGCATTAC
GATGAGCTGG GCGGGCCAAA CACTTACAGC CACTACGCAG CCGGCTGGGC GGTTGCTGGC
GACACGCCGT TCACCTGGAC CAAGCAGGTC GCGTCGAGCT ATGGTGGTAC CCGTAACGGA
ATGGTGGTGT ATTGGCCGAA GGTGATCAGG GCCAAGAACG AGGTGCGGAC ACAATGGCAC
CACGTCGTGG ACATCGCACC AACCATCCTG GAAGCAGCAA GTTTGCCGGA GCCTAAGAGT
GTGAATGGCA CCATCCAGGT GCCGATCGAA GGCACCAGCA TGGTCTACTC GTTCGACGCC
CCCAAGGCGG AGAGCACCCA TAAGACGCAG TACTTCGAGA TCTTCGGCAA CCGCGCAATC
TATCATGACG GATGGCTCGC GGGCACGATC CATCGTGCGG CTTGGGAGAC CAAGCCTCGA
CGTCCACTCG ATCAGGATGT CTGGGAACTG TACGATACCC GGTCGGACTT CAGCCTGGCG
AACGACTTGG CGGCGAAGGA TCCTGACAAG CTGAAGGAGA TGCAAGACCT CTTCATGAAG
GAGGCAGAAG CAAACTCTGT GTTGCCTCTC GACGACCGTA CCCTCGAACG GGCCAACGCG
GCACTGGCTG GGCGACCGGA TTTGATGGCA GGCCGCACGA GTCTCACGGT GCACGAAGGC
ATGGCGGGAA TGTCGGAGAA CGTCTTCATC AACATCAAGA ATCGGTCGCA CACGATCACG
GCCGAAGTCG ACATTCCGAA GGGTGGCGCC AACGGCGTGA TCCTTGCACA AGCCGGGCGG
TTCGGCGGCT GGAGCCTTTA TCTGAAGGAC GGCAAACCGA CCTACACCTA CAACTTCCTC
GGCCTGGAAC GGTTCACAGT TAGCGCGAAG CAGGCAGTGC CCGCCGGAAG AGCAACAATC
CGGTTCGAGT TCGCGTATGA CGGCGGTGGC GTCGGCAAGG GTGGCCTCGG CACTATCATC
GTTAACGGTC GGAGCGTTGC GACCGGGCGG ATCGAGCGTA CTGAGTTTGG GGTTTTCTCG
GCGGATGAAG GCGCTGATGT CGGTGCCGAC GAGGGGACGC CCGTCACCGA AAGTTACAAG
GTCCCGTTCA AGTTCACCGG CAAGATAGCG AAGGTGACGA TCGATCTGCT CGACATGAAG
AAGGCCGATA TTGAGGACGC GAAGCGGGCT CGAAAGGCCG CCTTGGTCAA GAAAGGCCTC
TCCGATTGA
 
Protein sequence
MLDVRNAGAP PPRFEIKAPS QAPNVLIVLI DDMGFGQSSA FGGPIHMPTV EQLASGGLRY 
NQFHTTALCS PTRAALLSGR NHHVNNFGSI AETGTSFPGQ TGQRPNNVAS VAEMLRLNGY
STAHFGKNHE TAAWEVSASG PTDRWPTRQG FDKFYGFMGG ETNQWAPLIY DGTTQVELPK
DPNYHFMTDM TDKAIAWMKS QKSLTPDKPF FMYFAPGATH APHHVPKQWI AKYKGKFDQG
WDALREETLA RQIKLGVVPA GTKLAPKPDA IADWAKLSGD EKKLFTRQME VFAGFAEYTD
TEIGRLVDAI KATGQLDNTL IFYIVGDNGA SAEGGMSGMF NEMTYFNGVQ ETVQDVLKHY
DELGGPNTYS HYAAGWAVAG DTPFTWTKQV ASSYGGTRNG MVVYWPKVIR AKNEVRTQWH
HVVDIAPTIL EAASLPEPKS VNGTIQVPIE GTSMVYSFDA PKAESTHKTQ YFEIFGNRAI
YHDGWLAGTI HRAAWETKPR RPLDQDVWEL YDTRSDFSLA NDLAAKDPDK LKEMQDLFMK
EAEANSVLPL DDRTLERANA ALAGRPDLMA GRTSLTVHEG MAGMSENVFI NIKNRSHTIT
AEVDIPKGGA NGVILAQAGR FGGWSLYLKD GKPTYTYNFL GLERFTVSAK QAVPAGRATI
RFEFAYDGGG VGKGGLGTII VNGRSVATGR IERTEFGVFS ADEGADVGAD EGTPVTESYK
VPFKFTGKIA KVTIDLLDMK KADIEDAKRA RKAALVKKGL SD