Gene Bind_2206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2206 
Symbol 
ID6199474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2529247 
End bp2531775 
Gene Length2529 bp 
Protein Length842 aa 
Translation table11 
GC content53% 
IMG OID641706196 
Productsulfatase 
Protein accessionYP_001833314 
Protein GI182679168 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0228911 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.614416 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGAG TGACACGGAG CAAAGTGCGA ACGAGCCTAT CAGCGATCTT TCTTACGCTG 
ACGCTCATAT CGGTGGTGCC TGCGGCCGCT CAACAGCGAG AAGGCGTACC CGGCTCCCCA
AGTGCAACAA CGACGATCGA TGGGCGATAT TTGCCACCCC CTCCACCGCC ATTTGGCGGT
GAAATTAATT TAAACGCCGT GCAATCAAAA CCTTATTGGC CAGCGCGCGT TGTGCCCCCG
AAGGGAGCCC CGAATGTCTT GCTGATTATG ACCGATGACG TCGGTTTTGG CGCTCCGAGC
ACTTTTGGAG GGGTCATTCC CACTCCGGCT CTCGACCGCA TTGCCAATGC TGGCTTGCGT
TATACGCAAT TTCACTCGAC AGCCCTCTGC TCACCAACAC GCGCCGCGCT TATTACCGGG
CGCAACCATC ATTCGGTGGG TTTCGGCCAA ATTTCAGAAA TGTCCACAGG TTTCCCCGGT
TATGACAGTA TCATCACCAG AGATACAGCG ACGATCGGTC GAATTCTGAA AGACAATGGC
TACGCGACCT CGTGGTTCGG CAAGAACCAT AACACTCCAG CCTTTCAGGC GAGCCAAGCG
GGACCTTTCG AGCAATGGCC CATCGGTATG GGCTTTGAAT ATTTCTACGG GTTTGTTGGT
GGTGATACCA GCCAGTGGCA GCCAAATCTG TTCCGCCAAA CCACGGCAAT CTATCCCTAT
ATCGGTCATC CCGACTGGAA CCTGACCACG GCCATGGCCG ATGATGCCAT CGCTCATATC
AAAATGCTGA ACGAAGTCGA TCCCACTAAA CCATTCTTTG TTAATTATAC CCCCGGGGGC
ACCCACGCAC CACACCACCC GACGCCCGAA TGGATCAAAA AGATCAGTGA CATGCATCTA
TTCGACAAGG GCTGGAACAC ACTCCGCGAG ACGATTTTCT CTAACCAAAA GCGGCTCAAT
GTTATCCCAC AAGATGCCAA GCTTACGCCC TGGCCTGATG ATTTGCTGAA GCAATGGGAT
ACACTTACAG AGGACGAGAA AAAGCTGTTC ATTCGTCAAG CCGACGTCTA CGCCGCTTAT
CTTACCTATA CCGACCATGA GATCGGCCGG GTGATTCAAG CGGTCGAGGA TACCGGCAAG
CTCGACAATA CATTGATTAT TTACATCAGC GGCGACAATG GTTCCAGTGC GGAGGGATCA
CCCAATGGCA CACCTAATGA AGTCGCGCAA TTCAACAGCG TCGAGGTGCC GGTAGCAGAA
CAGCTAAAGT ATTTCTACGA CGTCTGGGGT TCGGACAAAA CTTATAATCA CATGGCAGTG
GGTTGGACCT GGGCTTTCGA CACGCCCTAT AAATGGACGA AACAAGTGGC ATCGCATTTC
GGCGGCACAC GGCAAGGCAT GGCCATCAGT TGGCCGGGCC ACATTAAAGA TGTCGGAGGC
ATCCGTAATC AGTTTCATCA CGTCATCGAT ATTGTCCCGA CCGTTTTGGA AGCAGCGAAT
ATCTCGCCTC CGGTGATGGT CGATGGCATC GCCCAAAATC CGATCGAAGG CGTGAGCTTA
GCCTATACTT TCGATAAAGC GAACGCCGAC ACCCCCACCA AGCATCACAC CCAATATTTC
GAGATGTTCG GTAATCGCGG CATCTATCAC GATGGCTGGT ACGCCAATAC ACAACCGATC
AGCCCGCCGT GGAAACTCGG CGCCACACCA AACCCAAACG TCATAAACAG CTATAAATGG
GAGCTTTACG ATCTCACCAA GGATTGGACG CAGAACGAAG ATCTCGCCGC CGCCAATCCC
GCCAAGCTGA AAGAGATGCA GGATCTTTTC CTGGTCGAAG CCGCCAAATA TCAGGTCTTC
CCGCTCGACA ATTCACTCGC TTCACGAATG GTCGTGCCGA GGCCCAGCGT TACGGCCGGA
AGAACTGAAT TCACTTATAC AGGCGAATTG ACCGGCGTCC CGATGGGCGA TGCTCCAAGT
TTGATTGCGG CTTCCTATAC GATCACAGCA GAGATCGATG TCCCTGAAGG TGGCGCCGAG
GGAATACTCG CGACCCAAGG AGGACGCTTC GGCGGATGGG GGTTCTATCT TGTGAAAGGC
AAACCCGTCT TTACCTGGAA CCTGCTCGAT CTCAAACGCG TTCGTTGGGA AAATCCGGAA
GCCCTTTTAC CAGGAAAACA TGCTCTGGTC TTCGACTTCA AATATGACGG CCTCGGTTTT
GGGACGTTGG CCTTCAACAA TGTGAGCGGC CTTGGCCAGG GGGGCACGGG CACGCTCCGG
GTCGATGGGA AAATCGTGGC CACACAAACG ATGGAACGAA CGATTCCGGT TATTCTTCAA
TGGGATGAGA CCTTTGACAT TGGGGCCGAT ACCGGTACGC CTGTTGATGA TAATGACTAT
CAAGTACCGT TCGGATTTAC TGGTAAACTC AACAAACTGA CGATCAAGAT CGATCGACCG
AAATTATCGC CAGAAGACGA GAAACGTCTC ATGCAAGAAG GACAACGCAG CAATCGCATG
AGCGAATAG
 
Protein sequence
MRGVTRSKVR TSLSAIFLTL TLISVVPAAA QQREGVPGSP SATTTIDGRY LPPPPPPFGG 
EINLNAVQSK PYWPARVVPP KGAPNVLLIM TDDVGFGAPS TFGGVIPTPA LDRIANAGLR
YTQFHSTALC SPTRAALITG RNHHSVGFGQ ISEMSTGFPG YDSIITRDTA TIGRILKDNG
YATSWFGKNH NTPAFQASQA GPFEQWPIGM GFEYFYGFVG GDTSQWQPNL FRQTTAIYPY
IGHPDWNLTT AMADDAIAHI KMLNEVDPTK PFFVNYTPGG THAPHHPTPE WIKKISDMHL
FDKGWNTLRE TIFSNQKRLN VIPQDAKLTP WPDDLLKQWD TLTEDEKKLF IRQADVYAAY
LTYTDHEIGR VIQAVEDTGK LDNTLIIYIS GDNGSSAEGS PNGTPNEVAQ FNSVEVPVAE
QLKYFYDVWG SDKTYNHMAV GWTWAFDTPY KWTKQVASHF GGTRQGMAIS WPGHIKDVGG
IRNQFHHVID IVPTVLEAAN ISPPVMVDGI AQNPIEGVSL AYTFDKANAD TPTKHHTQYF
EMFGNRGIYH DGWYANTQPI SPPWKLGATP NPNVINSYKW ELYDLTKDWT QNEDLAAANP
AKLKEMQDLF LVEAAKYQVF PLDNSLASRM VVPRPSVTAG RTEFTYTGEL TGVPMGDAPS
LIAASYTITA EIDVPEGGAE GILATQGGRF GGWGFYLVKG KPVFTWNLLD LKRVRWENPE
ALLPGKHALV FDFKYDGLGF GTLAFNNVSG LGQGGTGTLR VDGKIVATQT MERTIPVILQ
WDETFDIGAD TGTPVDDNDY QVPFGFTGKL NKLTIKIDRP KLSPEDEKRL MQEGQRSNRM
SE