Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_2206 |
Symbol | |
ID | 6199474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | + |
Start bp | 2529247 |
End bp | 2531775 |
Gene Length | 2529 bp |
Protein Length | 842 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641706196 |
Product | sulfatase |
Protein accession | YP_001833314 |
Protein GI | 182679168 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0228911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.614416 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGGAG TGACACGGAG CAAAGTGCGA ACGAGCCTAT CAGCGATCTT TCTTACGCTG ACGCTCATAT CGGTGGTGCC TGCGGCCGCT CAACAGCGAG AAGGCGTACC CGGCTCCCCA AGTGCAACAA CGACGATCGA TGGGCGATAT TTGCCACCCC CTCCACCGCC ATTTGGCGGT GAAATTAATT TAAACGCCGT GCAATCAAAA CCTTATTGGC CAGCGCGCGT TGTGCCCCCG AAGGGAGCCC CGAATGTCTT GCTGATTATG ACCGATGACG TCGGTTTTGG CGCTCCGAGC ACTTTTGGAG GGGTCATTCC CACTCCGGCT CTCGACCGCA TTGCCAATGC TGGCTTGCGT TATACGCAAT TTCACTCGAC AGCCCTCTGC TCACCAACAC GCGCCGCGCT TATTACCGGG CGCAACCATC ATTCGGTGGG TTTCGGCCAA ATTTCAGAAA TGTCCACAGG TTTCCCCGGT TATGACAGTA TCATCACCAG AGATACAGCG ACGATCGGTC GAATTCTGAA AGACAATGGC TACGCGACCT CGTGGTTCGG CAAGAACCAT AACACTCCAG CCTTTCAGGC GAGCCAAGCG GGACCTTTCG AGCAATGGCC CATCGGTATG GGCTTTGAAT ATTTCTACGG GTTTGTTGGT GGTGATACCA GCCAGTGGCA GCCAAATCTG TTCCGCCAAA CCACGGCAAT CTATCCCTAT ATCGGTCATC CCGACTGGAA CCTGACCACG GCCATGGCCG ATGATGCCAT CGCTCATATC AAAATGCTGA ACGAAGTCGA TCCCACTAAA CCATTCTTTG TTAATTATAC CCCCGGGGGC ACCCACGCAC CACACCACCC GACGCCCGAA TGGATCAAAA AGATCAGTGA CATGCATCTA TTCGACAAGG GCTGGAACAC ACTCCGCGAG ACGATTTTCT CTAACCAAAA GCGGCTCAAT GTTATCCCAC AAGATGCCAA GCTTACGCCC TGGCCTGATG ATTTGCTGAA GCAATGGGAT ACACTTACAG AGGACGAGAA AAAGCTGTTC ATTCGTCAAG CCGACGTCTA CGCCGCTTAT CTTACCTATA CCGACCATGA GATCGGCCGG GTGATTCAAG CGGTCGAGGA TACCGGCAAG CTCGACAATA CATTGATTAT TTACATCAGC GGCGACAATG GTTCCAGTGC GGAGGGATCA CCCAATGGCA CACCTAATGA AGTCGCGCAA TTCAACAGCG TCGAGGTGCC GGTAGCAGAA CAGCTAAAGT ATTTCTACGA CGTCTGGGGT TCGGACAAAA CTTATAATCA CATGGCAGTG GGTTGGACCT GGGCTTTCGA CACGCCCTAT AAATGGACGA AACAAGTGGC ATCGCATTTC GGCGGCACAC GGCAAGGCAT GGCCATCAGT TGGCCGGGCC ACATTAAAGA TGTCGGAGGC ATCCGTAATC AGTTTCATCA CGTCATCGAT ATTGTCCCGA CCGTTTTGGA AGCAGCGAAT ATCTCGCCTC CGGTGATGGT CGATGGCATC GCCCAAAATC CGATCGAAGG CGTGAGCTTA GCCTATACTT TCGATAAAGC GAACGCCGAC ACCCCCACCA AGCATCACAC CCAATATTTC GAGATGTTCG GTAATCGCGG CATCTATCAC GATGGCTGGT ACGCCAATAC ACAACCGATC AGCCCGCCGT GGAAACTCGG CGCCACACCA AACCCAAACG TCATAAACAG CTATAAATGG GAGCTTTACG ATCTCACCAA GGATTGGACG CAGAACGAAG ATCTCGCCGC CGCCAATCCC GCCAAGCTGA AAGAGATGCA GGATCTTTTC CTGGTCGAAG CCGCCAAATA TCAGGTCTTC CCGCTCGACA ATTCACTCGC TTCACGAATG GTCGTGCCGA GGCCCAGCGT TACGGCCGGA AGAACTGAAT TCACTTATAC AGGCGAATTG ACCGGCGTCC CGATGGGCGA TGCTCCAAGT TTGATTGCGG CTTCCTATAC GATCACAGCA GAGATCGATG TCCCTGAAGG TGGCGCCGAG GGAATACTCG CGACCCAAGG AGGACGCTTC GGCGGATGGG GGTTCTATCT TGTGAAAGGC AAACCCGTCT TTACCTGGAA CCTGCTCGAT CTCAAACGCG TTCGTTGGGA AAATCCGGAA GCCCTTTTAC CAGGAAAACA TGCTCTGGTC TTCGACTTCA AATATGACGG CCTCGGTTTT GGGACGTTGG CCTTCAACAA TGTGAGCGGC CTTGGCCAGG GGGGCACGGG CACGCTCCGG GTCGATGGGA AAATCGTGGC CACACAAACG ATGGAACGAA CGATTCCGGT TATTCTTCAA TGGGATGAGA CCTTTGACAT TGGGGCCGAT ACCGGTACGC CTGTTGATGA TAATGACTAT CAAGTACCGT TCGGATTTAC TGGTAAACTC AACAAACTGA CGATCAAGAT CGATCGACCG AAATTATCGC CAGAAGACGA GAAACGTCTC ATGCAAGAAG GACAACGCAG CAATCGCATG AGCGAATAG
|
Protein sequence | MRGVTRSKVR TSLSAIFLTL TLISVVPAAA QQREGVPGSP SATTTIDGRY LPPPPPPFGG EINLNAVQSK PYWPARVVPP KGAPNVLLIM TDDVGFGAPS TFGGVIPTPA LDRIANAGLR YTQFHSTALC SPTRAALITG RNHHSVGFGQ ISEMSTGFPG YDSIITRDTA TIGRILKDNG YATSWFGKNH NTPAFQASQA GPFEQWPIGM GFEYFYGFVG GDTSQWQPNL FRQTTAIYPY IGHPDWNLTT AMADDAIAHI KMLNEVDPTK PFFVNYTPGG THAPHHPTPE WIKKISDMHL FDKGWNTLRE TIFSNQKRLN VIPQDAKLTP WPDDLLKQWD TLTEDEKKLF IRQADVYAAY LTYTDHEIGR VIQAVEDTGK LDNTLIIYIS GDNGSSAEGS PNGTPNEVAQ FNSVEVPVAE QLKYFYDVWG SDKTYNHMAV GWTWAFDTPY KWTKQVASHF GGTRQGMAIS WPGHIKDVGG IRNQFHHVID IVPTVLEAAN ISPPVMVDGI AQNPIEGVSL AYTFDKANAD TPTKHHTQYF EMFGNRGIYH DGWYANTQPI SPPWKLGATP NPNVINSYKW ELYDLTKDWT QNEDLAAANP AKLKEMQDLF LVEAAKYQVF PLDNSLASRM VVPRPSVTAG RTEFTYTGEL TGVPMGDAPS LIAASYTITA EIDVPEGGAE GILATQGGRF GGWGFYLVKG KPVFTWNLLD LKRVRWENPE ALLPGKHALV FDFKYDGLGF GTLAFNNVSG LGQGGTGTLR VDGKIVATQT MERTIPVILQ WDETFDIGAD TGTPVDDNDY QVPFGFTGKL NKLTIKIDRP KLSPEDEKRL MQEGQRSNRM SE
|
| |