Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_3535 |
Symbol | |
ID | 5151559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 3689431 |
End bp | 3691710 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640558388 |
Product | putative arylsulfatase |
Protein accession | YP_001239534 |
Protein GI | 148254949 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.441448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGG CAGCATTCGC CGGCGCCATC GGCAGAACCG TCGCGAGCTC CAAACCCTGG TGGCCCGCGC CGCCGCGGCC GCCGGCGGGC GCGCCGAACA TCCTTGTGGT GCTGTTCGAC GATGTCGGAT TCTCCGATTT CGGCTGCTAC GGCTCGCCGA TCCGCACCCC TGCGATCGAC GCGCTGGCCG CGCAGGGGCT GCGTTATTCC GGCTTCCACA CCACGGCGAT GTGCTCAACG ACGCGGGCGG CGCTCCTGAC AGGCCGCAAT CATCATTCGG TCGGCGTCGG CTGTCTCGCC AATTTCGACT CTGGGTATCC TGGTTATCGC GGCAAGATCG CGCGCGAGGC GGGGACCATC GCCGAGATGC TGCGTGCGCA TGCCTATCGC AACTACATGG TCGGCAAGTG GCATGTGACG CCGTTGACCG AGAGCGGCGC GACCGGGCCG TTCGACGGCT GGCCGCTCGG CCGTGGCTTC GACCGCTTCT ACGGCTTCCT CGATGCCGAG ACCGATCAAT ATGCGCCGGA GCTCGTGCTC GACAACACGC ATATCTCGCC GCCCGGCAGC TTCGCCGACG GCTATCATCT GACCGCCGAC CTCATCGACC AGGCCATCCG CTTCATCGCC GACCACACGG CCGATCGGCC TGACCTGCCG TGGCTGACCT GGCTCGCCCT CGGCGCCTGC CACGCGCCGC ACCAGGCGCC GCGCGACATC ATCGAGAGCT ATGATGCGGT CTTCGCGCAT GGCTGGGACG TCGAGCGCGC GCAACGCCTG GCGCGGCAGA AGGCGATGGG GCTGGTGCCG GAGACGACGG ACCTGCCGCC GCGCAATGAC GGCGTGAAGG CCTGGGAGAC GCATTCGGAC GAGGAGCGGC GCGTCTTCAC CCGGCTGCAA TCGGCCTTCG CCGGCATGCT CGATCATGCC GACCAGCATC TGGCGCGGTT GCTGGCGTTC CTCGACAGGA CCGGCCAACG TGACAACACG CTGGTGATCG TGATGTCCGA CAATGGCGCC AGCCAGGAGG GCGGACCGCT CGGCTTCGTC AACGCCATGG GACCGTTCAA CTTCAAGCCG GAGCCGATCG CCGAGAAACT CGCCCGCATC GACGACATCG GCGGCCCCGA CACCCATTCG AATTTCCCGC ATGGCTGGGC GATGGCGTCC AACACGCCGC TCCGGCGCTA CAAGCAGAAC ACCCATGGCG GCGGCATCCG CGATCCCTTC ATCCTGTCAT GGCCGCAACG CATCGCCGCG CAGGGCGAAT TGCGCCACCA ATTCGTCCAT GCCAGCGACC TGGTGCCGAC CCTGCTCGAC CTGATCGGCA TCGCGGCGCC GGCCACGATC GCAGGCGTGC CGCAGATGCC GCTGGAGGGC GTCAGCTTCG CGGCGTCGAT CGCGGATGCG ACGGCGCCAT CGAAGCCGGT GCCGCAATAT TTCGAGATGT TCGGGCATCG CGGCCTCTGG CACGACGGCT GGAAGGCGGT CGCCTTTCAT CCCCCGGGCA CGCCGTTCGA CAACGACAAA TGGGAGCTGT TTCATCTCGC CGAGGATTTC TCCGAGACGC ATGATCTCGC CGCCGCCGAG CCGGAGCGGC TCGCGGCGCT GGTCAAGCTG TGGTGGGAGC AGGCGGAGGC GCATCAGGTG CTGCCGCTCG ATGACCGCTT CGGCCCGCGT TTCGCCGAGA ACGCCGCACG CTTCCACGGC GCCCGCACGC GCTTCGTGTT TCACGCCGGC ATGGGCCATG TGCCGACCGA CGTCGCACCC GACGTGCGCA GCCGCGACTA CCTGATCGAG GCCCATGTCG AGATCGGACC GGAGGGCGCC GAGGGCGTCC TGATCGCGCA TGGCGATGCC ACATCGGGCT ACAGCCTCTA CGTCAAGGAC GGCCATCTCG TGCACGATCT CAACATCGGT GGCCGGCACG AGATCGTCAG CTCGACTCGC GCGGTGCCGG CGGGAGCGCA TCGCCTCGGC CTGCGGGTCG AGCGTCTCCG GCGCGAGAGC GAGCCTGCCA AGGGCGCGCG CACCGGCTTC AGCCAATACA CCTTGCTGAT CGACGGCGAA GAGGTCGGCG CGCTGACGAC GCAGCTCGCC TTTCACACGC TGATCTCATG GTCGGGACTC GACATCGGCC ATGACCGCGG CAGCCCGGTG TCAGACTACA CGGCGCCGTT CGCGTTCACC GGCCGGCTGT CGCATGTGAC CGTTACAATG CAGAACGAGC AGAGCCTGGA TGGCGACAGT GTGGGGCGGG CCGAGATGGC GCGGCAATAA
|
Protein sequence | MSEAAFAGAI GRTVASSKPW WPAPPRPPAG APNILVVLFD DVGFSDFGCY GSPIRTPAID ALAAQGLRYS GFHTTAMCST TRAALLTGRN HHSVGVGCLA NFDSGYPGYR GKIAREAGTI AEMLRAHAYR NYMVGKWHVT PLTESGATGP FDGWPLGRGF DRFYGFLDAE TDQYAPELVL DNTHISPPGS FADGYHLTAD LIDQAIRFIA DHTADRPDLP WLTWLALGAC HAPHQAPRDI IESYDAVFAH GWDVERAQRL ARQKAMGLVP ETTDLPPRND GVKAWETHSD EERRVFTRLQ SAFAGMLDHA DQHLARLLAF LDRTGQRDNT LVIVMSDNGA SQEGGPLGFV NAMGPFNFKP EPIAEKLARI DDIGGPDTHS NFPHGWAMAS NTPLRRYKQN THGGGIRDPF ILSWPQRIAA QGELRHQFVH ASDLVPTLLD LIGIAAPATI AGVPQMPLEG VSFAASIADA TAPSKPVPQY FEMFGHRGLW HDGWKAVAFH PPGTPFDNDK WELFHLAEDF SETHDLAAAE PERLAALVKL WWEQAEAHQV LPLDDRFGPR FAENAARFHG ARTRFVFHAG MGHVPTDVAP DVRSRDYLIE AHVEIGPEGA EGVLIAHGDA TSGYSLYVKD GHLVHDLNIG GRHEIVSSTR AVPAGAHRLG LRVERLRRES EPAKGARTGF SQYTLLIDGE EVGALTTQLA FHTLISWSGL DIGHDRGSPV SDYTAPFAFT GRLSHVTVTM QNEQSLDGDS VGRAEMARQ
|
| |