Gene Information |
Locus tag | Bind_1946 |
Symbol | |
ID | 6201199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | - |
Start bp | 2217698 |
End bp | 2219338 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641705934 |
Product | sulfatase |
Protein accession | YP_001833058 |
Protein GI | 182678912 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
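The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for this illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len // 3 - 1


length = gene_length(2217698, 2219338)
print(length)                  # 1641
print(protein_length(length))  # 546
```

Both values match the Gene Length (1641 bp) and Protein Length (546 aa) rows of the record.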
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGATGA TTCGAACTTT ATGGCTTAGC CTCGTTGCGC TGGTTTCCGT CACTATGGCG GTGACTACCC CGGCGTCCGC GCAGCCGCAG AAACCTAACA TTCTCTTTAT CATGGGCGAT GACATCGGCT GGTTCAACAT CGGCGCCTAC CATCAGGGCC TCATGTATTC GACGACGCCA AATCTCGACA AGCTTGCCAC CGAAGGCATG CGTTTCACCG ATTATTACGC GGAACCGAGT TGTACTGCGG GCCGCGCCAA TTTCATCACC GGGGAACTGC CGATCCGCAC GGGGCTGACC ACGGTTGGTC AGGCGGGCGC CACGGTCGGT ATTCCAGACG AGGCCCCCAC GATCGCCACA GCGCTCAAGG CGATGGGCTA TGTCACGGGC CAATTCGGCA AGAACCATTT GGGCGATTTG AATCGCTACC TGCCGACCGT CCATGGGTTC GACGAATATT TCGGCTACCT CTATCACCTC GACGCAATGG AGGACCCGTT TTGGCATTCC TATCCTCCTG CGTTGAAGGA TCAGGTCGGA CCGCGCAACT TGATTCACAG CTTTGCCACG ACGACCGATG ACCCGACCGA ACAGCCTCGT TGGGGCAAGA TCGGCAAGCA GAGGATCGAG GATGCGGGGC CGCTACCGCC GCATCCTATA CAGGGCATCA AATACAATAT GGAAACGGTC GACGAAGACA TTCTCGACTA TTCGGTGAAG TTCATCGACA AGGCCAAGCA GGACGGCAAG CCGTTTTTCA TGTGGGTCAA TCCCACCCGT GCGCATGTTC TCTCGCACCT GTCGCCGAAA TATGCCGCGA AGCTGACCGG TGATAATGAA TGGTATCTGG AAGAAGGCGT GATGGCCCAG CTTGATGACG TCGTCGGGGG CTTGTTGGCT AAGCTTAAAG CCGAAGGGCT GGAAGATAAT ACGATCGTTG TGTTCACGAC TGACAATGGG GCCGAGAATT TTACTTGGCC AGACGGTGGG AACACGCCAT TTGCTGCGGG CAAGGGAACG ATCATGGAAG GTGGCATGCG TGTGCCAATG ATCATTCGCT GGCCGGGTCA TATTCCAGCA GGAAAGGTCG AGAATGGTCT CATGTCGGGT CTGGACTTCT TCCCGACATT CGCCGCCATA GCCGGCAATC CGAACATCAA GGAAGAGCTG CAGAAGGGCA AGCAACTCGG AGACACGACA TACAAGGTTC ATCTCGACGG TTACAATCAG TTGGATTTTC TGACCGGCAA GGGCCCATCC AATCGGAAAG AGATCTTCTA CTTTGCCGAG GGTACTCTTG GGGCGGTTCG CCTCGGGGAC TGGAAATATA GAATGATCGA CCAACCCGAC GGTTGGATTG GGGGAACGGT CCACCTCGAT ATGCCGGTCC TCAGTAATCT TCGGCTGGAT CCGTTCGAGC GCATGCAATA TCCGAAGGGC AACATGGGCT CTTACTTCTT TTTCCCGGAT TTCTATGTCC ATGAGTTCTG GCGCTTCGTC TTCCTTCAGC AAAAGGTTGG CGAATATGCT CAGACATTCA TCGATTTTCC GCCGATGCAA CGGGGTGCGA GCTTCAATCT CGAAGCAGTC AAGGCCGAAA TCGCTGAACG TGTCAGGGCG ATGAAAGGCA AGCTGGAATA G
|
Protein sequence | MEMIRTLWLS LVALVSVTMA VTTPASAQPQ KPNILFIMGD DIGWFNIGAY HQGLMYSTTP NLDKLATEGM RFTDYYAEPS CTAGRANFIT GELPIRTGLT TVGQAGATVG IPDEAPTIAT ALKAMGYVTG QFGKNHLGDL NRYLPTVHGF DEYFGYLYHL DAMEDPFWHS YPPALKDQVG PRNLIHSFAT TTDDPTEQPR WGKIGKQRIE DAGPLPPHPI QGIKYNMETV DEDILDYSVK FIDKAKQDGK PFFMWVNPTR AHVLSHLSPK YAAKLTGDNE WYLEEGVMAQ LDDVVGGLLA KLKAEGLEDN TIVVFTTDNG AENFTWPDGG NTPFAAGKGT IMEGGMRVPM IIRWPGHIPA GKVENGLMSG LDFFPTFAAI AGNPNIKEEL QKGKQLGDTT YKVHLDGYNQ LDFLTGKGPS NRKEIFYFAE GTLGAVRLGD WKYRMIDQPD GWIGGTVHLD MPVLSNLRLD PFERMQYPKG NMGSYFFFPD FYVHEFWRFV FLQQKVGEYA QTFIDFPPMQ RGASFNLEAV KAEIAERVRA MKGKLE
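The relationship between the two sequence fields can be illustrated with a minimal sketch that translates the gene sequence and computes its GC percentage. It assumes the gene sequence is displayed in coding orientation, which seems to be the case here because it begins with ATG and ends with TAG even though the gene lies on the minus strand, so no reverse complement is applied. The codon table is built by hand rather than taken from a library. Its amino-acid assignments are those of the standard genetic code, which translation table 11 shares (the two tables differ only in permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest) mapped to
# the amino-acid assignments shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding-orientation CDS, dropping one terminal stop if present."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-letter blocks of the gene sequence in this record.
print(translate_cds("ATGGAGATGA TTCGAACTTT ATGGCTTAGC"))  # MEMIRTLWLS
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.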