Gene Bind_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1946 
Symbol 
ID6201199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2217698 
End bp2219338 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content55% 
IMG OID641705934 
Productsulfatase 
Protein accessionYP_001833058 
Protein GI182678912 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATGA TTCGAACTTT ATGGCTTAGC CTCGTTGCGC TGGTTTCCGT CACTATGGCG 
GTGACTACCC CGGCGTCCGC GCAGCCGCAG AAACCTAACA TTCTCTTTAT CATGGGCGAT
GACATCGGCT GGTTCAACAT CGGCGCCTAC CATCAGGGCC TCATGTATTC GACGACGCCA
AATCTCGACA AGCTTGCCAC CGAAGGCATG CGTTTCACCG ATTATTACGC GGAACCGAGT
TGTACTGCGG GCCGCGCCAA TTTCATCACC GGGGAACTGC CGATCCGCAC GGGGCTGACC
ACGGTTGGTC AGGCGGGCGC CACGGTCGGT ATTCCAGACG AGGCCCCCAC GATCGCCACA
GCGCTCAAGG CGATGGGCTA TGTCACGGGC CAATTCGGCA AGAACCATTT GGGCGATTTG
AATCGCTACC TGCCGACCGT CCATGGGTTC GACGAATATT TCGGCTACCT CTATCACCTC
GACGCAATGG AGGACCCGTT TTGGCATTCC TATCCTCCTG CGTTGAAGGA TCAGGTCGGA
CCGCGCAACT TGATTCACAG CTTTGCCACG ACGACCGATG ACCCGACCGA ACAGCCTCGT
TGGGGCAAGA TCGGCAAGCA GAGGATCGAG GATGCGGGGC CGCTACCGCC GCATCCTATA
CAGGGCATCA AATACAATAT GGAAACGGTC GACGAAGACA TTCTCGACTA TTCGGTGAAG
TTCATCGACA AGGCCAAGCA GGACGGCAAG CCGTTTTTCA TGTGGGTCAA TCCCACCCGT
GCGCATGTTC TCTCGCACCT GTCGCCGAAA TATGCCGCGA AGCTGACCGG TGATAATGAA
TGGTATCTGG AAGAAGGCGT GATGGCCCAG CTTGATGACG TCGTCGGGGG CTTGTTGGCT
AAGCTTAAAG CCGAAGGGCT GGAAGATAAT ACGATCGTTG TGTTCACGAC TGACAATGGG
GCCGAGAATT TTACTTGGCC AGACGGTGGG AACACGCCAT TTGCTGCGGG CAAGGGAACG
ATCATGGAAG GTGGCATGCG TGTGCCAATG ATCATTCGCT GGCCGGGTCA TATTCCAGCA
GGAAAGGTCG AGAATGGTCT CATGTCGGGT CTGGACTTCT TCCCGACATT CGCCGCCATA
GCCGGCAATC CGAACATCAA GGAAGAGCTG CAGAAGGGCA AGCAACTCGG AGACACGACA
TACAAGGTTC ATCTCGACGG TTACAATCAG TTGGATTTTC TGACCGGCAA GGGCCCATCC
AATCGGAAAG AGATCTTCTA CTTTGCCGAG GGTACTCTTG GGGCGGTTCG CCTCGGGGAC
TGGAAATATA GAATGATCGA CCAACCCGAC GGTTGGATTG GGGGAACGGT CCACCTCGAT
ATGCCGGTCC TCAGTAATCT TCGGCTGGAT CCGTTCGAGC GCATGCAATA TCCGAAGGGC
AACATGGGCT CTTACTTCTT TTTCCCGGAT TTCTATGTCC ATGAGTTCTG GCGCTTCGTC
TTCCTTCAGC AAAAGGTTGG CGAATATGCT CAGACATTCA TCGATTTTCC GCCGATGCAA
CGGGGTGCGA GCTTCAATCT CGAAGCAGTC AAGGCCGAAA TCGCTGAACG TGTCAGGGCG
ATGAAAGGCA AGCTGGAATA G
 
Protein sequence
MEMIRTLWLS LVALVSVTMA VTTPASAQPQ KPNILFIMGD DIGWFNIGAY HQGLMYSTTP 
NLDKLATEGM RFTDYYAEPS CTAGRANFIT GELPIRTGLT TVGQAGATVG IPDEAPTIAT
ALKAMGYVTG QFGKNHLGDL NRYLPTVHGF DEYFGYLYHL DAMEDPFWHS YPPALKDQVG
PRNLIHSFAT TTDDPTEQPR WGKIGKQRIE DAGPLPPHPI QGIKYNMETV DEDILDYSVK
FIDKAKQDGK PFFMWVNPTR AHVLSHLSPK YAAKLTGDNE WYLEEGVMAQ LDDVVGGLLA
KLKAEGLEDN TIVVFTTDNG AENFTWPDGG NTPFAAGKGT IMEGGMRVPM IIRWPGHIPA
GKVENGLMSG LDFFPTFAAI AGNPNIKEEL QKGKQLGDTT YKVHLDGYNQ LDFLTGKGPS
NRKEIFYFAE GTLGAVRLGD WKYRMIDQPD GWIGGTVHLD MPVLSNLRLD PFERMQYPKG
NMGSYFFFPD FYVHEFWRFV FLQQKVGEYA QTFIDFPPMQ RGASFNLEAV KAEIAERVRA
MKGKLE