Gene Bind_0555 details


Gene Information

Locus tag: Bind_0555
Symbol:
ID: 6198516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 614747
End bp: 616207
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 56%
IMG OID: 641704546
Product: sulfatase
Protein accession: YP_001831696
Protein GI: 182677550
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
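
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below is only an illustrative cross-check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 614747
end_bp = 616207

# Inclusive coordinates: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.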


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGGTC TCTTTTCTTT CAACCGCCGC AATCTGCTCA TCGGAACGGC GGCAGCGACT 
GTTACTTCTT TGCCCCAGAT CGCACGGCCT GCAACGGGTG ATAGAGCGCC GAATATTATC
TTCATCCTTG CCGATGATCT CGGCTATGCG GATGTTTCCA TTTATGGACG GCCCGATCTC
TCCACACCTA ATATCGACGG CATCGGGCTC AAGGGAGCAC GTTTGCTTCA GGCTTACGCA
AATTCCGCTG TCTGCTCGGC GACACGGACG GCTTTGCTCA CCGGCCGCTA TCAATATCGC
GAGCGGGTGG GCCTTGAGGA GCCGATCGCC GGCAATATCC ATGTCGGCCT GCCACCGCAG
CGCCCGACCT TGCCCTCGCT TTTGAAAAAA GCAGGTTACA CCACGACTCT CATCGGTAAA
TGGCATCTCG GCACATTGCC GGACTTCGGC CCACTGCAAA GCGGCTATGA TCATTTCTAC
GGCTTTCGCG GCGGTGCCGT CGATTATTAT TCACACAAAG GCACCGATGA TCAGGACGAT
CTGTGGGATC AAGATACAAA GGTTCACCAA ACCGGTTATT TGACGGAATT GCTCGGCGAC
CGCGCCATCG AAACCATCAA CGCTTCAGCC AAAACCGGCC AGCCTTTCTT CATCAGCCTG
CATTTCAATG CGCCCCATTG GCCCTGGGAA GCGCCCGGGG ATGAAGCGGA ATCCGCGCGT
GTGGCAGGGA CGCGCCTGTT CGACTTCGAT GGCGGATCAC AAGCGACCTA TCGCGGCATG
ATCGCAGCGA TGGATCTCCA AATCGGGCGC ATTGTGCAGG CTCTGCAAGC CAATGGGATC
AGCGAGAATA CGATTGTCAT CTTCACAAGC GATAATGGCG GTGAGCGTTT TGCCGATACA
TGGCCGTTTA CCGGCCGTAA GACGGAACTA CTCGAAGGCG GATTGCGCAT CCCTGCCCTC
GTCTCTTGGC CGGCGCGGAT CAAAGCAGAT CAAACCATCG ATCAGGTCAG CATCAGCATG
GATTGGCTGC CGACTCTCTT AGCGGCCGCT GGGAGCGAAC CCGATCCAAA TTTTCCCTCC
GACGGGATTA ATCTGCTGCC TTTCCTGAGC GAAAGCAAAG CCGCTATCCC TCGCAAATTG
TTCTGGCGCT ACAAAGCCAA TGCCCAGCGC GCAGTGCGCG ATGGCGATTA CAAATATCTC
AAAATCCGGG ACAATGATTT TCTCTTCAAC GTGGTCGATG ATCCGCTGGA ACGCGTCAAT
CTGAAAGAGC GCCACAAAGA TATTTACAAT CGCCTTCTCG CCGAATGGCT CGAGTGGAAC
AGCACTATGC TACCCGAAAT CACTGAGAGC TTTACGCACG GCTTCACGGG TCACGAACTT
GCCGATCACT ATGGCGTGAC CGCACCAACC ACAGAACCTG ACAATCCCGC GCCTCTTCGG
GCGATGCGCC GCGATGATTA A
 
Protein sequence
MPGLFSFNRR NLLIGTAAAT VTSLPQIARP ATGDRAPNII FILADDLGYA DVSIYGRPDL 
STPNIDGIGL KGARLLQAYA NSAVCSATRT ALLTGRYQYR ERVGLEEPIA GNIHVGLPPQ
RPTLPSLLKK AGYTTTLIGK WHLGTLPDFG PLQSGYDHFY GFRGGAVDYY SHKGTDDQDD
LWDQDTKVHQ TGYLTELLGD RAIETINASA KTGQPFFISL HFNAPHWPWE APGDEAESAR
VAGTRLFDFD GGSQATYRGM IAAMDLQIGR IVQALQANGI SENTIVIFTS DNGGERFADT
WPFTGRKTEL LEGGLRIPAL VSWPARIKAD QTIDQVSISM DWLPTLLAAA GSEPDPNFPS
DGINLLPFLS ESKAAIPRKL FWRYKANAQR AVRDGDYKYL KIRDNDFLFN VVDDPLERVN
LKERHKDIYN RLLAEWLEWN STMLPEITES FTHGFTGHEL ADHYGVTAPT TEPDNPAPLR
AMRRDD
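
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a minimal illustration, not the database's own method: `CODON_TABLE` and `translate` are hypothetical names, and the table is the standard codon assignment, which translation table 11 shares for amino acids (the two differ in which start codons are permitted). It translates only the first and last blocks of the gene sequence shown above.

```python
from itertools import product

# Codon assignments in TCAG order (third base varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and the final line of the gene sequence, copied from the record.
# The final line starts in frame because 1440 bases (a multiple of 3) precede it.
first_block = "ATGCCCGGTC TCTTTTCTTT CAACCGCCGC"
last_block = "GCGATGCGCC GCGATGATTA A"

print(translate(first_block))
print(translate(last_block))
```

The two results correspond to the first ten and last six residues of the protein sequence; the same function can be applied to the full gene sequence for a complete comparison.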