Gene Dshi_3111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3111 
SymbolaslA 
ID5710963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3277491 
End bp3279092 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content63% 
IMG OID641269038 
Productarylsulfatase precursor 
Protein accessionYP_001534445 
Protein GI159045651 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.513855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAACT TAGGAAGGCT GCGGCGCTGC GCGCTGGGGG CGGTCTTGCT GGGCATTTCG 
GCAAGTGTCG CCGCCGCCCA GGAGACGGAC AAGCCCAACA TCCTGGTTAT CTGGGGCGAC
GATGTGGGCC AGTCGAACAT CTCGGCCTAC ACGATGGGTC TGATGGGATA CGAAACGCCC
AATATCGACC GGATCGCCGA AGAAGGCATG ATCTTCACCG ACTATTATGG CGAGCAGTCC
TGCACCGCGG GCCGGTCCTC CTATATCATG GGCCAGTCAG TGTTCCGCAC GGGCCTCTCC
AAGGTCGGTC TGCCCGGTGC CGAAGAAGGC ATGCAGGTCG AAGACCCAAC CATCGCGGGC
CTGCTGAAGG CCCAAGGCTA CGCAACCGGC CAGTTCGGCA AGAACCACCT GGGCGACCGG
GATGAGCATC TGCCGACCAA CCACGGCTTC GACGAGTTCT TCGGCAACCT CTATCACCTG
AACGCCGAGG AAGAGCCCGA GAACGAGGAC TACCCGGGCG ATCTCGTGCT CGAAGACGGC
CGCACCTTCC GCGAGGCGTT CGGGCCCCGC GGCGTGATCA AGTCCTCCGC CGACGGTACG
ATCGAAGACA CCGGCCCCCT GACCAAGGAG CGGATGGAAA CCGTGGACGA CGAGACCGTC
GCCGCGGCGA TCGACTTCAT CAAGCGCCAG GAAGAGGCGG GCAACCCCTG GTTCGTCTGG
TGGTCGGGCA CCCGGATGCA CTTCCGCACC CATGTCAGCG ACGAGCGTCG CCAGATGGCC
AACGAAATCG TCGGCAAGTC GGTGGACGAA TACACCGCCG GCATGATCGA ACATGACATG
CATATCGGTC AGTTCCTCGA CCTGCTGGAC GAGCTCGGCA TCGCCGACGA GACCATCGTG
CATTACTCCA CCGACAATGG CCCGCACATG AACACATGGC CCGATGCCGC CATGACGCCG
TTCTGGGGTG AGAAGAACAC CCAGTGGGAA GGCGCATGGC GCGTGCCCTC CATGGTCCGC
TGGCCCGGCC TGATCGAACC CGGCTCCGTG TCGAACTCGA TCATGCACCA CATGGACTGG
CTGCCCACCT ACCTGGCCGC AGCCGGGCGT CCGAACATCA AGGAAGAACT TCTCGACGGT
ATAACCGTGG CCGAGGTCGG CGGCGGACGC GATTACCGCG TGCATCTGGA TGGCTATAAC
TTCCTGCCCT ATTTCGCGGG CGAAGTTGAC ACCGGCCCCC GGCAGGAGAT CTTCTACTTC
ACCGATGACG GGGATCTTGC GGCCCTGCGC TTCGGCGACT GGAAGATCAC CTTCCTGGAG
CAGAAGGAAT GGGCGACTCT GCGCGCCTGG ATGGAGCCTC TGACGCCGCT GCGGGTGCCG
CTCATCGCCA ACCTGCGCCG CGACCCCTAT GAGCGCGGGT ATCGCACGTC GAACACCTAT
TACGACTGGA TGCTCGACCG GGCCTACATG CTGGTGCCCG CGCAAGCCTA CGTCGCGGAC
TTCCTGGAAA CCTTCCAGGA GTATCCACCC CGGCAGGAAG CCGCCTCCTT CAGCCTCGAC
AAGGTGATGG AGAAGCTGAC CGCACCCAGC GGCGCGCGCT AA
 
Protein sequence
MVNLGRLRRC ALGAVLLGIS ASVAAAQETD KPNILVIWGD DVGQSNISAY TMGLMGYETP 
NIDRIAEEGM IFTDYYGEQS CTAGRSSYIM GQSVFRTGLS KVGLPGAEEG MQVEDPTIAG
LLKAQGYATG QFGKNHLGDR DEHLPTNHGF DEFFGNLYHL NAEEEPENED YPGDLVLEDG
RTFREAFGPR GVIKSSADGT IEDTGPLTKE RMETVDDETV AAAIDFIKRQ EEAGNPWFVW
WSGTRMHFRT HVSDERRQMA NEIVGKSVDE YTAGMIEHDM HIGQFLDLLD ELGIADETIV
HYSTDNGPHM NTWPDAAMTP FWGEKNTQWE GAWRVPSMVR WPGLIEPGSV SNSIMHHMDW
LPTYLAAAGR PNIKEELLDG ITVAEVGGGR DYRVHLDGYN FLPYFAGEVD TGPRQEIFYF
TDDGDLAALR FGDWKITFLE QKEWATLRAW MEPLTPLRVP LIANLRRDPY ERGYRTSNTY
YDWMLDRAYM LVPAQAYVAD FLETFQEYPP RQEAASFSLD KVMEKLTAPS GAR