Gene Mjls_0378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_0378 
Symbol 
ID4876124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp421277 
End bp422674 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content66% 
IMG OID640137692 
Productsulfatase 
Protein accessionYP_001068682 
Protein GI126432991 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0510065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.447114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGGAC AACCACGGGT GACTCCGCAG GACCGCGCCA ACGTGCTGAT CGTCCACTGG 
CACGATCTCG GTCGCTACCT CGGCGCCTAC GGACACCCGG ACGTACAGAG CCCCCGCCTC
GACCGGTTCG CCGCCGAAAG CATCCTGTTC ACCCGCGCCC ACGCCACCGC ACCGCTGTGC
TCACCGTCGC GCGGGTCGCT GTTCACGGGC CGCTACCCGC AGAGCAACGG CCTGGTCGGA
CTGGCGCACC ACGGCTGGGA GTACCGCGCC GGCGTCCGCA CCCTACCGCA CATCTTGTCT
GAAAACGGTT GGCACACCGC ACTTTTCGGG ATGCAGCACG AGACGTCGTA TCCGCCGAAA
CTGGGGTTCG ACGAGTTCGA CGTGTCCAAC TCCTACTGCG AATACGTGGT CGAACGCGCC
ACCGGGTGGC TGCTCGACGC ACCGCAGCGC CCCTTCCTGC TCACCGCGGG ATTCTTCGAG
ACCCACCGGC CCTACCCGCG TGACCGCTAC GAACCCGCCG ACGCCACCAC CGTCGCGCTA
CCCGACTACC TTCCCGAGGA CCGGGAGGTG CGCCAGGATC TGGCCGAGTT CTACGGGTCG
ATCACCGTCG CCGACGCGGC AGTCGGCCAA CTGCTCGACA CGCTCGCGGC CACCGGACTG
GACCGCAGCA CCTGGGTGGT GTTCATGACC GACCACGGTC CGGCCCTGCC CCGGGCGAAG
TCCACGCTGT ACGACGCGGG CACCGGTATC GCGATGATCA TCCGGCCGCC GCTTGACGCC
GGCATCGCCC CCGGCGTCTA CGACGATCTG TTCAGCGGCG TCGACCTGCT ACCCACGCTG
CTCGACGTGC TCGGCGTCGA CATTCCCGGG GAGGTCGAGG GACTCTCGCA TGCCGACAAT
TTGCTGGGCG GCGCGGAGAA AACGCGGGAA GTGCGCACCG CGGTGTACAC CACGAAGACC
TATCACGATT CCTTCGACCC AATTCGGGCG ATCCGGACAA AAGAATTCAG CTATATCGAG
AATTACGCGC AACGGCCGCT GTTGGATCTG CCGTGGGACA TCGCCGAAAG CGCCCCCGGG
CGCATCGTCG GACCGCGGGC ACGCACGCCA CGGCCCGCCC GCGAACTCTA CGACCTCCGC
ACCGACCCCA CCGAGCAACA CAACCTGCTG ACGTCGGAGA ACAAGATCAA CGCCGAGGCC
GTCGCGACCG ATCTGGCGCT CCTGCTCGAC GACTGGCGGG TGAAGACCAA CGACGTCATA
CCGTCGGATT TCGCGGGTAC GCGGATATCC GACCGATACA CCGAGACATA TCTGCGAATT
CACCGGCGGG AAGTCACCAG TCGCTCGGCC ATCGCTGCGG AACGAGGCGT CAAGGGTGAG
CGCCGAACGG CGCAATGA
 
Protein sequence
MTGQPRVTPQ DRANVLIVHW HDLGRYLGAY GHPDVQSPRL DRFAAESILF TRAHATAPLC 
SPSRGSLFTG RYPQSNGLVG LAHHGWEYRA GVRTLPHILS ENGWHTALFG MQHETSYPPK
LGFDEFDVSN SYCEYVVERA TGWLLDAPQR PFLLTAGFFE THRPYPRDRY EPADATTVAL
PDYLPEDREV RQDLAEFYGS ITVADAAVGQ LLDTLAATGL DRSTWVVFMT DHGPALPRAK
STLYDAGTGI AMIIRPPLDA GIAPGVYDDL FSGVDLLPTL LDVLGVDIPG EVEGLSHADN
LLGGAEKTRE VRTAVYTTKT YHDSFDPIRA IRTKEFSYIE NYAQRPLLDL PWDIAESAPG
RIVGPRARTP RPARELYDLR TDPTEQHNLL TSENKINAEA VATDLALLLD DWRVKTNDVI
PSDFAGTRIS DRYTETYLRI HRREVTSRSA IAAERGVKGE RRTAQ