Gene Mmcs_0390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0390 
Symbol 
ID4109236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp437304 
End bp438701 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content66% 
IMG OID638029515 
Productsulfatase 
Protein accessionYP_637567 
Protein GI108797370 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACAGGAC AACCACGGGT GACTCCGCAG GACCGCGCCA ACGTGCTGAT CGTCCACTGG 
CACGATCTCG GTCGCTACCT CGGCGCCTAC GGACACCCGG ACGTACAGAG CCCCCGCCTC
GACCGGTTCG CCGCCGAAAG CATCCTGTTC ACCCGCGCCC ACGCCACCGC ACCGCTGTGC
TCACCGTCGC GCGGGTCGCT GTTCACGGGC CGCTACCCGC AGAGCAACGG CCTGGTCGGA
CTGGCGCACC ACGGCTGGGA GTACCGCGCC GGCGTCCGCA CCCTACCGCA CATCTTGTCT
GAAAACGGTT GGCACACCGC ACTTTTCGGG ATGCAGCACG AGACGTCGTA TCCGCCGAAA
CTGGGGTTCG ACGAGTTCGA CGTGTCCAAC TCCTACTGCG AATACGTGGT CGAACGCGCC
ACCGGGTGGC TGCTCGACGC ACCGCAGCGC CCCTTCCTGC TCACCGCGGG ATTCTTCGAG
ACCCACCGGC CCTACCCGCG TGACCGCTAC GAACCCGCCG ACGCCACCAC CGTCGCGCTA
CCCGACTACC TTCCCGAGGA CCGGGAGGTG CGCCAGGATC TGGCCGAGTT CTACGGGTCG
ATCACCGTCG CCGACGCGGC AGTCGGCCAA CTGCTCGACA CGCTCGCGGC CACCGGACTG
GACCGCAGCA CCTGGGTGGT GTTCATGACC GACCACGGTC CGGCCCTGCC CCGGGCGAAG
TCCACGCTGT ACGACGCGGG CACCGGTATC GCGATGATCA TCCGGCCGCC GCTTGACGCC
GGCATCGCCC CCGGCGTCTA CGACGATCTG TTCAGCGGCG TCGACCTGCT ACCCACGCTG
CTCGACGTGC TCGGCGTCGA CATTCCCGGG GAGGTCGAGG GACTCTCGCA TGCCGACAAT
TTGCTGGGCG GCGCGGAGAA AACGCGGGAA GTGCGCACCG CGGTGTACAC CACGAAGACC
TATCACGATT CCTTCGACCC AATTCGGGCG ATCCGGACAA AAGAATTCAG CTATATCGAG
AATTACGCGC AACGGCCGCT GTTGGATCTG CCGTGGGACA TCGCCGAAAG CGCCCCCGGG
CGCATCGTCG GACCGCGGGC ACGCACGCCA CGGCCCGCCC GCGAACTCTA CGACCTCCGC
ACCGACCCCA CCGAGCAACA CAACCTGCTG ACGTCGGAGA ACAAGATCAA CGCCGAGGCC
GTCGCGACCG ATCTGGCGCT CCTGCTCGAC GACTGGCGGG TGAAGACCAA CGACGTCATA
CCGTCGGATT TCGCGGGTAC GCGGATATCC GACCGATACA CCGAGACATA TCTGCGAATT
CACCGGCGGG AAGTCACCAG TCGCTCGGCC ATCGCTGCGG AACGAGGCGT CAAGGGTGAG
CGCCGAACGG CGCAATGA
 
Protein sequence
MTGQPRVTPQ DRANVLIVHW HDLGRYLGAY GHPDVQSPRL DRFAAESILF TRAHATAPLC 
SPSRGSLFTG RYPQSNGLVG LAHHGWEYRA GVRTLPHILS ENGWHTALFG MQHETSYPPK
LGFDEFDVSN SYCEYVVERA TGWLLDAPQR PFLLTAGFFE THRPYPRDRY EPADATTVAL
PDYLPEDREV RQDLAEFYGS ITVADAAVGQ LLDTLAATGL DRSTWVVFMT DHGPALPRAK
STLYDAGTGI AMIIRPPLDA GIAPGVYDDL FSGVDLLPTL LDVLGVDIPG EVEGLSHADN
LLGGAEKTRE VRTAVYTTKT YHDSFDPIRA IRTKEFSYIE NYAQRPLLDL PWDIAESAPG
RIVGPRARTP RPARELYDLR TDPTEQHNLL TSENKINAEA VATDLALLLD DWRVKTNDVI
PSDFAGTRIS DRYTETYLRI HRREVTSRSA IAAERGVKGE RRTAQ