Gene Mflv_5039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_5039 
Symbol 
ID4976350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp5357697 
End bp5360048 
Gene Length2352 bp 
Protein Length783 aa 
Translation table11 
GC content66% 
IMG OID640459266 
Productsulfatase 
Protein accessionYP_001136293 
Protein GI145225615 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0252328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.170532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACGG AGTTCAACGG CAAGATCGCG CTCGACATCC GCGATTCCGA ACCCGACTGG 
GGCCCGTTCG CGGCGCCGAC CGCGCAACCC GAGGCGCCCA ACGTGCTCTA CCTCGTCTGG
GACGACATCG GCATCGCGAC CTGGGACTGC TTCGGTGGGC TCGTCGACAT GCCCGCCATG
AGCCGCATCG CCGAACGCGG GGTGCGGCTT TCGCAGTTCC ACACCACGGC GCTGTGCTCG
CCGACCCGGG CCTCGCTGCT GACCGGACGC AACGCGACCA CCGTCGGCAT GGCCACGATC
GAGGAGTTCA CCGACGGTTT CCCGAACTGC AACGGTCGGA TCCCGTTCGA CACCGCTCTG
CTCTCGGAGG TGCTTTCCGA GAACGGCTAC AACACCTACT GCATCGGCAA GTGGCATCTG
ACGCCGTTGG AGGAATCCAA TCTGGCGGCC ACCAAGAGGC ACTGGCCGCT GTCGAGGGGG
TTCGAGAGGT TCTACGGGTT CATGGGCGGC GAAACCGACC AGTGGTATCC GGATCTGATG
TACGACAACC ACCCCGTCGC GCCTCCCGCA ACCCCGGAGG AGGGCTACCA CCTCTCGAAA
GACCTGGCGG ACAAGACGAT CGAATTCATC CGCGATTCCA AGGTCATCGC GCCGGACAAG
CCGTGGTTCT CGTATGTGTG CCCCGGGGCG GGACATGCGC CGCACCACGT CTTCAAGGAG
TGGGCGGACC GTTACGCCGG ACGTTTCGAC ATGGGCTATG AGGCCTACCG GGAGATCGTG
CTGGAGAACC AGAAGCGCCT CGGCCTGGTG CCGCCGGACA CCGAATTGTC GGCCGTCAAC
CCCTATCTGG ACGTCAAAGG TCCCGACGGC CAGGACTGGC CGGCGCAGGA CACCGTGCGA
CCGTGGGATT CGCTGAGCGA GGACGAGAAG CGGCTGTTCG CGCGGATGGC CGAGGTGTTC
GCCGGATTCC TGTCCTACAC CGACGCCCAG ATCGGCCGCG TGCTCGACTA TCTCGAAGAG
TCCGGCCAGC TCGACAACAC CGTCATCGTG GTCATCTCCG ACAACGGTGC CAGCGGCGAG
GGCGGCCCGA ACGGTTCGGT CAACGAGGTC AAGTTCTTCA ACGGCTACAT CGACTCCGCG
GAGGAGAGCC TGAAGGTCTT CGACGAACTC GGTGGCCCGC AGACCTACAA CCATTACCCG
ATCGGCTGGG CGATGGCGTT CAACACCCCG TACAAGCTGT TCAAGCGCTA CGCCTCGCAC
GAGGGCGGGA TCGCCGACAC CGCAATCATC TCCTGGCCCA AGGGGATTGC CGCGCACGGT
GAGATCAGGG ACAACTACGT CAACGTCGCC GACATCACGC CCACGGTGTA CGACCTGCTC
GACATCACGC CGCCGGCCAC GGTGCGCGGG GTCGCGCAGA AGCCGCTGGA CGGCGTGAGT
TTCAAAGTGG CGCTGGAGAA TCCGAACGCG CCCACCGGCA AGGAGACACA GTTCTACACG
ATGCTGGGCA CCCGCGGCAT CTGGCACCGG GGGTGGTTCG CCAACACCGT GCACGCGGCG
TCCCCGGCCG GCTGGTCTCA CTTCGACGAC GACCGGTGGG AGCTCTACCA CGTCGACGCC
GACCGCAGCC AGGTTCACGA CCTGGCCGCG GAGTACCCGG AAAAACTCGA CGAGCTCAAG
GCGCTGTGGT TCTCCGAGGC GCAGAAGTAC AACGGGCTGC CGCTCGGTGA TCTCGACATC
TTCGAGACGA TGTCGCGGTG GCGGCCGACG CTGTCGGGGG AACGATCCGC ATACGTGTAC
TACCCGGGGA CGGCCGATGT CGGCATCGGC GCGGTGGTGG AGCTGCGCGG GCGGTCCTTC
GCGGTGCTGG CCGAGGTCGA GGTCGACCCG GGCGGTGCCA ATGGTGTGGT CGTGAAACAC
GGTGGGGCGC ACGGCGGTTA CGTGATGTAC CTGCAGGGCG GACGGCTGCA CTTCTGCTAC
AACTTCCTCG GCGAGTACGA GCAGACACTG GCCGCGCCGG ATCCGGTCCC GCCCGGCCTG
CACACGCTCG GTTTCACCTA CACCGTCACC GGCACCGCCG AGGGCAGCCA CACCCCGATC
GGTGACGCCG AGCTGTTCCT CGACACCGAG CGGGTGGCCA GCCTCGCAGA GATGCGTTCG
CAGCCGGGCA CTTTCGGTCT GGCCGGCGCG AGCCTGAGCG TGGGCCGCAA CAACGGCTCG
CCGGTGTCGG AGGCCTACCA CCCGCCGTTC GCGTTCGCCG GCGGCCGAAT CGCGCGGGTG
AACATCGACA CCTCGGGCGC GCCGTACGTC GATCTGGAAC GCGACTTCGC CCGGGCGTTC
GCCAGGGACT GA
 
Protein sequence
MTTEFNGKIA LDIRDSEPDW GPFAAPTAQP EAPNVLYLVW DDIGIATWDC FGGLVDMPAM 
SRIAERGVRL SQFHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC NGRIPFDTAL
LSEVLSENGY NTYCIGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPDLM
YDNHPVAPPA TPEEGYHLSK DLADKTIEFI RDSKVIAPDK PWFSYVCPGA GHAPHHVFKE
WADRYAGRFD MGYEAYREIV LENQKRLGLV PPDTELSAVN PYLDVKGPDG QDWPAQDTVR
PWDSLSEDEK RLFARMAEVF AGFLSYTDAQ IGRVLDYLEE SGQLDNTVIV VISDNGASGE
GGPNGSVNEV KFFNGYIDSA EESLKVFDEL GGPQTYNHYP IGWAMAFNTP YKLFKRYASH
EGGIADTAII SWPKGIAAHG EIRDNYVNVA DITPTVYDLL DITPPATVRG VAQKPLDGVS
FKVALENPNA PTGKETQFYT MLGTRGIWHR GWFANTVHAA SPAGWSHFDD DRWELYHVDA
DRSQVHDLAA EYPEKLDELK ALWFSEAQKY NGLPLGDLDI FETMSRWRPT LSGERSAYVY
YPGTADVGIG AVVELRGRSF AVLAEVEVDP GGANGVVVKH GGAHGGYVMY LQGGRLHFCY
NFLGEYEQTL AAPDPVPPGL HTLGFTYTVT GTAEGSHTPI GDAELFLDTE RVASLAEMRS
QPGTFGLAGA SLSVGRNNGS PVSEAYHPPF AFAGGRIARV NIDTSGAPYV DLERDFARAF
ARD