Gene Mflv_5494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_5494 
Symbol 
ID4976885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009339 
Strand
Start bp232563 
End bp234905 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content66% 
IMG OID640459720 
Productsulfatase 
Protein accessionYP_001136743 
Protein GI145226089 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.309233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.572846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGTCTG AGCGGCTGGA TGGTCAGATT GTGGCGCCGG TGTTGCCCGA CGGCGGGGTG 
TTGCCATTCC CACCGACGCC GTCGGGCAGT ATCGCCGGCC GCACGCTGCA GGAATCGACG
TACCAGCCGC GCGCGATCCC CAAGCATCTG CATGAGGATT CGCCGAACAT CGTGATCGTG
TTGATCGACG ACGCCGGGCC CGGTTTGCCG TCGACGTTCG GCGGTGAGGT CACCACCGCG
ACACTGGACC GGATCTGCGG CGAAGGCGTG TCCTACAACC GCTTTCACAC CACGGCGATG
TGTTCGCCGA CCCGGGCGTC GCTGCTGACG GGCCGCAATC ATCACGAGAT CGGCAACGGT
CAGATCGCGG AGCTGGCCAA CGACTGGGAC GGCTACGCCG GCAAGATCCC GCGCTCCAGT
GCGACCGTGG CCGAGGTGCT CAAGCAGTAC GGCTACGCGA CGTCGGCGTT CGGGAAGTGG
CACAACACCC CGGCCGAGGA AACGACCGCG GCGGGTCCGT TCGAGAACTG GCCCACCGGG
CTGGGTTTCG AGTACTTCTA CGGCTTCCTC GCCGGTGAGG CCTCACAGTA CGAGCCGAAC
CTGGTGCGCA ACACCACCGT GGTCGCCCCG CCGAAGACTC CGGAGGAGGG CTATCACCTG
TCGGAGGATC TGGCCGATGA CGCCATCTCC TGGCTGCGCC GGCACAAGGC GTTCAACGCC
GATAAGCCGT TCATGATGTA TTGGGCCAGC GGCTGCCTGC ACGGCCCGCA CCACATCATG
AAGGACTGGG CCGACCGCTA CGCCGGCAAG TTCGATGACG GTTGGGATGC CTACCGGCAG
CGGGTGTTCG ACCGGGCCAA GGACAAGGGC TGGATCCCGC AGGACGCAGT ACTCACCGAG
CGTGATCCGC AGTTGCCGGC GTGGGAGGAT ATCCCCGAGG ATGAGAAGCC GTTCCAGCGG
CGCTTGATGG AGGTCGCCGC CGGTTACGCC GAGCACGTGG ATGTGCAGGT GGGGCGCCTC
GTCGACGAGC TCGACGCGCT GGGGTACGGC GAGAACACGT TGTTCCTCTA CATCTGGGGT
GACAACGGAT CCTCCGGTGA GGGCCAGAAC GGGACCATCT CGGAGCTGTT GGCGCAGAAC
GGAATTCCGA CCACCGTGCG CCAGCACATC GACGCTCTGG ATGAGCTGGG CGGGCTGGAT
GTGCTGGGTT CCCCGTTGGT GGACAACCAG TACCACGCCG GGTGGGCGTG GGCCGGGTCC
ACCCCGTACA AGGGCATGAA GCTGATGGCC TCGCACCTGG GCGGCACCCG CAACCCGTTG
GCGGTGCGCT GGCCGGCCAA GGTCGCCGCC GACCCCACGC CGCGCACCCA GTTCCTGCAC
TGCAACGATG TCGTGCCGAC CATCTACCAG GTGGTCGGGA TCGAGGCGCC GCGCACGGTG
TTCGGGGAGA CCCAGATTCC GCTGGCCGGG GTGAGTTTCG CGCAGACTCT GGTTGACCGG
AATGCCGAAG GCGGCAAGAA GACCCAGTAC TTCGAGATCA TGGGCAGCCG CGGGATCTAC
CACGACGGCT GGTTCGCCGG CGCGGTCGGG CCGCGTCTGC CGTGGATACC GGGCCTGCCG
CCCGGGATGG CCACCTGGAC CCCGGACCAG GACACCTGGG AGCTATACCA CCTGGACGAG
GACTGGACCC AGGCCAACGA TCTCGCCGCG CAGATGCCCG ACAAACTCGC GCAGATGCGC
GAGACCTTCG CTATCGAGGC CGCCAAGAAC GCGGTGCTGC CGATCGGCGG CGGGCTGTGG
GTGCCGGTTT ATCACCCCGA GCTGCGGATC GCACCGCCAT TCACCGAGTG GGAGTTCTCC
GGCGACATGA TCCGGATGCC GGAGTTCTGC GCACCGAGCC TCGGGAACAA GAACAACACG
GTGACCATCG ACGCCGACAT CCCCGCCAAT GCCCACGGCG TCCTCTACGC CCTCGGCTCC
GGCGCCGGCG GGTTGACCGT CTATATGGAC GAGGGCGACC TCTGCTACGA ATACAACCTG
TTCATCCTGT CGCGCACCAA GATCCGCACC ACGGAGAAGA TCCCTCCGGG ACGTGTCACA
CTCACCGTGG CCACCCGGTA CGCCGATCCA CGCCCGGCCG GACCGCTCGA CATCACGATC
GCCCGCAACG GTGAATCGAT CGCCACCGGC CAGGTGCCGA TGAGTGCGCC GCTGCTGTTC
ACCGCCAACG ATTGCCTCGA CATCGGCACC TGCCTCGGGT CACCGGTGTC GATGGACTAC
CGGGAACGCG CACCGTTCCC GTTCGAGGGC CGCATCCACA CCGTCCACGT CGCCTACACC
TAA
 
Protein sequence
MLSERLDGQI VAPVLPDGGV LPFPPTPSGS IAGRTLQEST YQPRAIPKHL HEDSPNIVIV 
LIDDAGPGLP STFGGEVTTA TLDRICGEGV SYNRFHTTAM CSPTRASLLT GRNHHEIGNG
QIAELANDWD GYAGKIPRSS ATVAEVLKQY GYATSAFGKW HNTPAEETTA AGPFENWPTG
LGFEYFYGFL AGEASQYEPN LVRNTTVVAP PKTPEEGYHL SEDLADDAIS WLRRHKAFNA
DKPFMMYWAS GCLHGPHHIM KDWADRYAGK FDDGWDAYRQ RVFDRAKDKG WIPQDAVLTE
RDPQLPAWED IPEDEKPFQR RLMEVAAGYA EHVDVQVGRL VDELDALGYG ENTLFLYIWG
DNGSSGEGQN GTISELLAQN GIPTTVRQHI DALDELGGLD VLGSPLVDNQ YHAGWAWAGS
TPYKGMKLMA SHLGGTRNPL AVRWPAKVAA DPTPRTQFLH CNDVVPTIYQ VVGIEAPRTV
FGETQIPLAG VSFAQTLVDR NAEGGKKTQY FEIMGSRGIY HDGWFAGAVG PRLPWIPGLP
PGMATWTPDQ DTWELYHLDE DWTQANDLAA QMPDKLAQMR ETFAIEAAKN AVLPIGGGLW
VPVYHPELRI APPFTEWEFS GDMIRMPEFC APSLGNKNNT VTIDADIPAN AHGVLYALGS
GAGGLTVYMD EGDLCYEYNL FILSRTKIRT TEKIPPGRVT LTVATRYADP RPAGPLDITI
ARNGESIATG QVPMSAPLLF TANDCLDIGT CLGSPVSMDY RERAPFPFEG RIHTVHVAYT