Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mflv_5494 |
Symbol | |
ID | 4976885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium gilvum PYR-GCK |
Kingdom | Bacteria |
Replicon accession | NC_009339 |
Strand | - |
Start bp | 232563 |
End bp | 234905 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640459720 |
Product | sulfatase |
Protein accession | YP_001136743 |
Protein GI | 145226089 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.309233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.572846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGTCTG AGCGGCTGGA TGGTCAGATT GTGGCGCCGG TGTTGCCCGA CGGCGGGGTG TTGCCATTCC CACCGACGCC GTCGGGCAGT ATCGCCGGCC GCACGCTGCA GGAATCGACG TACCAGCCGC GCGCGATCCC CAAGCATCTG CATGAGGATT CGCCGAACAT CGTGATCGTG TTGATCGACG ACGCCGGGCC CGGTTTGCCG TCGACGTTCG GCGGTGAGGT CACCACCGCG ACACTGGACC GGATCTGCGG CGAAGGCGTG TCCTACAACC GCTTTCACAC CACGGCGATG TGTTCGCCGA CCCGGGCGTC GCTGCTGACG GGCCGCAATC ATCACGAGAT CGGCAACGGT CAGATCGCGG AGCTGGCCAA CGACTGGGAC GGCTACGCCG GCAAGATCCC GCGCTCCAGT GCGACCGTGG CCGAGGTGCT CAAGCAGTAC GGCTACGCGA CGTCGGCGTT CGGGAAGTGG CACAACACCC CGGCCGAGGA AACGACCGCG GCGGGTCCGT TCGAGAACTG GCCCACCGGG CTGGGTTTCG AGTACTTCTA CGGCTTCCTC GCCGGTGAGG CCTCACAGTA CGAGCCGAAC CTGGTGCGCA ACACCACCGT GGTCGCCCCG CCGAAGACTC CGGAGGAGGG CTATCACCTG TCGGAGGATC TGGCCGATGA CGCCATCTCC TGGCTGCGCC GGCACAAGGC GTTCAACGCC GATAAGCCGT TCATGATGTA TTGGGCCAGC GGCTGCCTGC ACGGCCCGCA CCACATCATG AAGGACTGGG CCGACCGCTA CGCCGGCAAG TTCGATGACG GTTGGGATGC CTACCGGCAG CGGGTGTTCG ACCGGGCCAA GGACAAGGGC TGGATCCCGC AGGACGCAGT ACTCACCGAG CGTGATCCGC AGTTGCCGGC GTGGGAGGAT ATCCCCGAGG ATGAGAAGCC GTTCCAGCGG CGCTTGATGG AGGTCGCCGC CGGTTACGCC GAGCACGTGG ATGTGCAGGT GGGGCGCCTC GTCGACGAGC TCGACGCGCT GGGGTACGGC GAGAACACGT TGTTCCTCTA CATCTGGGGT GACAACGGAT CCTCCGGTGA GGGCCAGAAC GGGACCATCT CGGAGCTGTT GGCGCAGAAC GGAATTCCGA CCACCGTGCG CCAGCACATC GACGCTCTGG ATGAGCTGGG CGGGCTGGAT GTGCTGGGTT CCCCGTTGGT GGACAACCAG TACCACGCCG GGTGGGCGTG GGCCGGGTCC ACCCCGTACA AGGGCATGAA GCTGATGGCC TCGCACCTGG GCGGCACCCG CAACCCGTTG GCGGTGCGCT GGCCGGCCAA GGTCGCCGCC GACCCCACGC CGCGCACCCA GTTCCTGCAC TGCAACGATG TCGTGCCGAC CATCTACCAG GTGGTCGGGA TCGAGGCGCC GCGCACGGTG TTCGGGGAGA CCCAGATTCC GCTGGCCGGG GTGAGTTTCG CGCAGACTCT GGTTGACCGG AATGCCGAAG GCGGCAAGAA GACCCAGTAC TTCGAGATCA TGGGCAGCCG CGGGATCTAC CACGACGGCT GGTTCGCCGG CGCGGTCGGG CCGCGTCTGC CGTGGATACC GGGCCTGCCG CCCGGGATGG CCACCTGGAC CCCGGACCAG GACACCTGGG AGCTATACCA CCTGGACGAG GACTGGACCC AGGCCAACGA TCTCGCCGCG CAGATGCCCG ACAAACTCGC GCAGATGCGC GAGACCTTCG CTATCGAGGC CGCCAAGAAC GCGGTGCTGC CGATCGGCGG CGGGCTGTGG GTGCCGGTTT ATCACCCCGA GCTGCGGATC GCACCGCCAT TCACCGAGTG GGAGTTCTCC GGCGACATGA TCCGGATGCC GGAGTTCTGC GCACCGAGCC TCGGGAACAA GAACAACACG GTGACCATCG ACGCCGACAT CCCCGCCAAT GCCCACGGCG TCCTCTACGC CCTCGGCTCC GGCGCCGGCG GGTTGACCGT CTATATGGAC GAGGGCGACC TCTGCTACGA ATACAACCTG TTCATCCTGT CGCGCACCAA GATCCGCACC ACGGAGAAGA TCCCTCCGGG ACGTGTCACA CTCACCGTGG CCACCCGGTA CGCCGATCCA CGCCCGGCCG GACCGCTCGA CATCACGATC GCCCGCAACG GTGAATCGAT CGCCACCGGC CAGGTGCCGA TGAGTGCGCC GCTGCTGTTC ACCGCCAACG ATTGCCTCGA CATCGGCACC TGCCTCGGGT CACCGGTGTC GATGGACTAC CGGGAACGCG CACCGTTCCC GTTCGAGGGC CGCATCCACA CCGTCCACGT CGCCTACACC TAA
|
Protein sequence | MLSERLDGQI VAPVLPDGGV LPFPPTPSGS IAGRTLQEST YQPRAIPKHL HEDSPNIVIV LIDDAGPGLP STFGGEVTTA TLDRICGEGV SYNRFHTTAM CSPTRASLLT GRNHHEIGNG QIAELANDWD GYAGKIPRSS ATVAEVLKQY GYATSAFGKW HNTPAEETTA AGPFENWPTG LGFEYFYGFL AGEASQYEPN LVRNTTVVAP PKTPEEGYHL SEDLADDAIS WLRRHKAFNA DKPFMMYWAS GCLHGPHHIM KDWADRYAGK FDDGWDAYRQ RVFDRAKDKG WIPQDAVLTE RDPQLPAWED IPEDEKPFQR RLMEVAAGYA EHVDVQVGRL VDELDALGYG ENTLFLYIWG DNGSSGEGQN GTISELLAQN GIPTTVRQHI DALDELGGLD VLGSPLVDNQ YHAGWAWAGS TPYKGMKLMA SHLGGTRNPL AVRWPAKVAA DPTPRTQFLH CNDVVPTIYQ VVGIEAPRTV FGETQIPLAG VSFAQTLVDR NAEGGKKTQY FEIMGSRGIY HDGWFAGAVG PRLPWIPGLP PGMATWTPDQ DTWELYHLDE DWTQANDLAA QMPDKLAQMR ETFAIEAAKN AVLPIGGGLW VPVYHPELRI APPFTEWEFS GDMIRMPEFC APSLGNKNNT VTIDADIPAN AHGVLYALGS GAGGLTVYMD EGDLCYEYNL FILSRTKIRT TEKIPPGRVT LTVATRYADP RPAGPLDITI ARNGESIATG QVPMSAPLLF TANDCLDIGT CLGSPVSMDY RERAPFPFEG RIHTVHVAYT
|
| |