Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mflv_5039 |
Symbol | |
ID | 4976350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium gilvum PYR-GCK |
Kingdom | Bacteria |
Replicon accession | NC_009338 |
Strand | - |
Start bp | 5357697 |
End bp | 5360048 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640459266 |
Product | sulfatase |
Protein accession | YP_001136293 |
Protein GI | 145225615 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0252328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.170532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGACGG AGTTCAACGG CAAGATCGCG CTCGACATCC GCGATTCCGA ACCCGACTGG GGCCCGTTCG CGGCGCCGAC CGCGCAACCC GAGGCGCCCA ACGTGCTCTA CCTCGTCTGG GACGACATCG GCATCGCGAC CTGGGACTGC TTCGGTGGGC TCGTCGACAT GCCCGCCATG AGCCGCATCG CCGAACGCGG GGTGCGGCTT TCGCAGTTCC ACACCACGGC GCTGTGCTCG CCGACCCGGG CCTCGCTGCT GACCGGACGC AACGCGACCA CCGTCGGCAT GGCCACGATC GAGGAGTTCA CCGACGGTTT CCCGAACTGC AACGGTCGGA TCCCGTTCGA CACCGCTCTG CTCTCGGAGG TGCTTTCCGA GAACGGCTAC AACACCTACT GCATCGGCAA GTGGCATCTG ACGCCGTTGG AGGAATCCAA TCTGGCGGCC ACCAAGAGGC ACTGGCCGCT GTCGAGGGGG TTCGAGAGGT TCTACGGGTT CATGGGCGGC GAAACCGACC AGTGGTATCC GGATCTGATG TACGACAACC ACCCCGTCGC GCCTCCCGCA ACCCCGGAGG AGGGCTACCA CCTCTCGAAA GACCTGGCGG ACAAGACGAT CGAATTCATC CGCGATTCCA AGGTCATCGC GCCGGACAAG CCGTGGTTCT CGTATGTGTG CCCCGGGGCG GGACATGCGC CGCACCACGT CTTCAAGGAG TGGGCGGACC GTTACGCCGG ACGTTTCGAC ATGGGCTATG AGGCCTACCG GGAGATCGTG CTGGAGAACC AGAAGCGCCT CGGCCTGGTG CCGCCGGACA CCGAATTGTC GGCCGTCAAC CCCTATCTGG ACGTCAAAGG TCCCGACGGC CAGGACTGGC CGGCGCAGGA CACCGTGCGA CCGTGGGATT CGCTGAGCGA GGACGAGAAG CGGCTGTTCG CGCGGATGGC CGAGGTGTTC GCCGGATTCC TGTCCTACAC CGACGCCCAG ATCGGCCGCG TGCTCGACTA TCTCGAAGAG TCCGGCCAGC TCGACAACAC CGTCATCGTG GTCATCTCCG ACAACGGTGC CAGCGGCGAG GGCGGCCCGA ACGGTTCGGT CAACGAGGTC AAGTTCTTCA ACGGCTACAT CGACTCCGCG GAGGAGAGCC TGAAGGTCTT CGACGAACTC GGTGGCCCGC AGACCTACAA CCATTACCCG ATCGGCTGGG CGATGGCGTT CAACACCCCG TACAAGCTGT TCAAGCGCTA CGCCTCGCAC GAGGGCGGGA TCGCCGACAC CGCAATCATC TCCTGGCCCA AGGGGATTGC CGCGCACGGT GAGATCAGGG ACAACTACGT CAACGTCGCC GACATCACGC CCACGGTGTA CGACCTGCTC GACATCACGC CGCCGGCCAC GGTGCGCGGG GTCGCGCAGA AGCCGCTGGA CGGCGTGAGT TTCAAAGTGG CGCTGGAGAA TCCGAACGCG CCCACCGGCA AGGAGACACA GTTCTACACG ATGCTGGGCA CCCGCGGCAT CTGGCACCGG GGGTGGTTCG CCAACACCGT GCACGCGGCG TCCCCGGCCG GCTGGTCTCA CTTCGACGAC GACCGGTGGG AGCTCTACCA CGTCGACGCC GACCGCAGCC AGGTTCACGA CCTGGCCGCG GAGTACCCGG AAAAACTCGA CGAGCTCAAG GCGCTGTGGT TCTCCGAGGC GCAGAAGTAC AACGGGCTGC CGCTCGGTGA TCTCGACATC TTCGAGACGA TGTCGCGGTG GCGGCCGACG CTGTCGGGGG AACGATCCGC ATACGTGTAC TACCCGGGGA CGGCCGATGT CGGCATCGGC GCGGTGGTGG AGCTGCGCGG GCGGTCCTTC GCGGTGCTGG CCGAGGTCGA GGTCGACCCG GGCGGTGCCA ATGGTGTGGT CGTGAAACAC GGTGGGGCGC ACGGCGGTTA CGTGATGTAC CTGCAGGGCG GACGGCTGCA CTTCTGCTAC AACTTCCTCG GCGAGTACGA GCAGACACTG GCCGCGCCGG ATCCGGTCCC GCCCGGCCTG CACACGCTCG GTTTCACCTA CACCGTCACC GGCACCGCCG AGGGCAGCCA CACCCCGATC GGTGACGCCG AGCTGTTCCT CGACACCGAG CGGGTGGCCA GCCTCGCAGA GATGCGTTCG CAGCCGGGCA CTTTCGGTCT GGCCGGCGCG AGCCTGAGCG TGGGCCGCAA CAACGGCTCG CCGGTGTCGG AGGCCTACCA CCCGCCGTTC GCGTTCGCCG GCGGCCGAAT CGCGCGGGTG AACATCGACA CCTCGGGCGC GCCGTACGTC GATCTGGAAC GCGACTTCGC CCGGGCGTTC GCCAGGGACT GA
|
Protein sequence | MTTEFNGKIA LDIRDSEPDW GPFAAPTAQP EAPNVLYLVW DDIGIATWDC FGGLVDMPAM SRIAERGVRL SQFHTTALCS PTRASLLTGR NATTVGMATI EEFTDGFPNC NGRIPFDTAL LSEVLSENGY NTYCIGKWHL TPLEESNLAA TKRHWPLSRG FERFYGFMGG ETDQWYPDLM YDNHPVAPPA TPEEGYHLSK DLADKTIEFI RDSKVIAPDK PWFSYVCPGA GHAPHHVFKE WADRYAGRFD MGYEAYREIV LENQKRLGLV PPDTELSAVN PYLDVKGPDG QDWPAQDTVR PWDSLSEDEK RLFARMAEVF AGFLSYTDAQ IGRVLDYLEE SGQLDNTVIV VISDNGASGE GGPNGSVNEV KFFNGYIDSA EESLKVFDEL GGPQTYNHYP IGWAMAFNTP YKLFKRYASH EGGIADTAII SWPKGIAAHG EIRDNYVNVA DITPTVYDLL DITPPATVRG VAQKPLDGVS FKVALENPNA PTGKETQFYT MLGTRGIWHR GWFANTVHAA SPAGWSHFDD DRWELYHVDA DRSQVHDLAA EYPEKLDELK ALWFSEAQKY NGLPLGDLDI FETMSRWRPT LSGERSAYVY YPGTADVGIG AVVELRGRSF AVLAEVEVDP GGANGVVVKH GGAHGGYVMY LQGGRLHFCY NFLGEYEQTL AAPDPVPPGL HTLGFTYTVT GTAEGSHTPI GDAELFLDTE RVASLAEMRS QPGTFGLAGA SLSVGRNNGS PVSEAYHPPF AFAGGRIARV NIDTSGAPYV DLERDFARAF ARD
|
| |