Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mflv_2372 |
Symbol | |
ID | 4973693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium gilvum PYR-GCK |
Kingdom | Bacteria |
Replicon accession | NC_009338 |
Strand | - |
Start bp | 2469510 |
End bp | 2471324 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640456585 |
Product | sulfatase |
Protein accession | YP_001133637 |
Protein GI | 145222959 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCGC AGAGGCCAGA CGTCGTGATC GTGATGACCG ACGAGGAGCG TGCGATCCCG CCGTACGAAT CGGATCGGGT ACGTACCTGG CGTGACGAAA CCCTGACCGG CAGAAGGTGG TTCGAGGAGC ACGGCGTGAG CTTCACGCGC CACTACACAG GATCACTCGC GTGCGTCCCC AGCCGTCCGA CCATCTTCAC CGGCCACTAC CCGGACCTGC ACGGGGTCAC GCAGACCGAC GGCATCGGCA AGTCCCACGA CGATTCGCGG CTGCGGTGGC TGCGCCGCGG CGAGGTCCCG ACCTTGGGCA ACTGGTTCCG TGCCGCCGGC TATGACACGC ACTACGACGG CAAGTGGCAC ATCTCCCACG CCGACCTGAC CGACCCGTCC ACCGGTCGAC CACTGGCGAC CAACGACTCC GACGGTGTCG TCGATCCCGG TGCCGTAAAA CGCTATCTGG ACGCCGACCC GCTGGCGCCG TACGGCTTCT CGGGCTGGGT CGGGCCGGAA CCGCACGGCG CCGGCCTGGC CAACGCCGGA ATCCGGCGCG ATCCGCTCAT CGCCGACCGC GTGGTCGCCT GGCTGACTGC CCGTTACGCG GCCCGCGCCG CCGGAGATTC TGCTGCGCTA CGTCCGTTCC TGCTGGTCGC CAGCTTCGTC AATCCCCACG ACATCGTGCT GTTCCCGGCG TGGGCACGCC GAAATCCGTT GTCCCCCTCG CCGCTCGACC CGCCGTCCGT CCCGCCCGCG CCCACCGCGG ACGAGGACCT CTCGACGAAA CCCGCCGCAC AGATCGCGTT CCGCGAGGCC TACTACTCCG GATACGGGCC CGCCGGCAGC ATCGAACGCA CCTACCGCCG CAACGCCCAG CGCTACCGCG ATCTGTACTA CCGCCTGCAC GCCGAAGTGG ACGAACCGAT CGATCGCGTG CGGCGCGCCG TGACCGACGG CGCCGGAGCG CATCCGACGG TCCTGGTCCG CACGGCCGAC CACGGCGACC TGCTGGGCGC CCACGGCGGA CTGCACCAGA AGTGGTTCAA CCTCTACGAC GAGGCCACCC GGGTGCCGTT CGTCATCGCC CGGACCGGAC CCGACGCCAC CACGGCCCGC ACGGTGACGG CACCGACGTC GCACGTCGAT CTGGTGCCGA CCCTGTTGGC CGCGGCCGGG ATCGACGCCG AGTCCGTCGC CGCCACGCTG GGGGAATCGT TCACCGAGGT CCATCCGCTT CCGGGCCGTG ACCTGATGCC GGTTGTCGAC GGTGCGCCCG CGGACGAGGA CCGCCCGGTG TACCTGATGA CCCGGGACAA CGTCCTCGAA GGCGACACCG GGGCCTCCGG CCTTGCCCGG GCCCTTCGCC TGACCTCGAG AGTCCCTGCG CCGCTGCGGA TCCGGGTACC CGCACATACC GCGGCGAATT TCGAGGGCCT GGTGCTGCGG GTCCCCGAGA CGTCCGCGGC GGGCGGCGGC GGCCACCTGT GGAAGCTGGT CCGGTCGTTC GACGACCCCG GCACCTGGAC CGAGCCGGGG GTCCGTCAGC TGGCGGCCGA CGGCGTCGGC GGCCCCACCT ATCGCAGCGA GCCGCTCGAC GACCAGTGGG AACTCTACGA CCTGACCGAC GATCCGATCG AGCAGACCAA TCGGTGGCCC GACCCCGCGC TGCACGCACT CCGCGCGTAC CTACGGACGC AACTCAAACA CGCTCGCGCA CAATCGATTC CCGAACGCAA CCAACCCTGG CCGTATGCCC GGCGGCAACC CCCGCCGGCG CGGCGGTGGA CCCCGGGGCG GGCGCTGCGC CGGCTGCTGG GCTGA
|
Protein sequence | MTAQRPDVVI VMTDEERAIP PYESDRVRTW RDETLTGRRW FEEHGVSFTR HYTGSLACVP SRPTIFTGHY PDLHGVTQTD GIGKSHDDSR LRWLRRGEVP TLGNWFRAAG YDTHYDGKWH ISHADLTDPS TGRPLATNDS DGVVDPGAVK RYLDADPLAP YGFSGWVGPE PHGAGLANAG IRRDPLIADR VVAWLTARYA ARAAGDSAAL RPFLLVASFV NPHDIVLFPA WARRNPLSPS PLDPPSVPPA PTADEDLSTK PAAQIAFREA YYSGYGPAGS IERTYRRNAQ RYRDLYYRLH AEVDEPIDRV RRAVTDGAGA HPTVLVRTAD HGDLLGAHGG LHQKWFNLYD EATRVPFVIA RTGPDATTAR TVTAPTSHVD LVPTLLAAAG IDAESVAATL GESFTEVHPL PGRDLMPVVD GAPADEDRPV YLMTRDNVLE GDTGASGLAR ALRLTSRVPA PLRIRVPAHT AANFEGLVLR VPETSAAGGG GHLWKLVRSF DDPGTWTEPG VRQLAADGVG GPTYRSEPLD DQWELYDLTD DPIEQTNRWP DPALHALRAY LRTQLKHARA QSIPERNQPW PYARRQPPPA RRWTPGRALR RLLG
|
| |