Gene Mflv_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_2372 
Symbol 
ID4973693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp2469510 
End bp2471324 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content71% 
IMG OID640456585 
Productsulfatase 
Protein accessionYP_001133637 
Protein GI145222959 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCGC AGAGGCCAGA CGTCGTGATC GTGATGACCG ACGAGGAGCG TGCGATCCCG 
CCGTACGAAT CGGATCGGGT ACGTACCTGG CGTGACGAAA CCCTGACCGG CAGAAGGTGG
TTCGAGGAGC ACGGCGTGAG CTTCACGCGC CACTACACAG GATCACTCGC GTGCGTCCCC
AGCCGTCCGA CCATCTTCAC CGGCCACTAC CCGGACCTGC ACGGGGTCAC GCAGACCGAC
GGCATCGGCA AGTCCCACGA CGATTCGCGG CTGCGGTGGC TGCGCCGCGG CGAGGTCCCG
ACCTTGGGCA ACTGGTTCCG TGCCGCCGGC TATGACACGC ACTACGACGG CAAGTGGCAC
ATCTCCCACG CCGACCTGAC CGACCCGTCC ACCGGTCGAC CACTGGCGAC CAACGACTCC
GACGGTGTCG TCGATCCCGG TGCCGTAAAA CGCTATCTGG ACGCCGACCC GCTGGCGCCG
TACGGCTTCT CGGGCTGGGT CGGGCCGGAA CCGCACGGCG CCGGCCTGGC CAACGCCGGA
ATCCGGCGCG ATCCGCTCAT CGCCGACCGC GTGGTCGCCT GGCTGACTGC CCGTTACGCG
GCCCGCGCCG CCGGAGATTC TGCTGCGCTA CGTCCGTTCC TGCTGGTCGC CAGCTTCGTC
AATCCCCACG ACATCGTGCT GTTCCCGGCG TGGGCACGCC GAAATCCGTT GTCCCCCTCG
CCGCTCGACC CGCCGTCCGT CCCGCCCGCG CCCACCGCGG ACGAGGACCT CTCGACGAAA
CCCGCCGCAC AGATCGCGTT CCGCGAGGCC TACTACTCCG GATACGGGCC CGCCGGCAGC
ATCGAACGCA CCTACCGCCG CAACGCCCAG CGCTACCGCG ATCTGTACTA CCGCCTGCAC
GCCGAAGTGG ACGAACCGAT CGATCGCGTG CGGCGCGCCG TGACCGACGG CGCCGGAGCG
CATCCGACGG TCCTGGTCCG CACGGCCGAC CACGGCGACC TGCTGGGCGC CCACGGCGGA
CTGCACCAGA AGTGGTTCAA CCTCTACGAC GAGGCCACCC GGGTGCCGTT CGTCATCGCC
CGGACCGGAC CCGACGCCAC CACGGCCCGC ACGGTGACGG CACCGACGTC GCACGTCGAT
CTGGTGCCGA CCCTGTTGGC CGCGGCCGGG ATCGACGCCG AGTCCGTCGC CGCCACGCTG
GGGGAATCGT TCACCGAGGT CCATCCGCTT CCGGGCCGTG ACCTGATGCC GGTTGTCGAC
GGTGCGCCCG CGGACGAGGA CCGCCCGGTG TACCTGATGA CCCGGGACAA CGTCCTCGAA
GGCGACACCG GGGCCTCCGG CCTTGCCCGG GCCCTTCGCC TGACCTCGAG AGTCCCTGCG
CCGCTGCGGA TCCGGGTACC CGCACATACC GCGGCGAATT TCGAGGGCCT GGTGCTGCGG
GTCCCCGAGA CGTCCGCGGC GGGCGGCGGC GGCCACCTGT GGAAGCTGGT CCGGTCGTTC
GACGACCCCG GCACCTGGAC CGAGCCGGGG GTCCGTCAGC TGGCGGCCGA CGGCGTCGGC
GGCCCCACCT ATCGCAGCGA GCCGCTCGAC GACCAGTGGG AACTCTACGA CCTGACCGAC
GATCCGATCG AGCAGACCAA TCGGTGGCCC GACCCCGCGC TGCACGCACT CCGCGCGTAC
CTACGGACGC AACTCAAACA CGCTCGCGCA CAATCGATTC CCGAACGCAA CCAACCCTGG
CCGTATGCCC GGCGGCAACC CCCGCCGGCG CGGCGGTGGA CCCCGGGGCG GGCGCTGCGC
CGGCTGCTGG GCTGA
 
Protein sequence
MTAQRPDVVI VMTDEERAIP PYESDRVRTW RDETLTGRRW FEEHGVSFTR HYTGSLACVP 
SRPTIFTGHY PDLHGVTQTD GIGKSHDDSR LRWLRRGEVP TLGNWFRAAG YDTHYDGKWH
ISHADLTDPS TGRPLATNDS DGVVDPGAVK RYLDADPLAP YGFSGWVGPE PHGAGLANAG
IRRDPLIADR VVAWLTARYA ARAAGDSAAL RPFLLVASFV NPHDIVLFPA WARRNPLSPS
PLDPPSVPPA PTADEDLSTK PAAQIAFREA YYSGYGPAGS IERTYRRNAQ RYRDLYYRLH
AEVDEPIDRV RRAVTDGAGA HPTVLVRTAD HGDLLGAHGG LHQKWFNLYD EATRVPFVIA
RTGPDATTAR TVTAPTSHVD LVPTLLAAAG IDAESVAATL GESFTEVHPL PGRDLMPVVD
GAPADEDRPV YLMTRDNVLE GDTGASGLAR ALRLTSRVPA PLRIRVPAHT AANFEGLVLR
VPETSAAGGG GHLWKLVRSF DDPGTWTEPG VRQLAADGVG GPTYRSEPLD DQWELYDLTD
DPIEQTNRWP DPALHALRAY LRTQLKHARA QSIPERNQPW PYARRQPPPA RRWTPGRALR
RLLG