Gene Mflv_4463 details

Gene Information

Locus tag: Mflv_4463
Symbol:
ID: 4975776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 4750425
End bp: 4751528
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 65%
IMG OID: 640458692
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_001135720
Protein GI: 145225042
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID:
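
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the protein length excludes the stop codon. The following is a minimal sketch of that arithmetic. The function names are hypothetical and the coordinate convention is an assumption, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4750425, 4751528)
print(length)                  # 1104
print(protein_length(length))  # 367
```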


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.100375
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAATCCT TCGTCCACCT GCGAAAGGGT AAGACGCCCA AGCGGATTCA TGCCGATCTG 
GACGGGCTCA AGGACGACGA GCTCGGACGC GGCGGATTCG TCGGACGCAC CGCGAACATG
TACCGCCGCA ATGACCCGAC GGCGTACCGC ACCGTCGGGC CACTGCGACC CACCGACGTC
CTGAGCTCCG AACTCAAGCC GAGCGACGCG ACCGACCCTC GCGGCGGCCC CCTACTCATG
TTCTCCAACG CGGACTGTCA GGTGCTGCTG TCCCGGCGCA CCGAGGAGAT GCCCTTCTTC
GTGCGGTACG TCGACGGCGA CCTGCTCTCG TTCGTCCACC GCGGATCCGG CTCGCTGGAA
ACAGAATTCG GGCCGCTGAC CTACCGCCAG GGCGACTGGA TCTACATCCC GAAGGCCTGC
ACATGGCGCC AAATTCCTGA TCCTGGTCCC ACCGGGACTA CCACGCTGCT GATGGTCCAG
GCCACCGAGG AGTTCCGTGT CCCACCCGCA GGCACTCTGG GGCGGCATTT CCCGTTCGAC
CCGGCGCAGG CGGTCATCCC GGAACCGCAG CCGATCGACG ACGACGGCAG GGACGAATAC
GAGGTGCGGC TGATCCATGA GGGCGGCCCC ACATCGCTGT TCTACAAGCA CCATCCGCTC
GATGTCGAAG GCTGGCGCGG CGACAACTTC CCGTTCACCT TCAACATCGA CGACTACACG
GTGATCACCT CCGACAGCGT CCACCTGCCG CCGACCGTGC ACCTGTTCAT GCAGGCGACC
GGCGTCTACA TCATGAACTT CCTGCCCAAG CCCGCGGAAT CGGTTCCCGG GACCGAGCGC
ACACCGTGGT ACCACCGCAA CGTCGACTTC GATGAGATCG CGTTCTTCCA CGACGGCTCG
CTGTACGGAA TCCCGATGCC GCCCGGCCTG GTCTCTCACG CCCCCCAGGG CGTCCATCAC
GGCGCGCCGG AGAAGGCGCG CGAGCGTGCA CGACGCAAGT TCGACGACTA CGACCGCGTG
GACTGGTCCG TCATCGCCGT CGACACCCGC AGGCGGTTGA TCCCGTCTCC GGAGATTCTC
GCCAACGATC TGGGGCAGCA CTAA
 
Protein sequence
MESFVHLRKG KTPKRIHADL DGLKDDELGR GGFVGRTANM YRRNDPTAYR TVGPLRPTDV 
LSSELKPSDA TDPRGGPLLM FSNADCQVLL SRRTEEMPFF VRYVDGDLLS FVHRGSGSLE
TEFGPLTYRQ GDWIYIPKAC TWRQIPDPGP TGTTTLLMVQ ATEEFRVPPA GTLGRHFPFD
PAQAVIPEPQ PIDDDGRDEY EVRLIHEGGP TSLFYKHHPL DVEGWRGDNF PFTFNIDDYT
VITSDSVHLP PTVHLFMQAT GVYIMNFLPK PAESVPGTER TPWYHRNVDF DEIAFFHDGS
LYGIPMPPGL VSHAPQGVHH GAPEKARERA RRKFDDYDRV DWSVIAVDTR RRLIPSPEIL
ANDLGQH
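
The relationship between the gene sequence, the protein sequence and the GC content field can be checked with a short script. The sketch below is an illustration only: the helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, the codon table is the standard one (translation table 11 uses the same codon assignments for elongation), and the first codon is unconditionally read as methionine, which is a simplifying assumption that fits this record because GTG is one of the alternative start codons in table 11.

```python
# Standard codon assignments in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Assumption: the first codon is the initiator and is read as Met,
    # even when it is an alternative start codon such as GTG.
    residues[0] = "M"
    return "".join(residues).split("*", 1)[0]


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGAATCCT TCGTCCACCT GCGAAAGGGT"))  # MESFVHLRKG
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence, or passing it to `gc_fraction` and comparing with the GC content field, gives a quick consistency check on the record.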