Gene Mflv_4566 details

Gene Information

Locus tag: Mflv_4566
Symbol:
ID: 4975878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 4861005
End bp: 4862165
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 64%
IMG OID: 640458794
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001135822
Protein GI: 145225144
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
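
The length fields above follow from the coordinates. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so, but this is the reading that reproduces the listed 1161 bp) and that the CDS is unspliced and ends in a stop codon, the arithmetic can be checked in Python. The function names are illustrative only and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4861005, 4862165)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch yields 1161 bp and 386 aa, matching the Gene Length and Protein Length fields.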


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0832436
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.12498
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTATGA CTGTGCAGCC CCCCGCCGAC GAGGTGCGTC TGATCGAGGC CGACGCCGCA 
CCGACCCGCT TCGCCCGTGG ATGGCACTGC CTGGGCCTGA TCCGCGACTT CGGTGACGGA
ACACCCCACC AGGTCAACGC GTTCGGGCAG AAGCTCGTGG TCTTCCGTGC CGAGGACGGC
TCGATCAACG TGCTGGACAG CTACTGCCGG CACATGGGCG GAGACCTGTC GCAGGGCACG
GTCAAGGGCA ACGAGATCGC GTGCCCGTTC CACGACTGGC GCTGGGGCGG TGACGGCCGG
TGCAAACAGG TCCCGTACGC CAAGCGCGTC CCTCGCCTGG CCCGCACCCA GACCTGGCCG
ACGCTGGAGC AGGACGGCAT GCTGTTCGTC TGGAACGACC CGCAGCGCAA ACCGCCGCCG
GACGACGTCA CCATCCCGCG GATCGAAGGC GCCACCAGCG ACGAGTGGAC CGACTGGCAC
TGGTACACCA CCGTGGTCCA CACCAACTGC CGCGAGATCA TCGACAACGT CGTCGACATG
GCGCACTTCT TCTACATCCA CGGGTCGCTG CCCACCCAGT TCAAGAACAT CTTCGAAGGT
CACACCGCGA CGCAGTTCAT GAGCAGCGGC GGCCGTCCCG ACCTCGGCCA GGCCGAAGGC
GGTGTCAAAC TCCTGGGCAC GACGTCACTG GCGGCCTACC ACGGCCCGTC CTTCATGATC
GACGACCTGA CCTATCACTA CGAGCACGGC GACACGAAGA CCGTGTTGAT CAACTGCCAC
TATCCGATCG ACGCGAATTC CTTTGTGCTG CAGTATGGCA TCGTGGTGCA GAAGTCCGCC
GACCTTCCCG AGGACCTGGC CGTGCAGACC GCGGTCGCTC TCGGTGACTT CGTGAAGATG
GGCTTCGAAC AGGACGTGCA GATCTGGCGG AACAAGGCGC GCATCGACAA CCCGCTGCTC
GTCGAGGAGG ACGGTCCGGT GTATCAGTTG CGACGCTGGT ACGAACAGTT CTACGTCGAC
GTCGAGGACG TGACCCCGGA CATGGTCGAT CGATTCGAAT TCGAGATCGA CACGACACGG
CCGCGCGAAT CCTGGATGAA AGAGGTCGAG GACAACATCG CCGCCGGCCG GATACCGAAG
CTGGCGTCCG GAACCACCTG A
 
Protein sequence
MGMTVQPPAD EVRLIEADAA PTRFARGWHC LGLIRDFGDG TPHQVNAFGQ KLVVFRAEDG 
SINVLDSYCR HMGGDLSQGT VKGNEIACPF HDWRWGGDGR CKQVPYAKRV PRLARTQTWP
TLEQDGMLFV WNDPQRKPPP DDVTIPRIEG ATSDEWTDWH WYTTVVHTNC REIIDNVVDM
AHFFYIHGSL PTQFKNIFEG HTATQFMSSG GRPDLGQAEG GVKLLGTTSL AAYHGPSFMI
DDLTYHYEHG DTKTVLINCH YPIDANSFVL QYGIVVQKSA DLPEDLAVQT AVALGDFVKM
GFEQDVQIWR NKARIDNPLL VEEDGPVYQL RRWYEQFYVD VEDVTPDMVD RFEFEIDTTR
PRESWMKEVE DNIAAGRIPK LASGTT
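
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a rough sketch of how the blocked text could be normalised and its GC fraction computed; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input. Note that the gene begins with GTG, which translation table 11 permits as an alternative start codon read as methionine, consistent with the leading M of the protein sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = "GTGGGTATGA CTGTGCAGCC CCCCGCCGAC GAGGTGCGTC TGATCGAGGC CGACGCCGCA"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `gc_fraction` is one way to compare against the GC content field of the record; the rounding convention used by the database is not stated, so small differences are possible.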