Gene Mflv_1533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1533 
Symbol 
ID4972859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1600236 
End bp1601387 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content64% 
IMG OID640455737 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001132803 
Protein GI145222125 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG ACACGGCTGG CATTCGCGAG ATCGACACCG GAGCGCTGCC CGACCGGTAC 
GCCAGAGGCT GGCACTGCCT CGGTCCGGTC AAGAACTTCA CCGACGGCGA ACCACACGGC
ATCGAGATCT TCGGGACCAT GCTGGTGGTC TTCGCCGACT CGCAGGGCGA ATTGAAGGTC
CTCGACGGCT ACTGCCGCCA CATGGGCGGC AACCTCGCCC AGGGCACCAT CAAGGGCGAC
GAGGTCGCCT GCCCGTTCCA CGACTGGCGC TGGGGCGGCG ACGGCAAATG CAAGCTCGTC
CCCTATGCCA AACGCACCCC CCGCCTGGCC CGCACGCGCG CCTGGCACAC CGACGTCCGC
GGCGGGTTGC TCTTCGTCTG GCACGACCAC GAGGGCAATC CTCCGCAGCC GGAGGTCCGC
ATCCCGGAGA TCCCGCAGTG GTCGAGCGGC GAGTGGACCG ACTGGAAGTG GAACACGATG
CTGATCGAGG GCTCCAACTG CCGCGAGATC ATCGACAACG TCACCGACAT GGCGCACTTC
TTCTACATCC ATTTCGGCTT GCCGACGTAT TTCAAGAACG TCTTCGAAGG GCATGTCGCC
AGCCAGTACC TGCACAACGT CGGCCGCCCC GACATCAACG ACATGGGCAC CGCCTACGGT
GACGCGTCCC TGGACTCCGA GGCCAGCTAC TTCGGCCCGT CGTTCATGAT CAACTGGCTG
CACAACACCT ACGGCGACTT CAAGGCCGAG TCGATCCTGA TCAACTGTCA CTATCCGGTG
TCGCAGGACT CGTTCGTCCT GCAGTGGGGT GTGATCGTGG AGAAGCCCCA GGGCCTCGAC
GACAAGACCA CCGAGAAACT CGCCGATGCG TTCACCGACG GTGTCAGCAA GGGCTTCCTG
CAGGACGTCG AGATCTGGAA GCACAAGACG CGTATCGACA ACCCCCTGCT GGTCGAAGAA
GACGGCGCCG TCTACCAGAT GCGCCGTTGG TACCAGCAGT TCTACGTCGA CGTCGCCGAC
GTGACGCCGG AGATGACCGA CCGCTTCGAG ATGGAAGTCG ACACCACGGT GGCGAACCAG
AAGTGGAACG TCGAGGTCGA GGAGAATCTC AAGGCGCGCG AGGCCGAGAA GACGGAGCAG
CCGGCGACAT GA
 
Protein sequence
MSTDTAGIRE IDTGALPDRY ARGWHCLGPV KNFTDGEPHG IEIFGTMLVV FADSQGELKV 
LDGYCRHMGG NLAQGTIKGD EVACPFHDWR WGGDGKCKLV PYAKRTPRLA RTRAWHTDVR
GGLLFVWHDH EGNPPQPEVR IPEIPQWSSG EWTDWKWNTM LIEGSNCREI IDNVTDMAHF
FYIHFGLPTY FKNVFEGHVA SQYLHNVGRP DINDMGTAYG DASLDSEASY FGPSFMINWL
HNTYGDFKAE SILINCHYPV SQDSFVLQWG VIVEKPQGLD DKTTEKLADA FTDGVSKGFL
QDVEIWKHKT RIDNPLLVEE DGAVYQMRRW YQQFYVDVAD VTPEMTDRFE MEVDTTVANQ
KWNVEVEENL KAREAEKTEQ PAT