Gene Mflv_0595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0595 
Symbol 
ID4971929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp621093 
End bp622544 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content59% 
IMG OID640454796 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_001131873 
Protein GI145221195 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.577316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGATC ACGGTGAGGT GTTGGCGGCT GTACGCACTG GCATGATCCC GGCGCACGTG 
TATAACGACA AGCAGATTTT TTCGCTCGAG AAGGAGCGGC TGTTCAGTCG GGCGTGGTTG
TTCGTGGCGC ACGAATCGGA GATTCCGCAA CCGGGGGACT ACGTGGTCAG ACAAGTGTTG
CAGGATTCGT TCATCGTCGC TCGTGATTCT GCGGGCGAGG TCCGGGTGAT GTTCAATATG
TGCCTCCATC GCGGTATGCA GGTTTGTCGG GCGGAGATGG GGAACGCGTC GAACTTCAGA
TGCCCGTACC ACGGGTGGTC TTACCGCAAT GACGGCCGCA TTATCGGACT GCCTTTTCAC
CAAGAGGCCT ATGGAGGAGA CGCGGGGTTT AACAAGGCGG GGCAGACCCT GTTGCCAGCG
CCGAGTGTGG CCAGCTACAA CGGGTTGATC TTTCTGTCGA TGGATCCTGA CGCAGAATCG
CTTGAAGACT ATCTGGGTGA TTTCAGGTTC TATCTCGATT TCTACACCAA GCAAGGCCCC
AACGGTCTTG AGGTGCGAGG TCCGCAGCGT TGGCGGGTAA AAGCGAACTG GAAGATCGCA
GCTGAAAATT TCGCCGGGGA CATGTACCAC ACACCTCAGA CGCACACGTC GGTGGTCGAG
ATCGGCCTGT TCCGAGAGCC GAAGGCTAAC AAGCGCAAAG ACGGCGCCAC GTATTGGGCG
GGTAGAGGTG GGGGCACCAC ATACAAGCTG CCCGAGGGGA GTTTCGAGGA CCGGATGAGC
TATGTGGGCT ACCCGGCGGA CATGATTAGT CGAGCCAAGG CCACCTGGAC CGAGCAGCAG
CAACAAGTCG TCGGCACCGA CGGGTTCATG ATCTCGGCCG CGACGTGTTT TCCCAACATC
AGTTTCGTGC ACAACTGGCC GAAAGTGGAG GACGGGGAGC ACGTCTTGCC GTTCATTTCA
ATCCGGGTGT GGCAGCCAAT CAGCGAGAAC GAAACCGAGG TGCTGTCGTG GTTTGCGGTG
GATTCTGATG CCCCGGCAGA CTTTAAGGCG GACTCGTATA AGGCTTATTT GATGTGCTTC
GGCTCGACGG GAATGTTCGA GCAAGACGAT GTCGAGAACT GGGTGTCGCT GACCAACACG
GCGGGGGGTT CCATGGCCCG CCGACTGCGG CTGAACAGCC GGATGGGGCT GCTCGCAGAC
GATGTACGGG TGGTCGACAC CCTTAGCAGC GCTCAATTCC ACGGGCCGGG ATACGCTCAG
CTCGGTTACA ACGAGAACAA TCAACGGCAA TTGTTGAGGC TCTGGGCCGA CTACCTGGAC
ATGCCGCCGC TGCGCGTCGA CCCGGCTACG GTGCTCAGCG ACAATCCGCA TGGAATTGAA
CCAATGGTCC AGACCAACGG CGCGGCCGCC GCCGATATTG ACTCGGGGTC CGCTGAGTCG
GTGATGCTAT GA
 
Protein sequence
MQDHGEVLAA VRTGMIPAHV YNDKQIFSLE KERLFSRAWL FVAHESEIPQ PGDYVVRQVL 
QDSFIVARDS AGEVRVMFNM CLHRGMQVCR AEMGNASNFR CPYHGWSYRN DGRIIGLPFH
QEAYGGDAGF NKAGQTLLPA PSVASYNGLI FLSMDPDAES LEDYLGDFRF YLDFYTKQGP
NGLEVRGPQR WRVKANWKIA AENFAGDMYH TPQTHTSVVE IGLFREPKAN KRKDGATYWA
GRGGGTTYKL PEGSFEDRMS YVGYPADMIS RAKATWTEQQ QQVVGTDGFM ISAATCFPNI
SFVHNWPKVE DGEHVLPFIS IRVWQPISEN ETEVLSWFAV DSDAPADFKA DSYKAYLMCF
GSTGMFEQDD VENWVSLTNT AGGSMARRLR LNSRMGLLAD DVRVVDTLSS AQFHGPGYAQ
LGYNENNQRQ LLRLWADYLD MPPLRVDPAT VLSDNPHGIE PMVQTNGAAA ADIDSGSAES
VML