Gene Mvan_4415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4415 
Symbol 
ID4649006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4738281 
End bp4739621 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content62% 
IMG OID639807886 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_955197 
Protein GI120405368 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.415837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACT TCACCGACCT CGTCGACCCT GAGCACGGCT GGGTCAGCCC ACAGATCTAT 
ACCGACCCCG AGATCTACGA ACGCGAGTTG CAACACGTGT TCGGACGCAG CTGGCTGTTT
CTGGCGCACG ACTCCCAGCT GCCCAAGCCG GGCAGCTTCC TGCAGACCTA CATGGGCGAG
GATCCGGTGC TCGTCGTCCG CCAGCGCGAT GGGTCGGTCC GCGCCTTTCT GAACCAGTGC
CGGCATCGAG GCATGCGGAT CTGTCGCTCC GAAGCCGGTG TCAGCAAGGC CTTCACCTGT
ACCTACCACG GTTGGTCCTA TGACCTGGCG GGCAACCTCA TCAACGTGCC GCTGGAAGAG
CGTGCCTACC ACAGCTCCAT CGACAAGAAG GAGTGGGGCG CAATGAAAGT GCCTCGCGTA
GCCAACTATC GCGGGTTCTA CTTCGGCACC TGGTCGGAGG AAACTCCGGA GTTCGACGCC
TACCTCGGTG ATATGGCCTT CTACTTCGAC GCGATCGTCG ACCGTTTCGA CTCAGGTCTG
GAGTTCGTCA AGGGCACCAC GAAATGGGTG ATCGACTGCA ATTGGAAGTT CGCGTCCGAA
CAGTTCGCCA GCGACATGTA CCACACCCAG TCGGCGCACG CCTCAGCCCT GCTGGCACTC
ACCGATGATC CCAACCCAAT AGGGCCGCTC AACGACCCCA ACGTGCCCGG GCGTCAGTTC
AGCGGGAACG GGCACGGGTC CGGCGGCTAC TTCCTGCCCG CTCCCGTGGT GAAGACGCCG
GAAATGACCG ACACCATGTT CGAATGGTTC AAGAGTCGTG AGGAGGAGAT GGTCGCACGG
ATCGGGGCCG ACCGGCTGAG CAAGGTGAGC ATCACGCACA ACACCATCTT CCCGAACTTC
TCCTGGCTCG GAGCGCACTC CACCATGCGG GTCTGGCATC CACGCGGGCC CGGGCAGATC
GAAGTCTGGG CGTGGACTTA CGTTCCCAAA GACGCCCCGC CCAAGGTGAA GAACGAGATC
CGCGAGCTCA CCCAACGAAC TTTCAGCCCC GCCGGCTCCT TCGAAACCGA CGACGGTGAG
AACTGGACGG AGATCCAGCA AGTGCTCCGA GGTTCCCAGG CCCGCCGCAA CCGGTTACAT
ACCGCCATGG GTGTCGGCTA CGAAGAGCGC GACGCCTTTG GACTGCCCGG ACTCGGCAAT
GACGTGTACT CCGAGACGGC AGCGCGGGGC TTCTACCGCC ACTGGCTCGA CATGCTGACC
GGAAAGCCGT GGTCGGAGAT TCAGAAATGG ACGCCTAACG GCAATCACGG CGAACTGCGC
GACGAAGGGG TGACCGCATG A
 
Protein sequence
MIDFTDLVDP EHGWVSPQIY TDPEIYEREL QHVFGRSWLF LAHDSQLPKP GSFLQTYMGE 
DPVLVVRQRD GSVRAFLNQC RHRGMRICRS EAGVSKAFTC TYHGWSYDLA GNLINVPLEE
RAYHSSIDKK EWGAMKVPRV ANYRGFYFGT WSEETPEFDA YLGDMAFYFD AIVDRFDSGL
EFVKGTTKWV IDCNWKFASE QFASDMYHTQ SAHASALLAL TDDPNPIGPL NDPNVPGRQF
SGNGHGSGGY FLPAPVVKTP EMTDTMFEWF KSREEEMVAR IGADRLSKVS ITHNTIFPNF
SWLGAHSTMR VWHPRGPGQI EVWAWTYVPK DAPPKVKNEI RELTQRTFSP AGSFETDDGE
NWTEIQQVLR GSQARRNRLH TAMGVGYEER DAFGLPGLGN DVYSETAARG FYRHWLDMLT
GKPWSEIQKW TPNGNHGELR DEGVTA