Gene Mvan_0463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0463 
Symbol 
ID4648259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp505224 
End bp506678 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content60% 
IMG OID639803971 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_951316 
Protein GI120401487 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.328369 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGATC ACGGTGAGGT GTTGGCGGCT GTACGCACTG GCATGATTCC GGCGCACGTG 
TATAACGACA AGCAGATTTT CTCGCTCGAA AAGGAGCGGC TGTTCAGTCG GGCGTGGTTG
TTCGTGGCGC ACGAGTCGGA GATTCCGCAG CCGGGGGACT ACGTGGTCAG GCAAGTGTTA
CAGGATTCGT TCATCATCGC TCGTGATTCT GCAGGCGAGG TCCGGGTGAT GTTCAATATG
TGCCTCCATC GCGGTATGCA GGTTTGTCGG GCGGAGATGG GGAACGCGTC GAATTTCAGA
TGCCCGTACC ACGGGTGGTC TTACCGCAAC GACGGCCGCA TTATCGGACT GCCTTTTCAC
CAAGAGGCCT ATGGAGGAGA CGCGGGGTTT AACAAGACGG GTCAGACCCT GTTGCCAGCG
CCGAGTGTGG CCAGCTACAA CGGGTTGATC TTTCTGTCGA TGGATCCTGA CGCAGAATCG
CTTGAAGACT ACCTGGGTGA TTTCAGGTTC TATCTCGATT TCTACACCAG GCAGGGCCCC
AACGGTCTCG AGGTGCAAGG TCCTCAGCGT TGGCGGGTAA AAGCGAACTG GAAGATCGCA
GCTGAGAATT TCGCCGGGGA CATGTACCAC ACCCCGCAGA CGCACACGTC GGTGGTCGAG
ATCGGCCTGT TCCGAGAGCC GAAGGCCCAC AAGCGCAAAG ACGGCGCAAC GTATTGGGCG
GGCAGAGGTG GGGGCACCAC ATACAAGCTG CCGGAGGGGA GTTTCGAAGA CCGGATGAGC
TATGTCGGCT ACCCGGCGGA GATGATCAGT CGTGCCAAGG CCACCTGGAC CGAGCAGCAG
CGACAGGTCG TGGGCGCCGA CGGGTTCATG ATCTCGGCGG CGACGTGCTT TCCGAACATC
AGTTTCGTGC ACAACTGGCC GAAAGTGGAG GACGGCGAGC ACGTCTTGCC GTTCATTTCG
ATTCGGGTAT GGCAGCCCAT TAGCGAGAAC GAGACTGAGG TGCTGTCCTG GTTTGCGGTG
GATTCGGATG CCCCGGAAGC GTTTAAGGCG GCCTCATACA AGGCTTATTT GATGTGTTTC
GGCTCGACGG GAATGTTCGA ACAAGACGAT GTCGAGAACT GGGTGTCGCT GACCAACACC
GCGGGGGGTT CCATGGCCCG CCGACTGCGG CTGAACAGCC GGATGGGGCT GCTCGCAGAC
GATGCCCGGG TGGTCGACAC CCTAAGCAGC GCTCAATTCC ACGGGCCTGG ATACGCTCAG
CTCGGCTACA ACGAGAACAA TCAACGGCAA TTGTTGAGGC TCTGGGCCGA CTACCTCGAC
ATGCCGCCGC TGCGAGTCGA CCCGGCTACT GTGCTCACGG ACAACCCGCA AGGTATTGAG
CCGATGGTGC AGACCAACGG CGGGGCCGTC GCCGGTATCG ACTCGGAGTC GGCTACGACG
TCGGTGACGC TGTGA
 
Protein sequence
MQDHGEVLAA VRTGMIPAHV YNDKQIFSLE KERLFSRAWL FVAHESEIPQ PGDYVVRQVL 
QDSFIIARDS AGEVRVMFNM CLHRGMQVCR AEMGNASNFR CPYHGWSYRN DGRIIGLPFH
QEAYGGDAGF NKTGQTLLPA PSVASYNGLI FLSMDPDAES LEDYLGDFRF YLDFYTRQGP
NGLEVQGPQR WRVKANWKIA AENFAGDMYH TPQTHTSVVE IGLFREPKAH KRKDGATYWA
GRGGGTTYKL PEGSFEDRMS YVGYPAEMIS RAKATWTEQQ RQVVGADGFM ISAATCFPNI
SFVHNWPKVE DGEHVLPFIS IRVWQPISEN ETEVLSWFAV DSDAPEAFKA ASYKAYLMCF
GSTGMFEQDD VENWVSLTNT AGGSMARRLR LNSRMGLLAD DARVVDTLSS AQFHGPGYAQ
LGYNENNQRQ LLRLWADYLD MPPLRVDPAT VLTDNPQGIE PMVQTNGGAV AGIDSESATT
SVTL