Gene Mvan_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4110 
Symbol 
ID4648869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4411459 
End bp4412469 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content68% 
IMG OID639807577 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_954893 
Protein GI120405064 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000291648 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.122878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTTT CCACCCGCTA TCTGGGTCTG AACCTGCGCA ATCCGCTCGT CGCGTCGGCC 
TCCCCGCTGT CAAAGACCGT CGACGGCGTG CGGCGCCTGG CCGATGCCGG AGTCGGTGCG
GTGGTCCTCT ATTCACTGTT CGAAGAACAG GTGCGGCGCG AAGCCCGGCA GAACGCGCGG
TTGGCCGAAG CAGGCACGGA CAGCTTCGCC GAGTCGCTGT CGTACTTTCC GGAAGGCACC
GGCGTCGACC ATGGCCCGCG CCGTTACCTG AGCCTGCTCG AGCGCGCGGC CGAAGCGGTC
GCGATTCCGG TGATCGGCAG CATCAACGCC AGTACACCCG GCAGCTGGGC CGGGTACGCC
CGATCGATCC AGGATTGCGG GGCGGCCGCC GTCGAGTTGA ACGTCTACCT TCTGCCCGGT
GACGCGCACC TGTCCGGACG CGACGTCGAG CAGCGTCACC TCGACATCCT GGCCCGGGTC
AAGGACGCGG TGACCATTCC GGTGGCGGTG AAGCTCAGCC CGTACTTCAG TGCGACGGCT
GACATGGCCC GGCGGCTGGA CCTGGCGGGC GCGGACGGAC TGGTGTTGTT CAACCGGTTC
CTGCAGCCCG ACATCGACAC CGAGACGTTG GCGGTGGCGC CGGGGATCCT GTTGTCCCGG
GCCGCGGAAG TCCGACTACC GCTGACATGG ATCACGTTGC TGCACGGACG GATCGGGGCT
TCGCTTGCTG CGACCACCGG CGTGGAGGGC CCGGTCGAAC TGATCAAGTA CCTGCTGGGG
GGCGCGGATG TGGTGATGAC CGCGTCGGCG CTTCTGCGCC ACGGCCCGGA CTACGCCGCC
GTCCTGCTCG ACGGGTTACG GCACTGGATG TCCCGCAAGG GATACGCCGC GATCAACGAT
TTCCGCGGGC TGCTCGCAGT GCCGATCGGG ACGGACGAGG CGGCCCACGA ACGGGTGAAC
TACGTCAGCG CGTTGCGGCA GGCCAACAGC GGCGACTACG GACCATGGTA A
 
Protein sequence
MDLSTRYLGL NLRNPLVASA SPLSKTVDGV RRLADAGVGA VVLYSLFEEQ VRREARQNAR 
LAEAGTDSFA ESLSYFPEGT GVDHGPRRYL SLLERAAEAV AIPVIGSINA STPGSWAGYA
RSIQDCGAAA VELNVYLLPG DAHLSGRDVE QRHLDILARV KDAVTIPVAV KLSPYFSATA
DMARRLDLAG ADGLVLFNRF LQPDIDTETL AVAPGILLSR AAEVRLPLTW ITLLHGRIGA
SLAATTGVEG PVELIKYLLG GADVVMTASA LLRHGPDYAA VLLDGLRHWM SRKGYAAIND
FRGLLAVPIG TDEAAHERVN YVSALRQANS GDYGPW