Gene Mkms_3224 details

Gene Information

Locus tag: Mkms_3224
Symbol:
ID: 4611148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3381010
End bp: 3382080
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 72%
IMG OID: 639792895
Product: dihydroorotate dehydrogenase 2
Protein accession: YP_939208
Protein GI: 119869256
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0167] Dihydroorotate dehydrogenase
TIGRFAM ID: [TIGR01036] dihydroorotate dehydrogenase, subfamily 2
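
The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3381010
END_BP = 3382080


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced, complete CDS)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # 1071 356
```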


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.425254
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGCT ACCACGCGCT GCGGCGGGTG CTGTTCCTGA TCTCCCCCGA GCGCATCCAC 
ACCTGGGTTT TCGCGCTGCT TCGCGCCGTC ACCACACCGG ACCTCCTGCG TCGCGCACTG
CAGGGCCGGC TCGCCCCCCG CGATCCCGTG CTGGCCAGCA CCGTGTTCGG CGTGCGCTTC
CCGGGGCCGC TCGGGCTGGC GGCCGGTTTC GACAAGGACG GTCGCGGCCT GCACACCTGG
CCCGCACTCG GATTCGGGTA CGCCGAGGTG GGCACCGTCA CCGCCCACCC GCAGCCCGGT
AACCCCGAAC CGCGGTTGTT CCGTCTGCCC GAGGACCGGG CGCTGCTCAA CCGCATGGGA
TTCAACAACG ACGGCGCCGC CCGGCTCGCG CAGCGCCTCA CCCGCCACAC CTCCGACGCC
CCCGTCGGCG TGAACATCGG CAAGACCAAG GCGACCCCCG CCGACCGAGC CGTCGAGGAC
TACGCCCAGA GCGCCCGCCA ACTCGGCCCG CTCGCCACCT TTCTCGTGGT CAACGTCAGC
TCGCCGAACA CGCCGGGCCT GCGTGACCTG CAGGCGGTGG AATCGTTGCG GCCCATTCTG
ACCGCGGTGC GCGCCCAGAC CTCGACACCG GTGCTGGTGA AGATCGCCCC CGACCTGTCC
GACGCCGACG TCGACGAGAT CGCCGACCTG GCAGTCGAAC TGGGGTTGGC CGGCATCGTG
GCCACCAACA CCACGATCTC GCGCGCGGGG CTGAAGACCC CCGGCGTCGA AGAGCTCGGC
CCCGGCGGGG TGTCCGGTGC CCCGGTCGCC GCCCGCTCCC TCGAGGTGCT GCGCCGGCTG
TACCGCCGGG CGGGTGACCG GCTGGTGCTG ATCAGCGTCG GGGGTATCGA GACCGCCGAC
GACGCCTGGG AGCGCATCAC CTCGGGCGCC TCACTGCTGC AGGGGTACAC CGGTTTCGTC
TACGGGGGCG GCCTGTGGGC CAAGCACATT CACGACGGGC TGGCGACCCG GCTGCGCGCG
GAGGGCTTCA CCTCACTGTC CGATGCAGTG GGCTCCGCGA TGCGGCAGTG A
 
Protein sequence
MTGYHALRRV LFLISPERIH TWVFALLRAV TTPDLLRRAL QGRLAPRDPV LASTVFGVRF 
PGPLGLAAGF DKDGRGLHTW PALGFGYAEV GTVTAHPQPG NPEPRLFRLP EDRALLNRMG
FNNDGAARLA QRLTRHTSDA PVGVNIGKTK ATPADRAVED YAQSARQLGP LATFLVVNVS
SPNTPGLRDL QAVESLRPIL TAVRAQTSTP VLVKIAPDLS DADVDEIADL AVELGLAGIV
ATNTTISRAG LKTPGVEELG PGGVSGAPVA ARSLEVLRRL YRRAGDRLVL ISVGGIETAD
DAWERITSGA SLLQGYTGFV YGGGLWAKHI HDGLATRLRA EGFTSLSDAV GSAMRQ
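
The relationship between the two sequences can be illustrated with a minimal sketch that translates the first line of the gene sequence and compares it with the start of the protein sequence. Translation table 11 uses the same amino acid assignments as the standard code; the sketch assumes the first codon (here GTG) is the annotated start and therefore reads as methionine. The helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and `gc_fraction` is included only to show how a GC percentage could be computed from a sequence.

```python
# Compact codon table in TCAG order (amino acid assignments shared by
# the standard code and translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is the annotated start, so it is
    reported as M even when it is GTG or another alternative start.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "GTGACCGGCT ACCACGCGCT GCGGCGGGTG CTGTTCCTGA TCTCCCCCGA GCGCATCCAC"
print(translate_cds(first_line))  # MTGYHALRRVLFLISPERIH
```

The printed residues match the first twenty residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would give the value to compare against the reported GC content.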