Gene Mkms_0720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_0720 
Symbol 
ID4614920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp764621 
End bp765745 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content70% 
IMG OID639790395 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_936726 
Protein GI119866774 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0155051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAACCG CGATCTGTGA ACAGTTCGGC ATCGACTTCC CCCTCTTCGC GTTCAGCCAC 
TGCCGCGACG TCGTCGCCGC CGTGACCAAC GCCGGCGGGT TCGGCGTCCT CGGGGCGACC
GCCTACACCC CCGAGCAGCT GGACCGCGAG CTCAGCTGGA TCGACGAGCA GGTGGGCGGG
AAGCCCTACG GTGCGGACAT CATCGTGCCG GCCAAGTTCG AGGGCAAGGG TGAGAACCTC
ACGCGTACCG ACCTGGTCGA CCGCATCCCC GCCGAGTACC GCGAGTTCGT CACCCAGCTG
CTGGCCGACC ACGACATCGA ACTCGACAGT GCGCCGAAGC TCGGCGGCTC GTCGCTGTCC
GGTGACACCG GCCGCGAACT GCTCGACGTC GCGATGAGCC ACCCGATCAA GCTGATCGCC
AACGCGCTCG GCGTCCCGCC GGACTACATG ATCGAGGCGG GCCGCGAACG GGGCGTGCCG
GTGGCGGCGC TCGTCGGCGC GCGCGAACAC GCGGTCAAAC AGGTCGCCGC CGGTGTCGAC
CTCATCGTCG CGCAGGGCAC CGAGGCCGGC GGACACTGCG GTGAGGTGAC GACGCTGGTG
CTGATCCCCG AGGTCATCGC GGCCGTCGAG GAGACCGGCA CCAAGGTTCC GGTGCTGGCC
GCCGGCGGCA TCGTCACCGG ACGGCAGATG GCCGCGGCCG TGGCGATGGG CGCCGACGGG
GCGTGGACCG GTTCGGTGTG GCTGACCACC GAGGAGGCCG AGACCGCCCC GCACACCGTG
CAGAAGATGC TGGCGGCGAC GTCGCGGGAC ACCGTGCGGT CGGCGGGCCG GACCGGTAAA
CCCGCCCGGC AGTTGGTGTC GGACTGGACG CAGGCCTGGG CGCCGCGCGG TGATCGGCGG
CCGCTGCCGT TACCCCTGCA GAACATGCTC GCCGAACCCC TGTTGCGCAG GATCGACGTC
CTCGCCGCGC AGGGGCATCC GGGTGCGCAG CAGCTGGCGA CCTACTTCGT CGGCCAGGGG
GTGGGGCTGA TGAACAAGGT CAAACCCGCG CGTGAGGTGG TGCGCGAGTT CATCGAGGAC
TATCTGTCCG CCGCGGAGCG GTTGAACAAC TCGCTGCCGG ACTGA
 
Protein sequence
MKTAICEQFG IDFPLFAFSH CRDVVAAVTN AGGFGVLGAT AYTPEQLDRE LSWIDEQVGG 
KPYGADIIVP AKFEGKGENL TRTDLVDRIP AEYREFVTQL LADHDIELDS APKLGGSSLS
GDTGRELLDV AMSHPIKLIA NALGVPPDYM IEAGRERGVP VAALVGAREH AVKQVAAGVD
LIVAQGTEAG GHCGEVTTLV LIPEVIAAVE ETGTKVPVLA AGGIVTGRQM AAAVAMGADG
AWTGSVWLTT EEAETAPHTV QKMLAATSRD TVRSAGRTGK PARQLVSDWT QAWAPRGDRR
PLPLPLQNML AEPLLRRIDV LAAQGHPGAQ QLATYFVGQG VGLMNKVKPA REVVREFIED
YLSAAERLNN SLPD