Gene Mmcs_4626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4626 
Symbol 
ID4113455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4899089 
End bp4900210 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content68% 
IMG OID638033777 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_641786 
Protein GI108801589 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTACCG AACTGTGCGA CCGCTTCGGC ATCGAGTATC CGATCTTCGT GTTCACACCG 
TCGGAGAAGG TCGCCGCCGC CGTCACGCGC GCCGGCGGGA TGGGTGTGCT GGGATGCGTC
CGGTTCAACG ACTCCGACGA CCTCGAGAAC GTCCTGCAGT GGATGGACGA GAACACTCTC
GGCAAGCCGT ACGGCGTCGA CGTCGTCATG CCCGCCAAGA TTCCCACCGA GGGCACCGCC
GTCGACATCA ACAAGCTGAT CCCGCAGACG CATCGGGAGT TCGTCGACAA GACACTCGCC
GACCTCGGGG TGCCGCCGCT GCCGGAGGAC GAGGCCCGCA ACGAAGGAGT GCTGGGCTGG
CTGCATTCGG TGGCTCGCTC ACACGTCGAG GTGGCGCTCA AACACCCGAT CAAGCTGATC
GCCAACGCGC TGGGCTCCCC GCCGAAGGAC GTCATCGACC AGGTGCACGA AGCCGGGGTG
CCCGTCGCGG CGCTGGCCGG TAGCGCCAAA CACGCTCAGC GACATGTCGA CAACGGCGTC
GACATCGTCG TCGCCCAGGG CCACGAGGCC GGTGGGCACA CCGGTGAGAT CGGTTCGATG
GTGCTGTGGC CGGAGATCGT CGACGCGTTG AACGGGCAGG CGCCCGTGCT GGCGGCCGGC
GGCATCGGCA CGGGGCGGCA GGTGGCGGCG GCGCTGGCGC TCGGCGCGTC CGGGGTGTGG
ATGGGTTCGG CGTTCCTGAC GTCGGCGGAA TACGATCTCG GACACCGCAA ACCGAGCGGG
GTGTCGACGA TCCAGGAGGC GATGCTGCGG GCGACGTCCA GCGACACCGT GCGCCGCAGG
ATCTACACCG GTAAACCGGC ACGGCTGTTG AAGACCAAGT GGACCGAGGC GTGGGACGCC
CCCGAGGCGC CGGAACCGTT GCCGATGCCG CTGCAGAACA TCCTCGTCAG CGAGGCGCAT
CAGCGGATGA ACGAGTCGGA CAACCCGGAC ACGGTGTCGA TGCCGGTCGG TCAGATCGTC
GGCCGGATGA ACGAGATCCG CCCGGTCGCC GACATCATCG CCGAACTCGT GTCGGGTTTC
GACGAGGCGT CGAAGCGGCT GGACGGCATC CGCGAGGGCT GA
 
Protein sequence
MRTELCDRFG IEYPIFVFTP SEKVAAAVTR AGGMGVLGCV RFNDSDDLEN VLQWMDENTL 
GKPYGVDVVM PAKIPTEGTA VDINKLIPQT HREFVDKTLA DLGVPPLPED EARNEGVLGW
LHSVARSHVE VALKHPIKLI ANALGSPPKD VIDQVHEAGV PVAALAGSAK HAQRHVDNGV
DIVVAQGHEA GGHTGEIGSM VLWPEIVDAL NGQAPVLAAG GIGTGRQVAA ALALGASGVW
MGSAFLTSAE YDLGHRKPSG VSTIQEAMLR ATSSDTVRRR IYTGKPARLL KTKWTEAWDA
PEAPEPLPMP LQNILVSEAH QRMNESDNPD TVSMPVGQIV GRMNEIRPVA DIIAELVSGF
DEASKRLDGI REG