Gene Mjls_5008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_5008 
Symbol 
ID4880706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5246967 
End bp5248088 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content67% 
IMG OID640142318 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_001073263 
Protein GI126437572 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.498712 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACCG AACTGTGTGA CCGCTTCGGC ATCGAGTATC CGATCTTCGT CTTCACGCCC 
TCGGAGAAGG TCGCGGCCGC CGTCACCCGC GCCGGCGGGA TGGGTGTGCT CGGGTGTGTG
CGGTTCAACG ACTCCGACGA CCTCGAGAAC GTCCTTCAGT GGATGGACGA GAACACTCTC
GGCAAGCCCT ACGGGGTCGA CGTCGTGATG CCCGCGAAGA TCCCGACCGA GGGCACCGCG
GTCGACATCA ACAAGCTGAT CCCGAAGACG CATCGGGAGT TCGTCGACAA GACGCTCGCC
GATCTCGGGG TGCCGCCGCT GCCCGAGGAC GAGGCCCGCA ACGAAGGTGT GCTGGGCTGG
CTGCACTCGG TGGCCAGGTC GCATGTGGAG GTCGGCCTCA AGCATCCGAT CAAGTTGATC
GCCAACGCGT TGGGTTCGCC GCCGAAGGAC GTCATCGACC AGGTGCACGA GGCGGGTGTG
CCGGTCGCGG CACTGGCGGG CAGCGCCAAA CATGCGCAGC GGCATGTCGA CAACGGCGTC
GACATCGTCG TTGCCCAGGG CCATGAGGCC GGTGGGCACA CAGGTGAGAT CGGTTCGATG
GTGCTGTGGC CGGAGATCGT CGACGCACTC GACGGTCGAG CGCCGGTGCT CGCCGCCGGC
GGTATCGGAA CGGGGCGTCA GGTCGCGGCC GCGCTCGCGC TCGGCGCGTC CGGGGTGTGG
ATGGGGTCGG CGTTCCTGAC GGCGGCGGAA TACGATCTCG GACACCGCAA ACCGAGCGGC
GTGTCGACCA TCCAGGAGGC GATGCTGCGC GCCACCTCCA GCGACACCGT TCGCCGGCGG
ATCTACACCG GTAAGCCGGC CCGGCTGCTG AAGACGAAGT GGACCGAGGC CTGGGACGCC
CCCGACGCTC CCGAACCGCT GCCGATGCCG CTGCAGAACA TCCTCGTCAG CGAGGCGCAT
CAGCGGATGA ACGAGTCGGA CAACCCGGAC ACGGTGTCGA TGCCGGTCGG TCAGATCGTC
GGCCGGATGA ACGAGATCCG CCCGGTCGCC GACATCATCG CCGAACTGGT GTCGGGCTTC
GAAGAGGCCT CGAAGAGGTT GGACGGCATC CGCGAAGGCT GA
 
Protein sequence
MRTELCDRFG IEYPIFVFTP SEKVAAAVTR AGGMGVLGCV RFNDSDDLEN VLQWMDENTL 
GKPYGVDVVM PAKIPTEGTA VDINKLIPKT HREFVDKTLA DLGVPPLPED EARNEGVLGW
LHSVARSHVE VGLKHPIKLI ANALGSPPKD VIDQVHEAGV PVAALAGSAK HAQRHVDNGV
DIVVAQGHEA GGHTGEIGSM VLWPEIVDAL DGRAPVLAAG GIGTGRQVAA ALALGASGVW
MGSAFLTAAE YDLGHRKPSG VSTIQEAMLR ATSSDTVRRR IYTGKPARLL KTKWTEAWDA
PDAPEPLPMP LQNILVSEAH QRMNESDNPD TVSMPVGQIV GRMNEIRPVA DIIAELVSGF
EEASKRLDGI REG