Gene Mflv_0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0037 
Symbol 
ID4971660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp33741 
End bp34877 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content71% 
IMG OID640454243 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_001131321 
Protein GI145220643 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.452006 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATGA CGTCCACAGC GCTGTGTGAA CAGTTCGGCA TCGACTTCCC GCTCTTCGCG 
TTCAGCCACT GTCGCGACGT GGTGGCCGCG GTGACCAATG CCGGCGGCTT CGGCGTGCTC
GGCGCCACCG CGTACTCGCC CGACCAGCTC GACCAGGAAC TGGCCTGGAT CGACGAGGCC
GTCGGCGGCA GGCCCTACGG CGTGGACCTC ATCGTCCCGG CGAAGTTCGA GGGCAAGGGC
GAGAAGCTGT CCAGCTCCGA TCTCGCGGCG CGCATCCCGC AGACCTACAA GGACCTCGTC
GATGAGCTGC TGCGCAAGCA CGACATCGAG CCCGAGCCCG AGCGGCGTAT CGGCAAGCCC
ATGCTGTCCG GCAACACCGG ACGTGAGCTG CTCGACGTCG CGCTCACCCA TCCGGTCAAG
CTGATCGCCA ACGCCCTCGG CGTCCCGCCG GACTACATGA TCGAGGCCGG CAAGGAACGC
GGCATCCCGG TCGCGGCGCT CGTCGGCGCC AAGGAGCACG CGGTCAAGCA GGCCGCCGCC
GGCGTGGACC TGATCGTCGC GCAGGGCACC GAGGCGGGCG GACACTGCGG TGAGGTCAGC
ACGCTCGTCG TGGTGCCCGA GGTCTTGGAG GGTCTGGCGG CGCTCGGCGT GTCCACCCCG
GTGCTCGCGG CCGGCGGCAT CGTCACCGGA CGCCAGATGG CGGGCATGGT CGCGATGGGC
GCCTCCGGGG CGTGGACGGG GTCGGTGTGG CTGACCACCG AAGAGGCCGA GACCGCACCG
CACACCGTGG CCAAGATGCT GGCCGCGACG TCACGCGACA CCGTGCGCTC GGCGGGCCGT
ACGGGCAAGC CGTCACGGCA GCTGGTGTCG GACTGGACGA AGGCGTGGGC GCCGTCGAAG
GACGGGGAGC AGCCGCTGGG CCTGCCGCTG CAGTCGATGC TGTGCGAGCC GGTGATCCGC
CGCATCGACG TGCTGGCCTC GCAGGGCCAC GAGGGTGCGC AGGCGCTCGC GACGTACTTC
GTCGGGCAGG GCGTCGGGCT GATGAACAAG GTGAAGCCGG CCCGCGAGGT CGTCCGCGAG
TTCATCGAGG ACTACCTCGC CGCCGCCGAG CGCCTCAGCA GCTCTCTGCC GGGCTGA
 
Protein sequence
MSMTSTALCE QFGIDFPLFA FSHCRDVVAA VTNAGGFGVL GATAYSPDQL DQELAWIDEA 
VGGRPYGVDL IVPAKFEGKG EKLSSSDLAA RIPQTYKDLV DELLRKHDIE PEPERRIGKP
MLSGNTGREL LDVALTHPVK LIANALGVPP DYMIEAGKER GIPVAALVGA KEHAVKQAAA
GVDLIVAQGT EAGGHCGEVS TLVVVPEVLE GLAALGVSTP VLAAGGIVTG RQMAGMVAMG
ASGAWTGSVW LTTEEAETAP HTVAKMLAAT SRDTVRSAGR TGKPSRQLVS DWTKAWAPSK
DGEQPLGLPL QSMLCEPVIR RIDVLASQGH EGAQALATYF VGQGVGLMNK VKPAREVVRE
FIEDYLAAAE RLSSSLPG