Gene Mvan_4389 details

Gene Information

Locus tag: Mvan_4389
Symbol:
ID: 4648728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4710077
End bp: 4711099
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 73%
IMG OID: 639807860
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_955171
Protein GI: 120405342
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
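
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed 1023 bp) and that the final codon is a stop codon. The function names are illustrative and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(4710077, 4711099)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the coordinates from this record, the two values agree with the listed gene length and protein length.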


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.537564
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCTTCG ACGTCCGGGA CCTGTCCGTG CCGGTGCTCG TCGCCCCGAT GGCAGGCGGA 
CCCTCGACGC CCGAGCTTGC GGCGGCGGGC ACGAACGCCG GCGGCCTGGG CTTCGTCGCC
GCCGGTTACC TGACGGCCGA CGTGTTCGCC GAACGCGTGC GGGCCGCGCA ACGGCTGACC
AGCGGGCCCC TCGGGGTGAA TCTCTTTGTG CCGCAACCCA GTGCCGGCAC TCCTGCGGCG
GTCGCGGCCT ATGCGGAGAG GTTGGCGGAG GAGGCGCGAC GTTACGGCAC CGAGCTCGGC
GCCCCCCGGT TCGACGACGA CCACTGGAAC GCCAAGCTCG AGGTGGTGCT GGACCTGAGG
CCCGCGCTGG CGTCGTTCAC GTTCGGGCTG CCCACCGTCG AGGAGCGGCG CCGTCTCAGC
GCGGCCGGAA TCGCCACGGC GGCAACGGTG ACCACGCCGG CCGAAGCGCG GCTGGCCGCC
GACTGCGGCG TCGACATCCT GGTGGCACAG GGCCCGTCGG CGGGCGGGCA CCGCGGGACC
TTCGACCCGA CCGCGACGCC CTCCGGGCAG CCACTGGACG AACTGCTGGC CGCGGTCACG
GCCGACCACG CGATCCCCGT CGTCGCGGCG GGCGGCTTGA TGACCGCCAC CGATATCCGC
CGGGTCCGGC AGGCGGGTGC GGCCGCCGCA CAACTCGGCA CCGCCTTCCT GCTGTCCGAC
GAGGCGGGCA GCAGCCCGGT GCACCGGGCC GCGCTGATCG ACCCGCAGTT CACCGAAACG
GCTGTCACGA AAGCGTTTTC CGGACGGTAC GCGCGGGGAC TGCGCAACCG GTTCATCGTC
GAGCACGAAG CGGAGGCGCC GTTCGGTTAC CCCGAGGTGC ATTACCTGAC CAGCCCGCTT
CGGGCCGCGG CGGTACGGGC CGGCGATCCG CAGGCGGTCA ACATCTGGGC CGGCACCGGG
TTCCGGCAGG CCGGCGGCGG TTCGGTACGC GACATCATGG ACACGTTGAT CGGTCGAGAC
TGA
 
Protein sequence
MSFDVRDLSV PVLVAPMAGG PSTPELAAAG TNAGGLGFVA AGYLTADVFA ERVRAAQRLT 
SGPLGVNLFV PQPSAGTPAA VAAYAERLAE EARRYGTELG APRFDDDHWN AKLEVVLDLR
PALASFTFGL PTVEERRRLS AAGIATAATV TTPAEARLAA DCGVDILVAQ GPSAGGHRGT
FDPTATPSGQ PLDELLAAVT ADHAIPVVAA GGLMTATDIR RVRQAGAAAA QLGTAFLLSD
EAGSSPVHRA ALIDPQFTET AVTKAFSGRY ARGLRNRFIV EHEAEAPFGY PEVHYLTSPL
RAAAVRAGDP QAVNIWAGTG FRQAGGGSVR DIMDTLIGRD
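
Both sequences are printed in space-separated blocks of ten characters. The sketch below shows one way the first line of each could be cleaned up and cross-checked. The codon table is deliberately partial (only the codons found in the first ten positions of this gene) and the helper names are hypothetical. In translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position, which is why the protein begins with M although the gene begins with GTG.

```python
# First line of the gene and protein sequences, copied from this record.
GENE_LINE = "GTGAGCTTCG ACGTCCGGGA CCTGTCCGTG CCGGTGCTCG TCGCCCCGAT GGCAGGCGGA"
PROTEIN_LINE = "MSFDVRDLSV PVLVAPMAGG PSTPELAAAG TNAGGLGFVA AGYLTADVFA ERVRAAQRLT"

# Partial codon table: only the codons needed for this example.
CODONS = {
    "GTG": "V", "AGC": "S", "TTC": "F", "GAC": "D", "GTC": "V",
    "CGG": "R", "CTG": "L", "TCC": "S",
}
# A subset of the table 11 start codons, read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def strip_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display formatting."""
    return "".join(text.split())


def translate_prefix(dna: str, n_codons: int) -> str:
    """Translate the first n_codons codons, treating codon 1 as a start."""
    residues = []
    for i in range(n_codons):
        codon = dna[3 * i: 3 * i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


dna = strip_blocks(GENE_LINE)
protein = strip_blocks(PROTEIN_LINE)
prefix = translate_prefix(dna, 10)
print(prefix, prefix == protein[:10])
```

A full check of the record would need a complete codon table; a library such as Biopython provides one, but that is outside this sketch.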