Gene Mvan_5214 details

Gene Information

Locus tag: Mvan_5214
Symbol:
ID: 4644315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5583157
End bp: 5584278
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 68%
IMG OID: 639808689
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_955991
Protein GI: 120406162
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
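
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5583157
end_bp = 5584278
gene_length_bp = 1122
protein_length_aa = 373

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS holds one 3 bp codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(span == gene_length_bp)             # True
print(expected_cds_bp == gene_length_bp)  # True
```

Both checks agree with the reported 1122 bp gene length and 373 aa protein length.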


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.149834
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACCG AACTCTGCGA TCGCTTCGGG ATCGAGTATC CGCTGTTCGT CTTCACCCCG 
TCGGAGAAGG TCGCCGCGGC GGTGAGCAAG GCCGGCGGTC TCGGCGTGCT GGGGTGCGTG
CGGTTCAACG ACGCCGACGA CCTGGAGAAC GTCCTGCAGT GGATGGACGC CAATACCGAC
GGCAAACCGT ACGGCGTCGA CATCGTGATG CCGGCCAAGG TCCCGACCGA GGGCACCTCG
GTCGACATCA ACAAACTCAT CCCGGCCGAG CATCGCGAGT TCGTCGACAA GACGCTCGCC
GACCTCGGGG TGCCGCCGCT GCCCGCCGAT GAAGAGCGCT CCGAAGGCGT TCTGGGATGG
CTGCATTCGG TGGCGCGCAG CCACGTCGAG GTGGCGCTCA AGCACCCGAT CAAGCTGATC
GCCAACGCGC TGGGCTCACC GCCCAAGGAC GTCATCGACC AGGCGCACCA GGCCGGTGTC
CCGGTGGCGG CGCTGGCCGG ATCCGCCAAA CACGCGCTGC GCCATGTGGA CAACGGTGTA
GACATCGTCG TTGCGCAGGG GCACGAGGCC GGTGGGCACA CCGGTGAGAT CGGCTCGATG
GTGCTGTGGC CCGAGATCGT CGATGCCATC GAGGGCAGGG CCCCGGTGCT CGCCGCCGGC
GGTATCGGCA CCGGACGGCA GGTGGCCGCC GCGCTCGCCC TTGGCGCGCA AGGCGTGTGG
ATGGGTTCGG CATTCCTGAC CTCGGCCGAG TACGACCTGG GTGTGCGCCT GCCGTCAGGC
CGGTCGGTGG TCCAGGAGGC CATGCTCAAC GCCACGTCCG CGGACACGGT GCGCCGCCGC
ATCTACACCG GCAAGCCGGC GCGGCTGCTC AAGAGCCGCT GGACCGACGC GTGGGACGCC
GACGGCGCTC CCGAGCCGCT GCCGATGCCG CTGCAGAACA TCCTGGTCAG CGAGGCCCAT
CAGCGGATGA GCGAGAACAG CGATCCGACG GCCGTCGCGA TGCCGGTCGG TCAGATCGTG
GGCCGGATGA ACGAGATCCG CCCGGTCGCC GACATCATCG CCGAGCTGGT GAGCGGATTC
GAGGAAGCCA CCAGGCGGTT GGACGCTATT CGCGACAACT GA
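
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below is one way to do it, assuming the sequence is pasted in as text with the 10-base grouping and line breaks shown above; `gc_percent` is a hypothetical helper name, not something the database provides.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq."""
    # Drop spaces, newlines, and anything else that is not a nucleotide letter.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_percent("ATGAAGACCG"))  # 50.0
```

Passing the full gene sequence gives the value to compare against the reported GC content, which the record appears to round to a whole percent.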
 
Protein sequence
MKTELCDRFG IEYPLFVFTP SEKVAAAVSK AGGLGVLGCV RFNDADDLEN VLQWMDANTD 
GKPYGVDIVM PAKVPTEGTS VDINKLIPAE HREFVDKTLA DLGVPPLPAD EERSEGVLGW
LHSVARSHVE VALKHPIKLI ANALGSPPKD VIDQAHQAGV PVAALAGSAK HALRHVDNGV
DIVVAQGHEA GGHTGEIGSM VLWPEIVDAI EGRAPVLAAG GIGTGRQVAA ALALGAQGVW
MGSAFLTSAE YDLGVRLPSG RSVVQEAMLN ATSADTVRRR IYTGKPARLL KSRWTDAWDA
DGAPEPLPMP LQNILVSEAH QRMSENSDPT AVAMPVGQIV GRMNEIRPVA DIIAELVSGF
EEATRRLDAI RDN
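
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch using only the Python standard library. It builds the codon table from the compact amino acid string used for NCBI translation table 11, which assigns the same amino acids as the standard code. It does not handle alternative start codons, which does not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order: first base varies slowest, third base fastest, over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAGACCG AACTCTGCGA TCGCTTCGGG"))  # MKTELCDRFG
```

The last four codons of the gene sequence, CGC GAC AAC TGA, translate to RDN followed by a stop, matching the end of the protein sequence.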