Gene Mvan_3005 details

Gene Information

Locus tag: Mvan_3005
Symbol:
ID: 4648533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 3172711
End bp: 3173841
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 67%
IMG OID: 639806485
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_953816
Protein GI: 120403987
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
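
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 3172711
END_BP = 3173841
GENE_LENGTH_BP = 1131
PROTEIN_LENGTH_AA = 376


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Codon count minus one, assuming the CDS ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_len = span_length(START_BP, END_BP)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # prints "1131 376"
```

Both derived values agree with the Gene Length and Protein Length fields of the record.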


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.558356
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATACTC CCCTGTGCGA CGAGTTGGGC ATCGAGTTCC CTATCTTCGC TTTCACCCAC 
TGCCGCGACG TGGTCGTCGC CGTCAGCAAG GCGGGTGGTT TCGGCGTGCT CGGCGCGGTC
GGGTTCACGC CTGAACAGCT CGAGATCGAG CTGAACTGGA TCGACGAGAA CATCGGCGAC
CACCCCTACG GCGTGGACAT CGTGATCCCG AACAAGTACG AGGGCATGGA CTCGAACATG
TCGGCCGACG AACTCAAGTC GACGCTCAAC GCGCTCGTTC CGCAGGAGCA CCTGGACTTC
GCGAAGAAGA TCCTCGCCGA CCACGGCGTG CCCACCGACG ACAGCGACGA CAACGCGCTG
CAGCTGCTCG GCTGGACCGA GGCCACCGCC ACCCCGCAGG TCGAGGTCGC GTTGCGGCAC
CCGAAGATGA CTCTGATCGC CAACGCGCTC GGCACCCCGC CCAAGGACAT GATCGAGCAC
ATCCACGCCG AGGGGCGCAA GGTCGCCGCG CTGTGTGGCT CGCCGTCACA GGCGCGCAAG
CACGCCGACG CCGGGGTGGA CATCATCATC GCCCAGGGCG GTGAGGCCGG TGGACACAGC
GGTGAGGTCG GTTCCATCGT GCTCTGGCCG CAGGTCGTCA AGGAGGTGGC GCCGGTGCCG
GTGCTGGCCG CCGGTGGCAT CGGCAGCGGT CAACAGATCG CCGCGGCGCT CGCGCTCGGC
GCGCAGGGCG CGTGGACGGG CTCCCAGTGG GTGATGGTCG AGGAATCGGA GAACACCCCG
GTCCAGCACG CCGCTTACGC GAAGGCCACC AGCCGCGACA CCGTGCGCAG CCGGTCGTTC
ACCGGAAAGC CGGCACGCAT GCTGCGCAAC GACTGGACCG AGGCCTGGGA GAACCCGGAG
AACCCCAAGC CGCTCGGAAT GCCGCTGCAG TACATGGTTT CCGGGATGGC CGTGGCTGCG
ACGCACAAGT ACCCCAACGA GACCGTCGAC GTCGCGTTCA ACCCGATCGG CCAGGTCGTC
GGACAGTTCA CCAAGGTGGA GAAGACCGCG ACCGTCATCG AGCGCTGGGT GCAGGAGTAC
CTGGAGGCGA CCAACACGCT CAACGAGCTC AACGAGGCCG CCAGCGTATA G
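
The gene sequence above is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one possible way to strip the block formatting and compute a GC fraction; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCATACTC CCCTGTGCGA CGAGTTGGGC ATCGAGTTCC CTATCTTCGC TTTCACCCAC"
bases = clean_sequence(first_line)
print(len(bases))  # prints 60
print(round(gc_content(bases), 2))
```

Passing all sequence lines joined together to `clean_sequence` would give the full-length value to compare against the GC content field.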
 
Protein sequence
MHTPLCDELG IEFPIFAFTH CRDVVVAVSK AGGFGVLGAV GFTPEQLEIE LNWIDENIGD 
HPYGVDIVIP NKYEGMDSNM SADELKSTLN ALVPQEHLDF AKKILADHGV PTDDSDDNAL
QLLGWTEATA TPQVEVALRH PKMTLIANAL GTPPKDMIEH IHAEGRKVAA LCGSPSQARK
HADAGVDIII AQGGEAGGHS GEVGSIVLWP QVVKEVAPVP VLAAGGIGSG QQIAAALALG
AQGAWTGSQW VMVEESENTP VQHAAYAKAT SRDTVRSRSF TGKPARMLRN DWTEAWENPE
NPKPLGMPLQ YMVSGMAVAA THKYPNETVD VAFNPIGQVV GQFTKVEKTA TVIERWVQEY
LEATNTLNEL NEAASV