Gene Mvan_3781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3781 
Symbol 
ID4645144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4023864 
End bp4025084 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content66% 
IMG OID639807246 
Productcytochrome P450 
Protein accessionYP_954569 
Protein GI120404740 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTAG GACACACCGA TCTGGGCCTG GCCCTGTTCC ACGACGAGTT CCTTCAGGAT 
CCGCATCCGC TCTACGCCCG GATGCACGCC GAGGCTCCGG TGCACCGGGT GGGCGATTCG
GACTTCTACG CGGTCTGCGG ATGGGACGCC ATCACCGAGG CCGTCGCACG CCCAGGCGAC
TTCTCGTCGA ATCTGACTGG CGCGATGCGG TATCAGGCCG ACGGCACTGT GAGCCTTCTG
CCCCTGGACA CCCTGGGCAG TCCCACGCAG GCGCTCGCCA CAGCCGACGA TCCCGCGCAC
GCCGCCCACC GGAAATTGCT GGTGCCTTAT CTGGCCGCAA AACGCGTCCG CGCCCTGGAG
ACCTTCGCCG AGAACACCAT GCACGCGTTG TGGCAACAGA TGCCTGCCGG CGCACCTGTG
GAATGGATGG CTGTGGTAGC CGATCGCCTG CCGATGACGG TCGTCTGCAA GCTGATCGGT
GTTCCCGCCG AGGACGTCGA TCGGATCGCG GCATGGGCCT ATGCCAGTAC CCAGATCCTT
GAAGGGCGTG TGGACGAGCA CACGCTGACC GCCGCAGGCA CCGCAGCTCT GGAGTTGGCC
GGCTACATCT CTGACAAGCT GGGCCGGGCA TCGCTCGATC CCGGCGATAA CCTACTGGGC
TGTATCGCCA CCGCCTGCGC GGCAGGAGAA TTGAACAACT TGACGGCGCA GGTGATGCTC
GTGACGCTGT TCAGTGCGGG TGGCGAGTCG ACCGCCGCGC TCATCGGATC GGCCACGCAG
ATCATCGCAA CCCGTCCCGA CGTCCAACGG CGCCTGCGCG CGGACCCGGG CTTGATTCCC
GCCTTTCTGG AAGAGGTGTT GCGCTTTGAG CCGCCGTTCC GCGGGCACTA CCGCCATGTA
GTCAGGGATT GCGCGCTCGC CGGCAAAGAG CTCACCGCGG GCTCTCGCCT ACTTCTGTTG
TGGGGTGCCG CCAATCGGGA CCCGGCGCGT TTTGACGCGC CGAATGAGTT TCAGATCGAA
CGCCCGAACA GCAAGGCACA CATCGCGTTT GGCAAAGGCG CCCACTTCTG CATCGGCGCC
GCCCTGGCGC GGTTCGAGGC CAGGATCGTC ATCGACCTCC TGCTGCGGCA CACGTCGTGG
GTCGATGCGG CTGGACCGGG GTGGTGGTTG CCGAGCCTGC TCGTCCGCCG ACTCGACGAG
CTGCCGTTGA CGATGACGTA A
 
Protein sequence
MTVGHTDLGL ALFHDEFLQD PHPLYARMHA EAPVHRVGDS DFYAVCGWDA ITEAVARPGD 
FSSNLTGAMR YQADGTVSLL PLDTLGSPTQ ALATADDPAH AAHRKLLVPY LAAKRVRALE
TFAENTMHAL WQQMPAGAPV EWMAVVADRL PMTVVCKLIG VPAEDVDRIA AWAYASTQIL
EGRVDEHTLT AAGTAALELA GYISDKLGRA SLDPGDNLLG CIATACAAGE LNNLTAQVML
VTLFSAGGES TAALIGSATQ IIATRPDVQR RLRADPGLIP AFLEEVLRFE PPFRGHYRHV
VRDCALAGKE LTAGSRLLLL WGAANRDPAR FDAPNEFQIE RPNSKAHIAF GKGAHFCIGA
ALARFEARIV IDLLLRHTSW VDAAGPGWWL PSLLVRRLDE LPLTMT