Gene Mvan_4234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4234 
Symbol 
ID4648942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4543660 
End bp4544889 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content66% 
IMG OID639807701 
Productcytochrome P450 
Protein accessionYP_955017 
Protein GI120405188 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.118496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAGA CGCTTGCGCA GGAAGCGGTG TCGGTTCCCG AGTATCCGAT GGAGCGGACG 
GCGGGCTGCC CGTTCGCGCC CCCGCAGCAG ATGCTCGAGA TGAACCAGGT CAAGCCCCTT
TCCCGCGTGC GGATCTGGAA CGGCACCACG CCCTGGCTCG TCACCGGACA CGAGGTCGCG
CGCACGCTGT TCGCCGATTC CAGGGTGAGT GTGGACGACC GTCGGGAGGG CTTTCCGCAC
TGGAACGAGC ACATGCTGTC CACCGTGGAC AAGCGACCGC GGTCGGTGTT CACCTCCGAC
GCCGAGGAGC ACACGCGGTT CCGCCGGATG CTGTCCAAGC CGTTCACCTT CCGGCGTGTC
GAGGCGCTGC GTCCGGTGAT CCAGCAGGTC ACCGACGAGT GCATCGACGA GATTCTGGCG
GGTCCACAAC CGGCCGACAT GGTCGCCAAG CTCGCTCTTC CGGTGCCAAC CCGGGTGATC
TCCGACATGC TCGGGGTGCC GTACGAGGAT CACGAGTTCT TCCAGGAGCA CGCGAACGCC
GGCCTGGCCC GCTATGCCGC CGCGGACGCG ATGCAGAAAG GTGCGATGAG CCTGCACCAG
TACCTGATCA ACCTCGTCGA GGAGAAGCAG GCCCACCCGG CCGAGGACGC GGTGTCCGAT
CTGGCCGAGC GTGTCACCGC CGGTGAGATC AGTGTCAAGG AAGCGGCCCA GTTGGGGACC
GGATTGCTCA TCGCCGGGCA CGAGACCACC GCCAACATGA TCGGCATCGG AATCTGCGCT
CTGCTGGAGA ACCCCGAACA GGCAGCGCTG CTTCGCGATT CGGACGACCC GAAGTTCATC
GCCAACGCCG TCGAAGAGCT GATGCGCTAC CTGTCGATCA TCCAGAACGG ACAGCGGCGC
GTGGCCACCG AGGACATCGA GATCGGCGGC GAGACCATCC GGGCGGGGGA GGGCATCATC
CTCGATCTGG CGCCGGCCAA CTGGGATGCG CGGGCGTTTC CCGAGCCCGA CAAACTCGAC
CTCACCCGGG ATGCCACCCA GCAACTCGGC TTCGGCTACG GCCGTCATCA GTGCGTCGGT
CAGCAATTGG CGCGCGCCGA ACTGCAGATC GTGTTCCACA CCCTGCTGCG CCGCATCCCG
ACGATGAAGC CGGCCATTCC GCTCGAGGAG GTGCCGTTCA AACACGACCG GCTCGCCTAC
GGCGTCTACG AACTACCGGT GACCTGGTAA
 
Protein sequence
MTETLAQEAV SVPEYPMERT AGCPFAPPQQ MLEMNQVKPL SRVRIWNGTT PWLVTGHEVA 
RTLFADSRVS VDDRREGFPH WNEHMLSTVD KRPRSVFTSD AEEHTRFRRM LSKPFTFRRV
EALRPVIQQV TDECIDEILA GPQPADMVAK LALPVPTRVI SDMLGVPYED HEFFQEHANA
GLARYAAADA MQKGAMSLHQ YLINLVEEKQ AHPAEDAVSD LAERVTAGEI SVKEAAQLGT
GLLIAGHETT ANMIGIGICA LLENPEQAAL LRDSDDPKFI ANAVEELMRY LSIIQNGQRR
VATEDIEIGG ETIRAGEGII LDLAPANWDA RAFPEPDKLD LTRDATQQLG FGYGRHQCVG
QQLARAELQI VFHTLLRRIP TMKPAIPLEE VPFKHDRLAY GVYELPVTW