Gene Mvan_3992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3992 
Symbol 
ID4647601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4266214 
End bp4267503 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content64% 
IMG OID639807454 
Producthypothetical protein 
Protein accessionYP_954775 
Protein GI120404946 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCG ACGAGTATCT GGCCGCAACA TCCGACAACA AGCCCGACCA CTGCGATATC 
GACATATCGC GCAAGGCGTT CTGGGCCCGC CCTGCCGACG AGAAGATGGA GATCTTCGCA
GCGCTGCGCG CCGAGCATCC GGTGTCGTGG CAGCGTCCGA TCGAAGGCGC CGTGGTACCT
GACCCTGACG ACCCCGGGTT CTGGGCGGTC GTCCGCCATG CGGACATCAG ACAGGTCAGC
CACGACAATC GGACCTTCAT CTCGGGGCAG GGCGTCATGT TCGACCGCCT CCCCCCGCTG
TTCCTTGAGA TGGCGCTGTC GTTTCTGGCG ATGGACCCGC CCCGCCACGA CAAGCTGCGC
CGGCTGGTCA ACAAGGCGTT CACGCCCCGG CAGATCACCC GCATCGAGGA CAAGATCGAC
CTCGTCGCCA AGGAGGTCGT CGACGGGTTC ATCGCCGATC CCACCGGCGA GATCGAGTTC
ATGGATCGCT GTGCGAGTCG GCTCCCCCTG CGGATGTTCT GTGAGATGTT CGGCGTGCCC
GATCATCTCG AGGAGCGGAC CGTCGAGCAC GTCATCGGCT CGGTGTCCTG GTCGGACGAG
GAGATCCTGG CTGGGCGCAG CGCCGCCGAC ATGCAACTCG AATCGATCGG CGGGCTGCAC
CAGGTGGCGC AGGAGATCAT CGAAATGCGC CGGCAACAGC CGGGGGACGA CATCATTTCC
TCACTGGTGC ACGCTGAGAT CGACGGACAG ATGCTCACCG ACTTCGAGAT CTCGTCGTTC
TTCTGCCTGT TGTCCGGCGC GGCCACCGAC ACCACCAAGA CGACGCTGGG TCACGCTGTG
CGTGCGCTGT CGCTGTTCCC CGAGCAGCAC GCGTGGCTGC TCGAGGACTT TGACAATCGC
ATCGGCACCG CGGTAGAGGA GTTCATCCGT TGGGCCACAC CACTGTTGAC GTTCACCCGC
ACAGCCGCCG TGGAAACCGA GCTGGGCGGC CGGCGAATCA TGCCGGGTGA CAGGGTGGTG
ATGATCTACC AGGCAGGCAA TTTCGACGAA GACGTATTCG ACCGCCCGCG CGAGCTCGAT
CTGGCGCGGT CCCCCAATCC GCATGTGAGT TTCGGTGGGG GCGGCGTGCA CTATTGCCTC
GGAGCCAATC TGGCCCGGTC GATGCTGCGC GCCGAACTGC GCGAGTTGCT GCTTCGGATA
ACCGAATTCG AGACTGACGC GCCGGACATG CTGGCCACCA ATTTCATTCA CGGCCTGAAG
CGACTGCCCT TCCGGTTCAC GCCGCAATGA
 
Protein sequence
MQIDEYLAAT SDNKPDHCDI DISRKAFWAR PADEKMEIFA ALRAEHPVSW QRPIEGAVVP 
DPDDPGFWAV VRHADIRQVS HDNRTFISGQ GVMFDRLPPL FLEMALSFLA MDPPRHDKLR
RLVNKAFTPR QITRIEDKID LVAKEVVDGF IADPTGEIEF MDRCASRLPL RMFCEMFGVP
DHLEERTVEH VIGSVSWSDE EILAGRSAAD MQLESIGGLH QVAQEIIEMR RQQPGDDIIS
SLVHAEIDGQ MLTDFEISSF FCLLSGAATD TTKTTLGHAV RALSLFPEQH AWLLEDFDNR
IGTAVEEFIR WATPLLTFTR TAAVETELGG RRIMPGDRVV MIYQAGNFDE DVFDRPRELD
LARSPNPHVS FGGGGVHYCL GANLARSMLR AELRELLLRI TEFETDAPDM LATNFIHGLK
RLPFRFTPQ