Gene Amir_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_3894 
Symbol 
ID8328087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp4575773 
End bp4576984 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content77% 
IMG OID644944377 
Productcytochrome P450 
Protein accessionYP_003101614 
Protein GI256377954 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGTCCG AACCCGTGGT GGACGTCAGC GACCCGGCCG TCCTCGGCGA CCCGGTGGGC 
GGGTACGACC ACCTCCTCGC CGAGCGGCCG GTCTGCTGGG CCCGGTTGCC CGGCGGCCAG
GAGGGCTGGC TCGTGACCGG CAACGCCGAG GTCCGCGCCG TGCTGACGCA TCCGGCCGTG
CGCAACGATC CCACCGGCCT CCCGGGACGG GGCGCGCGGA CCCAGGAGGA GGTGTTCCTC
GCGCTGGGCA CGCCCCCGGA GCACCTGCCG TACCTGCGGG CGAGCCTGCT GAGCCTCGAC
GGCGCCGAGC ACGCGCGGCT GCGGCGCGGC ATCGGCCACG CGTTCAGCGC GTCCCGCGTC
CAGGCGCTGC GGCCACGGGT GCAGGAGATC GTCGACGGGC TGCTCGACGG TCTCGGCCCG
GCGACCGCGG ACCTGCTGGA GGAGTACGCC TACCCGCTGG CCATGGCCGT GGTCTGCGAG
CTGGTCGGGG TGCCCGAGGC GGACTGGGCG CACTGGTACC GCTGGGGCAA GGCCCTGGTC
GACGGCGACC CGGCGCGGAT CACGCCGACG CTCGGCGAGA TGTTCGCCCA CTGCCACGAG
CTCGTCGACC GGCGCCGGGC CGAACCCCGC GCGGACCTGC TCAGCGAGGT CGTCGCCAGG
GACGACCTGA CCGACGTCGA CGTCGTGGCG CTCGTGGTGT TCCTCGTGCT CGCCGGGCAC
GAGACCATGG CGCACCTGCT GTCGAACGCG GCGCTCGCGC TGATGCGCGA CCCCGCCCAG
CGGGAGCTGC TGCGCGCGGA GCCGGGGCTG TGGCCCGCCG CGGTCCGCGA GCTGGTCCGC
ACCGACGGCC CGGTGCAGCT GGCGCGGCTG CGCTACGCCG CGACCGACCT GGAGGTGGGT
GGCGTGCGGA TCAGCGCCGG GGACGCCGTG CAGGCGGTGC TCGGGGCCGC CAACCGCGAC
CCGGCCCAGT TCGCCTGCCC GCGCCACGCC GACGTGCGCG GGCAGGCCGA GCGCGGCGCT
CGCGAGGGCG GCGTCGGGTT CGGCTGGGGT CCGCACTTCT GCCTCGGCGT CGCCCTGGCG
AAGGTCGAGG CCGAGATCGC GCTGCGGAGC CTGTTCGACC GGTTCCCCTC GGTGGCGCCG
GTCGGCGAAC CGGCGTGGGT GCCGCTGCCG CGGGGGCGGC ACCGGGTCGC GCTGGAGGTG
GCGCTGCGGT GA
 
Protein sequence
MSSEPVVDVS DPAVLGDPVG GYDHLLAERP VCWARLPGGQ EGWLVTGNAE VRAVLTHPAV 
RNDPTGLPGR GARTQEEVFL ALGTPPEHLP YLRASLLSLD GAEHARLRRG IGHAFSASRV
QALRPRVQEI VDGLLDGLGP ATADLLEEYA YPLAMAVVCE LVGVPEADWA HWYRWGKALV
DGDPARITPT LGEMFAHCHE LVDRRRAEPR ADLLSEVVAR DDLTDVDVVA LVVFLVLAGH
ETMAHLLSNA ALALMRDPAQ RELLRAEPGL WPAAVRELVR TDGPVQLARL RYAATDLEVG
GVRISAGDAV QAVLGAANRD PAQFACPRHA DVRGQAERGA REGGVGFGWG PHFCLGVALA
KVEAEIALRS LFDRFPSVAP VGEPAWVPLP RGRHRVALEV ALR