Gene Mmcs_2020 details


Gene Information

Locus tag: Mmcs_2020
Symbol:
ID: 4110854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 2169529
End bp: 2170785
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 69%
IMG OID: 638031142
Product: cytochrome P450
Protein accession: YP_639185
Protein GI: 108798988
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
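
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2169529
end_bp = 2170785

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1257 418
```

Under those assumptions the computed values agree with the listed gene length (1257 bp) and protein length (418 aa).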


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGACG CCCGGAAGAC CCCAGTGACG CAACAGTTCT CGTACGATCC GTTCGATGCC 
GCCGTGATGG CCGACCCGCT GCCGTTCTAC CGCACCCTGC GCGACGAACA CCCGCTCCAC
TACGTGGACA AGTGGGACAC CTACGCCCTC TCGCGGTTCG AGGACATCTG GCAGGTCCTC
GAGGTCAACG ACGGGACGTT CGTCGCCTCG GAGGGCACCC TGCCGTCGAC CGCCGTGCTG
GCCACCCACA ACACCGGCCC CGTGCCGGAT CCGCCCCTGT CGCCGATGCC GTTCCACGCC
AACTACGACG CGCCCCTCTA CACCGACGTG CGACGGTGCA CCTCCGGTCC GTTCCGGCCG
AGGTCGGTGA GCCGGCTGGC CGACCGGATC CGGGAACTGG CCAACGAACG GCTCGACGAA
CTGCTGCCGA AGGGACGGTT CGACCTCACC GCCGATTACG GCGGCATCGT CGCCGCATCC
ATGGTGTGCG AACTGATCGG GCTGCCAACC GAACTCGCCG CCGACGTACT GGCCACCGTC
AACGCCGGCA GCCTCGCCCA GCCCGGCAGC GGCGTCGAGG TCGCCAATGC CCGGCCCGGC
TATCTCGAGT ACCTGACCCC GCTCGTCGCG CGTCGGCGCG CCGAACGCCG CGGCGACGAC
ATGCCGATCG TCGACTCGCT GCTCGACTAC CGCAAACCCG ACGGGACCCC GCTGACCGAC
ACCGAGGCGG CCGTGCAGAT GCTCGGTGTC TTCATCGGCG GAACGGAGAC GGTGCCGAAG
ATCGTGGCGC ACGGGCTGTG GGAACTCTTC CGTCGCCCCG AGCAGCTGGC GCAGGTGCGC
GCCGATCCGG CGGCCCATGT GCCGATGGCG CGCGAGGAGA TGATCCGCTA CTGCGCACCC
GCGCAGTGGT TCGCGCGCAC CGTGCGCAGG CCGTTCACGA TCCACGGCAC CACCATCGAA
CCCGGTCAGC GCATCATCAC GCTGCTGGCG TCGGCCAACC GCGACGAGCG GGAGTACCCC
GATCCCGACG AGTTCGTCTG GGACCGGCGC ATCGAACGCC TGCTGGCGTT CGGGCGCGGG
CAGCACTTCT GCCTCGGGGT GCACCTGGCC CGTCTCGAGA TCGGCATCAT GGTGACCGAA
TGGCTCAGGC GCGTGCCGGA ATTCACCGTG GACGCCGAAC GCGCGTCACG GCCCCCGTCG
AGCTTCCAGT GGGGCTGGAA CAGTGTCCCG GTGGAGGTCC AGGTGGAGGG CGTCTGA
 
Protein sequence
MNDARKTPVT QQFSYDPFDA AVMADPLPFY RTLRDEHPLH YVDKWDTYAL SRFEDIWQVL 
EVNDGTFVAS EGTLPSTAVL ATHNTGPVPD PPLSPMPFHA NYDAPLYTDV RRCTSGPFRP
RSVSRLADRI RELANERLDE LLPKGRFDLT ADYGGIVAAS MVCELIGLPT ELAADVLATV
NAGSLAQPGS GVEVANARPG YLEYLTPLVA RRRAERRGDD MPIVDSLLDY RKPDGTPLTD
TEAAVQMLGV FIGGTETVPK IVAHGLWELF RRPEQLAQVR ADPAAHVPMA REEMIRYCAP
AQWFARTVRR PFTIHGTTIE PGQRIITLLA SANRDEREYP DPDEFVWDRR IERLLAFGRG
QHFCLGVHLA RLEIGIMVTE WLRRVPEFTV DAERASRPPS SFQWGWNSVP VEVQVEGV
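
The relationship between the two sequences can be illustrated with a minimal translation sketch. It is not the database's own tooling. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard code, and that the first codon is an initiator read as methionine (GTG, used here, is an alternative start codon in that table). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position
# (third base varying fastest); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for index, pos in enumerate(range(0, len(seq) - 2, 3)):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a valid start codon and is read as Met.
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "GTGAACGACG CCCGGAAGAC CCCAGTGACG CAACAGTTCT CGTACGATCC GTTCGATGCC"
print(translate(first_line))  # prints: MNDARKTPVTQQFSYDPFDA
```

The output matches the first 20 residues of the protein sequence listed above. The same function can be applied to the full gene sequence, since whitespace is stripped before translation.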