Gene Mkms_3035 details


Gene Information

Locus tag: Mkms_3035
Symbol:
ID: 4610868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3177439
End bp: 3178740
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 67%
IMG OID: 639792704
Product: linalool 8-monooxygenase
Protein accession: YP_939019
Protein GI: 119869067
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACAA CACTGCATCC GACCGGTATC GCACCGCGTG AGAACGGCAC GCCGCCACCG 
CACGTACCGC TCGGCGACAT CGATCTGGGC ACCCTCGACT TCTGGGAATG GGACGACGAC
CGCCGCGACG GGGCGTTCGC GACCCTGCGC AAGGAATCGC CGATCACCTT CTTCGAGGTG
CCGGAGTACG CGGGGTTCGC CGGCGGGCGT GGCCATTGGG CCCTGACCAG GTACGACGGC
GTCCACCACG CCAGCAGGCA TCCGGAGGTC TTCAGCTCGA TCCCGACCAG CACCGCCCTC
AACGATGTGC CCGTCGAGAT CGCCGAATAC GTCGGATCCA TGATCTCGCT CGACGACCCT
CGTCATCTCC GGCTGCGGTC GATCGTCAAC CGCGCGTTCA CTCCGAAGAT GCTGACACGG
ATCGAGCAGA GCGTGCGTGA CCGGGCACGC CGGTTGGTCA CCGGTCTGGT GGCCGACCAC
CCCGACGGAC ACGCCGACTT CGTGCAGGCG GTGGCCGGAC CGTTCCCATT GCAGATCATC
TGCGACATGA TGGGGATCCC CGAAGAGGAC GAGGAGAAGG TCTTCCACTG GACCTCGATC
ATCCTGGGCG GCGCCGACGA GGAGGTCGCG CCCGACCACG AGACGATCGT CGGTGCGGTG
CTCGGACTCG GTGAGTACGG TCTGGCACTC GCGGAGGACC GCCGGGCCCA TCCGACCGAC
GATCTGACCA CTAACCTGGT GCGCGCCGAG GTCGACGGGG AACGGCTCAC GTCCGCGGAG
ATCGGCTCGT TCTTCATCCT GCTGTCAGCC GCGGGCAACG AGACGACGCG CAACGCAATC
AGCCACGGCC TCGTCGCGCT CAGCCGCTAT CCGGAACAGC GCCAACGCTG GTGGGACGAC
TTCGACACCG TCGCCCCGAC CGCCGTGGAG GAGATCGTAC GGTGGGCGTC CCCGATCATC
TTCATGCGCC GCAACCTGAC CGAGGACATC GAGATGAGGG GTGTGCGGAT GAAGGCGGGG
GACAAGGTGT CGATGTGGTA CAACTCCGCC AACCGTGACG AGCGCAGGTT CGACAACCCC
TGGCTGTTCG ACGTGACACG GGATCCCAAC CCGCAGATCG GATTCGGCGC GGGTGGCGCG
CACTTCTGCC TCGGCGCGAA CCTCGCCCGG CGCGAGATCA GGGTGCTCTA CTCGGAGTTG
CGCCGTCAGG TGCCCGACAT CGTGGCCGTC GAGGAGCCGG CGATCCTGCG GTCGGCGTTC
GTGCACGGCA TCAAACGGCT GCCCGTGGCC TGGACGGGGT GA
 
Protein sequence
MTTTLHPTGI APRENGTPPP HVPLGDIDLG TLDFWEWDDD RRDGAFATLR KESPITFFEV 
PEYAGFAGGR GHWALTRYDG VHHASRHPEV FSSIPTSTAL NDVPVEIAEY VGSMISLDDP
RHLRLRSIVN RAFTPKMLTR IEQSVRDRAR RLVTGLVADH PDGHADFVQA VAGPFPLQII
CDMMGIPEED EEKVFHWTSI ILGGADEEVA PDHETIVGAV LGLGEYGLAL AEDRRAHPTD
DLTTNLVRAE VDGERLTSAE IGSFFILLSA AGNETTRNAI SHGLVALSRY PEQRQRWWDD
FDTVAPTAVE EIVRWASPII FMRRNLTEDI EMRGVRMKAG DKVSMWYNSA NRDERRFDNP
WLFDVTRDPN PQIGFGAGGA HFCLGANLAR REIRVLYSEL RRQVPDIVAV EEPAILRSAF
VHGIKRLPVA WTG