Gene Mflv_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1404 
Symbol 
ID4972730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1463556 
End bp1467530 
Gene Length3975 bp 
Protein Length1324 aa 
Translation table11 
GC content73% 
IMG OID640455608 
Productnon-ribosomal peptide synthetase 
Protein accessionYP_001132674 
Protein GI145221996 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain
[TIGR02353] non-ribosomal peptide synthetase terminal domain of unknown function 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.305242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCGAA CCCCAGAGAT CCCTAGTCAG TTCCTGCTCT CGGCATACGC CCCGCAACCG 
CGCACCCTGA TCGACATCCT GGCCGAGACG GCGCGCCGGT TCCCCGACGC GCCCGCCCTC
GACGACGGCA CGGTGCAGCT CACCTACGCA GAACTGCTCT CCGACGTCGA GGACAGCGTC
GCGTGGCTCG GCGCCCGCGG CATCGGCCGC GGCGACCGTA TCGGGATCCG GATGCCGTCG
GGCAGCTACG CGCTCTACGT CGCGATCCTG GCCACCCTCG CCGCCGGCGC GGCCTACGTC
CCGGTCGACG CCGACGATCC GCAGGAGCGC GCCGATTTGG TGTTCACCGA GGCCGGCGTC
GTCGCGATCA TCACCGAGCG GGGCCTGGTG CGCGGTCCGG GCGCATCCCG CGGGTGGCGG
GCCGCCGCGC CGCTGAGCCG CGACGACGCC TGGATCATCT TCACCTCGGG CTCCACCGGC
ACCCCGAAGG GCGTCGCGGT CTCGCACCGC AACGCGGCCG CGTTCGTCGA CGCCGAGGCG
AAGATGTTCC TGCAGAACAA CCCGCTCGGC CCGGGGGACC GCGTGCTGGC CGGACTGTCC
GTCGCGTTCG ACGCGTCGTG CGAGGAGATG TGGCTGGCGT GGCGGCACGG CGCCTGCCTG
GTGCCGGCCC CGCGGTCGCT GGTGCGCAGC GGCATGGACC TCGGTCCGTG GCTGGTCTCA
CGGGACATCA CCGTCGTGTC CACGGTGCCG ACGCTGGCCG CCCTGTGGCC CGCCGAGGCG
CTGGAATCGG TGCGGCTGTT GATCTTCGGC GGTGAGGCCT GCCCGCCCGA GCTGGCCGAG
CGCCTCGCCG CCGGACCCGA CTCCGCGGGC CGTGAGGTCT GGAACACCTA CGGCCCCACC
GAGGCGACGG TGGTCGCGTG TGCGGCGAGG CTGGACGGAG GTTCCGGAGG GCAGGAGCGC
AGCGACCAGG GAAATAGGAC AGCGGCGGTC AGCATCGGCT TGCCGCTACC CGGCTGGGAC
CTCGCCGTGG TCGACAAGGC CGGGGAACCG GTCGCGGTCG GCGAGGTCGG CGAACTGATC
ATCGGCGGCG TCGGACTCGC CCGCTACCTC GACCCGGACA AGGACGCCGA GAAGTACGCG
CCGTTCCCGA CCCTGGCGTG GAGCCGCGCG TACCGCAGCG GCGACCTCGT GCGGCTGGAG
ACCGACGGGC TCTACTTCGT CGGCCGCGCC GACGACCAGG TCAAGGTCGG CGGCCGCCGC
ATCGAGCTCG GCGAGGTCGA CAACGCGCTG ATGAACCTGC CCGGGGTCAC CGGCGCCGCC
GCCGCGGTCC GCCGCACCGC CAGCAGCACC CCGCTGCTGG TCGGCTACCT CGCGGTGGCG
CCGGGGCTGG AGACGCCGTT CGACATCTCC CGGGCGCGGG CACAGCTGTC CGAAGCGCTG
CCCGCCGCGC TGATCCCGCG CCTGGTCGTC GTCGACGAAC TCCCGACCCG CACGTCGGGC
AAGGTGGACC GCGACGCACT GCCGTGGCCG GTCGACAGCG CCGAGAAGGA CACCGCAGAC
GGTTCCGATC TGGGCGGCGG GACGCTCGGG TGGCTGGCCG GCGTGTGGCG CGACGTGCTG
GCCGCCCCGA TCGACGGTCC CGAGGCCGAC TTCCACGCCC TCGGCGGCGG CTCGCTGTCG
GCGGCGCAGC TCGTCGCCGC GCTGCGGCAG CGCTACCCGC AGGTCACCGT CGCCGACCTC
TACGACCATC CCCGACTGGG TTCGCTGGCC GGCTACCTCG ACGAACTCGA TCCGCCCGCG
GCGGTCGAGA CCCGCACGGT TACGCCGGTC TCCAGGCTGA CCCAGGCGGT GCAGGTGGCG
CTGACCGTCC CGCTGGCGGT GCTGACCGGC CTGCAGTGGG TGGTCTGGCT GGCGGTCGCC
AACAACATCG CCGCCGAGTT CTCTCTGGTC GACTGGGTGT CGCCGGTCAG CTGGTGGTGG
GTGCTCGCCG GGTTCCTGGT GTTCGTCACC CCTCCCGGCC GGATGAGCAT CGCGGTGTTC
GGCGCCCGGA CCCTCGTCGG GAGCCTGCAG CCGGGCACCT ATGCCCGCGG CGGTTCGGTC
CATCTGCGCG TATGGCTGGC CGAGCGGCTC GCGGACGCCA GCGGCGCCGA GAACATGGCC
GGGGCACCGT GGCTCGTCTA CTACGCCCGC GCGCTCGGCA ACAGGGTGGG CAGCGGGGTG
GACCTGCACT CCGCGCCGCC GGTGACCGGG ATGCTGACGC TCGGGCATCG CTGCTCGATC
GAACCCGAGG TCGACCTGAC CGGGCACTGG ATCGACGGCG ACCTGTTCCA CGTCGGCCCG
ATCACCGTCG GCAACGACGC GACCGTCGGC GCCCGCACGA CGCTGCTGCC CGGCGCCGTC
GTCGGCAAGA ACGCCGACGT CGCACCGGGT TCCGGCGTGA TCGGCAAGGT CAAGAACGGG
CAGTACTGGA AGGGCTCGCC GGCGGTGAAG TCCGGCAAGG CCAAGCACCC GTGGCCCGAT
CACCGCCCGC CGCGGGCGCC GGTGTGGGTC GCGATGTACG GCGTCACCTC GCTGCTGCTG
GCCGCGCTGC CGCTGACCGC GCTCGCGGCC GGGCTGGCCG TGATCGGTTG GGGCGTGCGC
GGGACGTCGA CGGTGACGTC GGCGATCGTT CCGGCGCTGC TGTGGGCCGT GCCCGCGACG
GCAGCCGCGG TGCTGGTGTA CGCGACGCTG ACGGTCATCG GTGTGCGTCT GCTCGCGATC
GGGCTGACCG AGGGCTACCA CCCGGTGCGC AGCCGGCCGG GCTGGCAGCT GTGGGCCACC
GAGCGTCTGA TGGACTCCGC CCGCACCTAC CTGTTCCCGG TCTACGCGGG CCTGCTCACC
CCGTGGTGGC TGCGGCTGCT CGGCGCGACG GTCGGTAAGG GCACCGAGAT CTCGACCGCG
CTGCTGATCC CGAAGTTCAC CGTCATCGAG GACGGCGCGT TCCTCGCCGA CGACACGATG
GTCGCGTCCT ACGAACTCGG CGGCGGCTGG ATCCACGTCG CGCGGGCCAC CGTCGGCAAG
CGCGCCTTCC TCGGCAACTC GGGCATCACC CAACCCGGGC GCAGGGTTCC CGACGACGGC
CTCGTCGCCG TCCTGTCCGC GACACCGCGC AAGGCCAAGG CCGGCTCGTC GTGGCTGGGC
AGCCCGCCGG TGCGGTTGCG GCGCCGTCCG ACGGCCGCCG ACGCGCTGCG CACCTTCCAT
CCGTCCGTCC GGCTCAAGAT GTTGCGCGCC ACCGTCGAGA TGTTCCGGTT CGTCCCGGTG
GTGGTGACGT TCACGCTCGG GATCGCGGTG CTGTTCGCGA TCCAGGCGGC GGCGGTGCAC
GTCGGGTGGC CGTGGGCGGC GCTCGCCGCG GGTCCGGTGC TGCTGGCCGC CGGCGCGGTG
GCCGGCGCGA TCGCCGTGAT CGCCAAATGG CTTGTGATCG GGCGGATCAC CGCGATCGAA
CATCCGCTGT GGTCGTCGTT CGTGTGGCGC AACGAGGTGG CCGACACCTT CGTCGAGACG
GTGGCCGCGC CGTGGTTCGC CCGCGCCGCA TCGGGTACGC CGGTGATGAA CCTCTGGCTG
AGGGGCCTGG GCGCGACGAT CGGACGCGGT GTGTGGTGTG AGACCTACTG GCTTCCCGAA
GCCGACCTGG TGACGCTCGA CGACGGGGCC ACGGTCAACC GGGGCTGCGT CGTGCAGACC
CACCTGTTCC ACGACAGGAT CATGCGGATG GACACCGTGG TGCTCGAGGC GGGCGCAACG
CTCGGACCGC ACTGCGTCGC GCTGCCCGCC GCGCGCATCG GCGCCGGGGC GACGGTCGGG
CCCGCGTCCC TGGTGATGCG CGGCGACGAG GTGCCGCCGT CGACGAGGTG GCAGGGTAAC
CCGATCGCGG TGTGGAATCC GCCGCGCAAG AAGCGCGAGT CCGACCCCAA ACCCAAGAAG
TCTGCGGCCG CGTGA
 
Protein sequence
MDRTPEIPSQ FLLSAYAPQP RTLIDILAET ARRFPDAPAL DDGTVQLTYA ELLSDVEDSV 
AWLGARGIGR GDRIGIRMPS GSYALYVAIL ATLAAGAAYV PVDADDPQER ADLVFTEAGV
VAIITERGLV RGPGASRGWR AAAPLSRDDA WIIFTSGSTG TPKGVAVSHR NAAAFVDAEA
KMFLQNNPLG PGDRVLAGLS VAFDASCEEM WLAWRHGACL VPAPRSLVRS GMDLGPWLVS
RDITVVSTVP TLAALWPAEA LESVRLLIFG GEACPPELAE RLAAGPDSAG REVWNTYGPT
EATVVACAAR LDGGSGGQER SDQGNRTAAV SIGLPLPGWD LAVVDKAGEP VAVGEVGELI
IGGVGLARYL DPDKDAEKYA PFPTLAWSRA YRSGDLVRLE TDGLYFVGRA DDQVKVGGRR
IELGEVDNAL MNLPGVTGAA AAVRRTASST PLLVGYLAVA PGLETPFDIS RARAQLSEAL
PAALIPRLVV VDELPTRTSG KVDRDALPWP VDSAEKDTAD GSDLGGGTLG WLAGVWRDVL
AAPIDGPEAD FHALGGGSLS AAQLVAALRQ RYPQVTVADL YDHPRLGSLA GYLDELDPPA
AVETRTVTPV SRLTQAVQVA LTVPLAVLTG LQWVVWLAVA NNIAAEFSLV DWVSPVSWWW
VLAGFLVFVT PPGRMSIAVF GARTLVGSLQ PGTYARGGSV HLRVWLAERL ADASGAENMA
GAPWLVYYAR ALGNRVGSGV DLHSAPPVTG MLTLGHRCSI EPEVDLTGHW IDGDLFHVGP
ITVGNDATVG ARTTLLPGAV VGKNADVAPG SGVIGKVKNG QYWKGSPAVK SGKAKHPWPD
HRPPRAPVWV AMYGVTSLLL AALPLTALAA GLAVIGWGVR GTSTVTSAIV PALLWAVPAT
AAAVLVYATL TVIGVRLLAI GLTEGYHPVR SRPGWQLWAT ERLMDSARTY LFPVYAGLLT
PWWLRLLGAT VGKGTEISTA LLIPKFTVIE DGAFLADDTM VASYELGGGW IHVARATVGK
RAFLGNSGIT QPGRRVPDDG LVAVLSATPR KAKAGSSWLG SPPVRLRRRP TAADALRTFH
PSVRLKMLRA TVEMFRFVPV VVTFTLGIAV LFAIQAAAVH VGWPWAALAA GPVLLAAGAV
AGAIAVIAKW LVIGRITAIE HPLWSSFVWR NEVADTFVET VAAPWFARAA SGTPVMNLWL
RGLGATIGRG VWCETYWLPE ADLVTLDDGA TVNRGCVVQT HLFHDRIMRM DTVVLEAGAT
LGPHCVALPA ARIGAGATVG PASLVMRGDE VPPSTRWQGN PIAVWNPPRK KRESDPKPKK
SAAA