Gene Information |
Locus tag | TBFG_12949 |
Symbol | |
ID | 5223635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | + |
Start bp | 3279210 |
End bp | 3283676 |
Gene Length | 4467 bp |
Protein Length | 1488 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640607715 |
Product | phenolphthiocerol synthesis type-I polyketide synthase ppsE |
Protein accession | YP_001288878 |
Protein GI | 148824124 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | [TIGR01720] non-ribosomal peptide synthase domain TIGR01720 |
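The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3279210
end_bp = 3283676
gene_length_bp = 4467
protein_length_aa = 1488

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is the stop codon and contributes no residue.
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span matches Gene Length
print(remainder == 0)                 # length is a multiple of three
print(residues == protein_length_aa)  # codon count minus stop matches Protein Length
```

Under these assumptions all three checks agree with the record.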
| |
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 0.892857 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 223 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCATCC CCGAGAACGC GATCGCGGTG GTCGGCATGG CCGGCCGATT TCCGGGCGCC AAGGATGTTT CGGCGTTCTG GAGCAACCTT CGGCGCGGTA AGGAGTCGAT CGTCACCCTG TCCGAACAGG AGCTGCGCGA CGCCGGCGTC AGCGACAAGA CGCTGGCCGA TCCGGCGTAT GTGCGTCGCG CCCCGCTTCT TGACGGGATC GACGAGTTCG ACGCCGGCTT CTTCGGGTTC CCGCCGCTGG CCGCGCAGGT GCTGGATCCC CAACACCGGT TGTTCCTGCA GTGTGCATGG CATGCGCTCG AGGACGCGGG CGCTGACCCC GCACGGTTCG ACGGCTCGAT CGGCGTATAC GGAACCAGCT CCCCCAGCGG CTATCTGCTG CACAACCTGC TGTCGCATCG CGACCCGAAC GCTGTGTTGG CCGAGGGACT CAACTTCGAC CAGTTCAGCC TGTTCTTGCA GAATGACAAG GACTTTCTGG CAACCCGGAT TTCGCACGCG TTCAACCTGC GCGGGCCGAG CATCGCGGTG CAAACCGCGT GTTCATCGTC GCTGGTAGCG GTGCATCTGG CCTGCCTGAG CCTGCTATCC GGCGAATGCG ACATGGCGTT GGCCGGCGGG TCGTCGCTAT GCATCCCGCA CCGTGTCGGC TACTTCACCT CACCGGGATC GATGGTGTCG GCGGTGGGCC ACTGTCGGCC CTTCGACGTG CGGGCCGACG GCACGGTCTT CGGCAGCGGT GTCGGGTTGG TGGTGCTCAA GCCGCTGGCG GCCGCCATCG ACGCCGGAGA CCGGATTCAC GCCGTCATCC GCGGATCGGC GATCAACAAC GACGGATCGG CGAAGATGGG GTATGCGGCG CCCAACCCGG CCGCTCAAGC CGATGTCATC GCCGAAGCCC ATGCGGTGTC CGGCATCGAT TCGTCGACCG TGAGCTATGT CGAGTGCCAC GGAACCGGCA CCCCGCTCGG TGATCCTATC GAAATCCAGG GCCTGCGAGC GGCGTTCGAG GTGTCGCAGA CGAGCCGTTC GGCCCCTTGT GTTCTGGGGT CGGTCAAGTC GAACATCGGC CACCTGGAAG TTGCTGCCGG CATCGCGGGT CTGATCAAAA CGATTCTGTG CCTAAAGAAC AAGGCACTAC CCGCGACGCT GCACTACACC AGCCCGAACC CGGAACTGCG CTTGGACCAA AGTCCGTTCG TCGTGCAAAG CAAGTACGGC CCCTGGGAGT GCGACGGCGT TCGTCGTGCC GGGGTGAGTT CGTTCGGGGT CGGGGGTACC AACGCGCACG TCGTCTTGGA GGAGGCGCCA GCAGAAGCAT CGGAGGTTTC AGCGCACGCC GAGCCGGCTG GCCCTCAGGT AATCCTGCTC TCGGCGCAAA CGGCCGCGGC GCTCGGCGAG TCGCGGACCG CCCTGGCCGC GGCGCTAGAA ACGCAAGACG GCCCGCGCCT GTCCGACGTG GCCTACACGC TCGCCCGGCG CCGCAAGCAC AACGTCACGA TGGCCGCCGT CGTGCACGAC CGCGAGCACG CGGCCACCGT GCTGCGGGCG GCCGAGCACG ACAACGTTTT CGTTGGCGAA GCCGCCCACG ATGGGGAGCA TGGCGATCGC GCCGACGCCG CACCCACGTC GGATCGCGTC GTTTTCCTGT TTCCCGGACA GGGCGCTCAG CACGTCGGAA TGGCAAAAGG GCTCTATGAC ACCGAGCCGG TCTTCGCCCA ACACTTCGAC ACCTGCGCCG CCGGATTCCG CGACGAGACA GGCATCGACT TGCATGCCGA AGTGTTCGAC 
GGGACCGCAA CAGATCTTGA GCGCATTGAC CGTTCGCAAC CGGCGTTGTT CACGGTGGAA TACGCGCTCG CGAAGTTGGT CGACACTTTC GGCGTGCGCG CCGGGGCGTA CATCGGATAC AGCACCGGCG AATACATCGC GGCCACCCTG GCCGGCGTAT TCGACCTGCA GACAGCGATC AAAACGGTGT CGCTGCGCGC CCGCCTTATG CATGAGTCGC CGCCCGGTGC CATGGTCGCG GTGGCTCTTG GCCCCGATGA CGTCACGCAG TACCTGCCAC CGGAGGTCGA GCTGTCCGCG GTAAACGATC CTGGTAACTG TGTGGTCGCC GGGCCCAAAG ACCAGATCCG TGCACTGCGC CAACGTCTTA CCGAGGCAGG GATTCCCGTT CGCCGCGTCC GGGCAACCCA CGCGTTCCAT ACCAGCGCGA TGGATCCCAT GCTGGGCCAA TTCCAAGAAT TCCTGTCCCG TCAACAGCTA CGTCCTCCGC GCACACCGCT GCTGAGCAAC CTCACCGGTA GCTGGATGTC CGACCAGCAA GTAGTCGATC CGGCCAGCTG GACGCGTCAA ATCAGCTCCC CCATCAGGTT CGCCGACGAG CTGGACGTGG TGCTGGCAGC TCCAAGTCGA ATCCTGGTCG AGGTTGGTCC GGGCGGCAGC CTGACCGGTT CGGCTATGCG CCACCCGAAG TGGTCGACCA CGCACCGCAC CGTTCGGCTT ATGCGCCACC CACTGCAAGA CGTCGACGAC CGCGACACTT TTCTGCGCGC GCTGGGCGAA CTCTGGTCTG CCGGAGTCGA GGTCGACTGG ACGCCGCGGC GTCCGGCGGT GCCGCACCTC GTTTCCCTGC CGGGTTATCC ATTTGCCCGT CAACGGCATT GGGTCGAACC TAACCACACG GTTTGGGCGC AGGCTCCCGG CGCAAACAAC GGCTCACCGG CCGGCACTGC GGATGGTTCC ACGGCCGCCA CCGTCGATGC AGCCCGCAAC GGAGAGTCGC AGACCGAGGT TACGCTGCAA CGCATCTGGT CACAGTGCCT CGGCGTCAGC TCGGTCGATC GGAACGCCAA TTTCTTCGAC CTCGGCGGCG ATTCTTTGAT GGCGATCAGC ATCGCGATGG CCGCCGCCAA CGAGGGTCTG ACCATCACGC CGCAGGATCT CTACGAATAC CCGACCCTGG CCTCGCTGAC GGCCGCCGTC GACGCGTCGT TCGCGTCCAG CGGGTTGGCG AAGCCCCCGG AGGCACAGGC GAACCCGGCG GTTCCACCCA ACGTCACGTA CTTCCTCGAC CGCGGATTGC GCGACACCGG CCGCTGTCGT GTCCCGCTGA TCCTGCGCCT GGATCCCAAG ATCGGGCTAC CGGATATTCG AGCGGTGCTG ACCGCAGTGG TCAACCACCA CGACGCATTG CGCCTGCACC TGGTCGGCAA CGATGGGATA TGGGAGCAGC ACATCGCGGC ACCCGCAGAA TTCACCGGGC TTTCCAACCG GTCGGTGCCC AACGGCGTGG CTGCAGGCAG CCCCGAGGAA CGGGCCGCGG TCTTGGGCAT CCTGGCCGAA CTCCTTGAGG ATCAAACGGA TCCGAACGCG CCGCTGGCTG CCGTTCATAT CGCCGCCGCG CACGGCGGTC CGCACTATCT GTGCCTTGCC ATACATGCGA TGGTCACCGA CGACTCATCG CGCCAGATCC TGGCGACCGA CATCGTCACC GCGTTTGGAC AACGGCTGGC AGGCGAGGAG ATCACGCTGG AACCGGTCAG CACGGGGTGG CGGGAATGGT CACTGCGTTG CGCGGCCCTC GCGACGCATC 
CGGCGGCGCT GGACACTCGC TCGTACTGGA TCGAGAATTC GACCAAGGCG ACTTTGTGGC TGGCCGATGC CCTTCCCAAC GCGCATACCG CCCATCCGCC CCGCGCCGAC GAGCTCACCA AGTTGTCGAG CACGCTAAGC GTCGAGCAGA CATCCGAGCT GGACGACGGC CGGCGCAGGT TCCGCCGGTC GATTCAGACG ATCCTGCTGG CCGCCCTCGG CCGCACAATA GCTCAGACGG TAGGTGAGGG TGTGGTCGCC GTGGAGCTCG AAGGCGAGGG CCGCTCGGTG CTGCGGCCGG ATGTCGACCT GCGCAGAACG GTCGGCTGGT TCACGACGTA CTACCCGGTA CCGCTGGCAT GCGCAACAGG GCTGGGCGCG CTTGCGCAGC TGGACGCGGT GCACAACACT CTTAAGTCCG TTCCGCACTA CGGAATTGGA TACGGGCTGC TGCGCTACGT TTACGCCCCG ACCGGACGTG TCCTGGGCGC TCAGCGCACA CCCGACATTC ACTTCCGGTA TGCGGGCGTG ATCCCCGAGC TACCGTCCGG CGATGCTCCA GTACAGTTCG ACTCGGACAT GACGCTTCCG GTGCGCGAAC CGATCCCAGG GATGGGCCAC GCCATCGAAC TTCGGGTGTA TCGGTTTGGT GGCTCACTGC ATCTCGATTG GTGGTACGAC ACCCGCCGGA TCCCGGCGGC AACGGCAGAA GCGCTGGAGC GGACCTTCCC GCTGGCCCTC AGCGCGCTGA TCCAGGAGGC CATCGCGGCC GAGCACACAG AGCACGACGA CAGCGAGATA GTCGGGGAAC CCGAGGCGGG CGCTCTGGTG GACCTGTCGA GCATGGATGC CGGCTGA
|
Protein sequence | MSIPENAIAV VGMAGRFPGA KDVSAFWSNL RRGKESIVTL SEQELRDAGV SDKTLADPAY VRRAPLLDGI DEFDAGFFGF PPLAAQVLDP QHRLFLQCAW HALEDAGADP ARFDGSIGVY GTSSPSGYLL HNLLSHRDPN AVLAEGLNFD QFSLFLQNDK DFLATRISHA FNLRGPSIAV QTACSSSLVA VHLACLSLLS GECDMALAGG SSLCIPHRVG YFTSPGSMVS AVGHCRPFDV RADGTVFGSG VGLVVLKPLA AAIDAGDRIH AVIRGSAINN DGSAKMGYAA PNPAAQADVI AEAHAVSGID SSTVSYVECH GTGTPLGDPI EIQGLRAAFE VSQTSRSAPC VLGSVKSNIG HLEVAAGIAG LIKTILCLKN KALPATLHYT SPNPELRLDQ SPFVVQSKYG PWECDGVRRA GVSSFGVGGT NAHVVLEEAP AEASEVSAHA EPAGPQVILL SAQTAAALGE SRTALAAALE TQDGPRLSDV AYTLARRRKH NVTMAAVVHD REHAATVLRA AEHDNVFVGE AAHDGEHGDR ADAAPTSDRV VFLFPGQGAQ HVGMAKGLYD TEPVFAQHFD TCAAGFRDET GIDLHAEVFD GTATDLERID RSQPALFTVE YALAKLVDTF GVRAGAYIGY STGEYIAATL AGVFDLQTAI KTVSLRARLM HESPPGAMVA VALGPDDVTQ YLPPEVELSA VNDPGNCVVA GPKDQIRALR QRLTEAGIPV RRVRATHAFH TSAMDPMLGQ FQEFLSRQQL RPPRTPLLSN LTGSWMSDQQ VVDPASWTRQ ISSPIRFADE LDVVLAAPSR ILVEVGPGGS LTGSAMRHPK WSTTHRTVRL MRHPLQDVDD RDTFLRALGE LWSAGVEVDW TPRRPAVPHL VSLPGYPFAR QRHWVEPNHT VWAQAPGANN GSPAGTADGS TAATVDAARN GESQTEVTLQ RIWSQCLGVS SVDRNANFFD LGGDSLMAIS IAMAAANEGL TITPQDLYEY PTLASLTAAV DASFASSGLA KPPEAQANPA VPPNVTYFLD RGLRDTGRCR VPLILRLDPK IGLPDIRAVL TAVVNHHDAL RLHLVGNDGI WEQHIAAPAE FTGLSNRSVP NGVAAGSPEE RAAVLGILAE LLEDQTDPNA PLAAVHIAAA HGGPHYLCLA IHAMVTDDSS RQILATDIVT AFGQRLAGEE ITLEPVSTGW REWSLRCAAL ATHPAALDTR SYWIENSTKA TLWLADALPN AHTAHPPRAD ELTKLSSTLS VEQTSELDDG RRRFRRSIQT ILLAALGRTI AQTVGEGVVA VELEGEGRSV LRPDVDLRRT VGWFTTYYPV PLACATGLGA LAQLDAVHNT LKSVPHYGIG YGLLRYVYAP TGRVLGAQRT PDIHFRYAGV IPELPSGDAP VQFDSDMTLP VREPIPGMGH AIELRVYRFG GSLHLDWWYD TRRIPAATAE ALERTFPLAL SALIQEAIAA EHTEHDDSEI VGEPEAGALV DLSSMDAG
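The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that behaviour using only the Python standard library. The function name `translate` and the `START_CODONS` set are hypothetical helpers; the set holds only three common bacterial start codons, whereas table 11 lists a few more.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 shares these assignments with the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of start codons, sufficient for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS given as text, ignoring the spaces between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
head = "GTGAGCATCC CCGAGAACGC GATCGCGGTG"
print(translate(head))
```

Applied to the first 30 bases, this yields the same ten residues that open the protein sequence. A maintained library such as Biopython would normally be preferable for full-length translation.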