Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3126 |
Symbol | |
ID | 4646156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 3312389 |
End bp | 3317134 |
Gene Length | 4746 bp |
Protein Length | 1581 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639806603 |
Product | beta-ketoacyl synthase |
Protein accession | YP_953934 |
Protein GI | 120404105 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3321] Polyketide synthase modules and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.228806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.663439 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCTG CTTTTGACGA GGCAGCTGTT CGTCGGTGGT TGGTCGACTA CCTGGTCACG AACAACGGTT GCAGTCCGGA GCACATCGAA CGCGGCGCCT CCATGCACGA CCTCGGCGTG GGATCCCGCG ACGCGGTCGT GCTCACCGGC GTGCTGTCGG AGTACCTCGG CCGCGCCGTG TCGCCCGTGG ACTTCTGGCA GTACCCGACG GTGGACGCGC TGGCCAAGTT CCTGACCGGT GGCGAAGTCG AACCCGTCGA CCCCGGCCCC GGCGTCCGGC CGGCCGCCAC GAACGAGCCG ATCGCGGTGA TCGGGCTCGG TCTGCGTCTG CCCGGTGGCG CCGACCTGGA CTCCAACATC GAAGGGCCCG AAGCCTTCTG GGAGTTCCTC ACCGAGGGCC GCTCGTCGGT GCGGGAGGTC CCCGAAGACC GCTGGGAGTG GTGCGAGGAC GGCACACCCG AGGGCGCCGC CGCGCTGGCA GACACCACCA GGTGGGGTTC GTTCCTGCGC GACCTGGACG CCTTCGACGC CGAGTACTTC GAGATCATCC CGCGCGAGGC CACCCGGATG GATCCGCAGC AGCGGCTGCT GCTCGAGGTC ACCCACGAGG CCCTGGAGAA CGCGGGCATC GCCGCCGACT CGCTGGCCGA GACGCAGACC GGCGTGTTCG CCGGCGCGAG CGCCGCCGAC TACGCACAGC TGGGCGCCAC GGACCTGAGC GGCATCGACG CCTGGTACAG CACAGGTGGA GCACTGAGCA TCATCGCCAA CCGGGTGTCG TACTACTTCG ACCTGCGCGG CCCGTCGGTG ACCGTGGACA CCGCCTGCTC GTCGTCACTG GTGGCCATCC ATCTGGCGTG CCAGAGCCTG CGCTCGGGCG ACTCGGAACT GGCGTTGGCG GCCGGGGTGA ACCTGCTGTT GTCACCGGCC CCGACGAGGA GTTTCGACCG GGCCAAGGCG ATGTCGCCGA CGGGACAGTG TCACGCTTTC GACGCCGGCG CCGACGGCTT CGTGCGCGGC GAGGGTTGCG GGGTGGCGGT GCTCAAGCGG CTGTCCGACG CGCAGCGCGA CGGTGACCGG GTGCTCGCGG TGATCCGCGG GTCCGCGGTG AACCAGGACG GCCGCTCCAA CGGCCTGATG GCCCCGAATC CGTCGGCGCA GATGGCGGTG CTGCGCGCCG CCTACGCGGC GGCCGGGGTG GACCCCCGTG AGGTCGACTA CGTCGAGGCG CACGGCACCG GCACGCTGCT GGGCGATCCG ATCGAGGCGC GGGCGCTGGG CACCGTGCTC GGTAAGGGCC GCGCGGCCGA TGCGCCGCTG CTGCTCGGTG CGGTCAAGTC CAACCTGGGC CACCTCGAAG CCGCGGCAGG CATCGCAGGG TTCGCCAAAG CGGTATTGGC GTTGCAGCAC AACATGATTC CGGCCAACCT CGGCTACCAG AGCCCGAACC CGCACATTCC GTTCGAGAAG CTGCGGTTGA AGGTCGTCGC CGAGCACACC GATTGGGCCC CCGCCGGGCG GCCCCGGCGT GCGGGCATCT CGTCGTTCGG GTTCGGCGGC ACCAACGCCC ACGTCGTGAT CGAACAGGCT CCGGTGTTCG CCCCGTCGCC GGTCGAGGAA CCCGAAGCGG TCACCACGCT GGTGGTGTCG GGCAAGTCGC CCGAGCGCAT CGCGTCGCAG GCCGCCGCAC TGGCGGAGTG GATGGCCGGC GCCGGATCCG ACGCCTCACT GGTGGAGGTG GCGCACTCGC TCAACCACCA CCGCGCCCAG CACGCCAAAT TCGCGACCGT GGTGGCCCGC GACCGTGATC AGGCCATCGC GGGGCTGCAG GCGCTGGCCG CCGGGCAGTC CGCGCCGGGC GTGGTCGGGG TTGCCGCAGG CAACCCGCAG CCTGGCACGG TGTTCGTGTA TTCGGGTCAG GGGTCGCAGT GGCCGGGGAT GGCCCGGCAG CTGTTGGTTG ACGAGCCGGC GTTCGCGAAC GCGTTGGCCG AGATCGAGCC GGTGTTCGTC GAGCAGGTCG GTTTCTCGTT GCGTGACGTC ATCGCAGGTG GCGAGACCGT CAGTGGTGAT GCGCAGGTTC AGCCGGTGCT GATGGGTCTG CAGCTGGCGT TGACCGAGCT GTGGCGGTCC TACGATGTGC ATCCGGATGC GGTGATCGGT CATTCGATGG GTGAGGTGAC CGCCGCAGTG GTGGCCGGGG CGTTGAGTCT GGCCGACGGG CTGAAGGTCA TTGCCGCGCG GTCGTCGATC ATGTCCCGGC TGGCCGGGCA GGGCGCCGTC GCGCTGCTCA ACCTCGACGC CGACGCGGCC CGCTCGCTGA TCGCCGACCA TCCGAGCGTC GAGATCGCCG GATATCTGTC GCCGCGCCAG ACGGTGGTGG CCGGCCTGCC CGAGCAGGTC GACGCCGTCA TCGCCGCGGT CACCGCACAG AACACGTTCG CCCGCCGGGT CAACATGGTC GTCGCCTCCC ACACCGCGCT GATGGACCCG GTGCTGCCCG ACATCCGCGC GGCGCTGGCC GGAATCGAAC CCAACATCCC GACGCTGCCG TTCCTGTCGA CGGTCACCGG GGCCGACAGC GCACCGGTGC TGGACGCCGA CTACTGGGTG GCCAACGTGC GCCAACCGGT CAAATTCAGC CAGGCCATCG TCGCGGCGGC AGCCGACAAC GGCACGTTCA TCGAGATCAG CCCGCACTCG ACGCTGGGTC AGGCGATCGG CAACACGCTT GGTGACGGGC CGGACCATCA CATGCTCGGC ACCCTGGCGC GCGACACCGA CGACACGGTG ACGTTCCACG CGAACCTGAA CGCCACCCAC ACGTCGCGGC CCCCGCACAC CGCCCATCAC GGCGAACCCC ACATCGTGCT GCCCACCACC CCGTGGCACC GCAGCAGACA CTGGGTGGAC GTCCGCCCGC TGCGCCGCGG TGGCGGCGTG CGTGCCGGTG CGTTGCCCGC CGACAGCGCG GTCCCGCCGG AGTGGTTCTG CGGGCTGACG TGGCCCGCCA AACCCCTTGT GGCGCAGGAT GGTCCAGTCG ACCCGGGCGC CAGCTGGCTG GTGGTCGGCG ACGACGCGCT GGCCGCCGAG ATGGGTAAGC TGCTGGGCAG GCCGGTGGCC ACCGGTGACG GAGCGGATCT GTCCTCGGCC ACGCACGTCC TCTACGCGCC GACCTCCGGT GACGAGCTGG GTTACGAGCT CTTCGAGGCG GGCCGTGCGA TTGCGACCAC CGCGAGCCGC GTATCGCAGC CCCCGAAGCT GTTCCTGCTG ACCCGCAATG CCCAGCCGGT CAGCGAGGGG GACCGGGCCA ACCCCGGGCA GGCGGTGCTG TGGGGCCTGG GCCGCACACT GGCGCTGGAG CATCCCGACA TCTGGGGCGC CCTGATCGAC ACCGACGAGT CGGTGCCCGC GGTCGTCGCC GCGCGCTGGG TGCTCGCGGA GGCGCATGCC GGCGACGGTG AAGACCAGCT GGTGTACCGG GCCGGCATCC GCCGGGTGGC CCGCCTCGTG CACGCGCTGC CGCCCGCGCC GACGGGTTCG GGGACACTCG ACGCCGACGG CGCGCACCTG GTCATCGGCG CGACGGGCAA CATCGGCCCC CGACTCGTCG AGCAGCTCGC CGCGATGGGC GCCAAGACCG TGGTGGCGGT GTCCCGTAAC CCGGGCTCAC GGCTGGACGA ACTGACCGCG CGGCTCGCGG CCTCCGGGAC GACGGTCGTG ACCGCCGCAG CGGATGCGTC CGACGCGTCA TCGCTGCGAG CGGTGTTCGA CCGCTTCGGG GCGGACCTGC CCCCGCTGAA GGGCGTCTAT CTGGCGGCGA TGAGCGGCGG TCCGGTCACG CTGGACGAGA TGACCCACGA CGACGTGGTG GCGATGTTCC GGTCCAAGAT GGATGCCGCG GCGCTGCTGC ACCAGCTGTC GGCGGGCCAT CCCGTCGAGC AGTTCGTGCT GTTCTCGTCG ATCTCCGGCG TGCTGGGTTC GCGCTGGCTG GCGCATTATG CGGCGACGAC GACGTTCCTG GACACCTTCG CGTTCGCCCG CCGGGCGGCA GGGCTGCCGG CCTGCGCGAT CAACTGGGGC CTGTGGAAGT CGCTGGCCGA CGCGCAGACC GGGTTCGAAC GTCAGGCCAC CCAGGAGTCG GGTCTGGAGC CGATGGACGA CGCGGTCGCC ATCACCGCGC TGCGCTCGTT CATCGGACCG CAGGCGCCCG CCAGGGCCAC CGTGGTGGCG GCGGACTGGC CCCGGCTGGC CGCGGCCTAC CACACCCGCG CGCAACTGCA CATCCTCGAC GACCTGCTGG CCACAGAGGG CACAGCGGCC ACCGGCCTTA CCCCGACCGG TGACACGAGA TTTCGCCAGG AGCTGAAGGA GTGCGAGCCC CAGCGGCGTG TGGAGTTGCT CACCGATCAC GTGTTGTCGC AGATCGCCGC CGCAATGGGG CTGGCTTCGA CGCACACGCT CGACCCGACG GTGGGGTTCT TCCAGTTCGG AATGGATTCG CTGATGAGCG TAACGTTGCA GCGTTCGCTG AGCGAAAGCC TGGGGGAGGC CCTGCCGGCG TCGGTGGTGT TCGACTACCC GACCGTGGAA GCGCTCACCG ATTATCTGGC GTCGGTTCTT CCTGAGATAA TCGAGACCGC CGGTACCGAC AACGACGTCG ATGGTGGCCC CAGCGTCATT GAGGACGCCT ACGACGACCT CGCCGAGGAC GAGTTGCTGG CGAGACTTTC GGAAAGATTG AGTTGA
|
Protein sequence | MTSAFDEAAV RRWLVDYLVT NNGCSPEHIE RGASMHDLGV GSRDAVVLTG VLSEYLGRAV SPVDFWQYPT VDALAKFLTG GEVEPVDPGP GVRPAATNEP IAVIGLGLRL PGGADLDSNI EGPEAFWEFL TEGRSSVREV PEDRWEWCED GTPEGAAALA DTTRWGSFLR DLDAFDAEYF EIIPREATRM DPQQRLLLEV THEALENAGI AADSLAETQT GVFAGASAAD YAQLGATDLS GIDAWYSTGG ALSIIANRVS YYFDLRGPSV TVDTACSSSL VAIHLACQSL RSGDSELALA AGVNLLLSPA PTRSFDRAKA MSPTGQCHAF DAGADGFVRG EGCGVAVLKR LSDAQRDGDR VLAVIRGSAV NQDGRSNGLM APNPSAQMAV LRAAYAAAGV DPREVDYVEA HGTGTLLGDP IEARALGTVL GKGRAADAPL LLGAVKSNLG HLEAAAGIAG FAKAVLALQH NMIPANLGYQ SPNPHIPFEK LRLKVVAEHT DWAPAGRPRR AGISSFGFGG TNAHVVIEQA PVFAPSPVEE PEAVTTLVVS GKSPERIASQ AAALAEWMAG AGSDASLVEV AHSLNHHRAQ HAKFATVVAR DRDQAIAGLQ ALAAGQSAPG VVGVAAGNPQ PGTVFVYSGQ GSQWPGMARQ LLVDEPAFAN ALAEIEPVFV EQVGFSLRDV IAGGETVSGD AQVQPVLMGL QLALTELWRS YDVHPDAVIG HSMGEVTAAV VAGALSLADG LKVIAARSSI MSRLAGQGAV ALLNLDADAA RSLIADHPSV EIAGYLSPRQ TVVAGLPEQV DAVIAAVTAQ NTFARRVNMV VASHTALMDP VLPDIRAALA GIEPNIPTLP FLSTVTGADS APVLDADYWV ANVRQPVKFS QAIVAAAADN GTFIEISPHS TLGQAIGNTL GDGPDHHMLG TLARDTDDTV TFHANLNATH TSRPPHTAHH GEPHIVLPTT PWHRSRHWVD VRPLRRGGGV RAGALPADSA VPPEWFCGLT WPAKPLVAQD GPVDPGASWL VVGDDALAAE MGKLLGRPVA TGDGADLSSA THVLYAPTSG DELGYELFEA GRAIATTASR VSQPPKLFLL TRNAQPVSEG DRANPGQAVL WGLGRTLALE HPDIWGALID TDESVPAVVA ARWVLAEAHA GDGEDQLVYR AGIRRVARLV HALPPAPTGS GTLDADGAHL VIGATGNIGP RLVEQLAAMG AKTVVAVSRN PGSRLDELTA RLAASGTTVV TAAADASDAS SLRAVFDRFG ADLPPLKGVY LAAMSGGPVT LDEMTHDDVV AMFRSKMDAA ALLHQLSAGH PVEQFVLFSS ISGVLGSRWL AHYAATTTFL DTFAFARRAA GLPACAINWG LWKSLADAQT GFERQATQES GLEPMDDAVA ITALRSFIGP QAPARATVVA ADWPRLAAAY HTRAQLHILD DLLATEGTAA TGLTPTGDTR FRQELKECEP QRRVELLTDH VLSQIAAAMG LASTHTLDPT VGFFQFGMDS LMSVTLQRSL SESLGEALPA SVVFDYPTVE ALTDYLASVL PEIIETAGTD NDVDGGPSVI EDAYDDLAED ELLARLSERL S
|
| |