Gene Mvan_3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3301 
Symbol 
ID4644884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3511562 
End bp3514537 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content70% 
IMG OID639806779 
Productacyl-CoA synthetase 
Protein accessionYP_954105 
Protein GI120404276 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
[COG3243] Poly(3-hydroxyalkanoate) synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGACT TCTCGGCGAT CACCAGGCCT GTGGGGCGGT TGGTCGCGAC CGCCCAGAAC 
GGCCTCGAGG TGTTGCGCTA CGGCGGGCTG GAGACCGGCG CCGTGCCGTC GCCGTTCCAG
ATCATCGAGA GCGTGCCGAT GTACCGGCTG CGCCGCTACT TCCCGCCCGA CGTCCGTCCC
GGCAGCAAGC CCGTCGGCCC CCCGGTGCTG ATGGTCCACC CGATGATGAT GTCGGCCGAC
ATGTGGGACG TCACCCGCGA CGAGGGTGCG GTCGGCATCC TGCACAAGGC CGGGATCGAT
CCGTGGGTCA TCGACTTCGG TTCGCCCGAC AAGGTCGAGG GCGGGATGCA GCGCAACCTG
GCCGACCATG TGGTGGCCCT CAGCGACGCC ATCGACACGG TCAAGACGGT GACGGGACGC
GACGTCCATC TCGCCGGCTA CTCGCAGGGC GGCATGTTCG CCTATCAGAC GGCGGCCTAC
CGCCGGTCCA AGGACCTGGC GAGCATCGTC GGGTTCGGTT CGCCCGTCGA CACCCTGGCC
GCGCTCCCGA TGAACCTGCC GGCCAGCGTC GCCCCGTTGG CCGCCGACTT CATGGCCGAC
CACGTGTTCA GCCGCATCGA CATTCCGGGG TGGCTGGCGC GCACCGGGTT CCAGATGCTC
GACCCCGTCA AGACCGCGCA GTCGCGGCTG GACTTCCTGC GCCAGCTACA TGACCGGGAG
GCGTTGCTGC CCCGCGAGCA GCAGCGGCGG TTCCTCGCCT CGGAGGGCTG GATCGCCTGG
TCCGGTCCGG CGATCTCCGA GCTGCTCAAG CAGTTCATCG CGCACAACCG GATGATGACC
GGTGGCTTCT CCATCCACGG CGACCTTGTC ACGCTGTCCG ACATCGAGTG CCCTGTGCTG
GCCGTGATCG GGGAGGTCGA CGACATCGGT CAGCCCGCCT CGGTGCGCGG CATCAAGAGA
GCCGCTCCGA AGGCCGACGT CTACGAGTTC CTGATCCGCG CCGGGCATTT CGGGCTGGTG
GTCGGGTCGA AAGCCGCCAC GCAGACCTGG CCGACGGTCG CCGAGTGGGT GCGCTGGCTC
GACAGCGGCG GCGCGATGCC GGAGGGCGTC ACACCCATGC CGCTGCAACC GGCCGAGCCG
ACGGAAAGTG GCGTCACGCT GGCCTCGCGG GTGGCCCACA GCACCGCCGC CGCCACCGAG
ATGGCGTTCA GCCTGGCCCG GTCGGCGGCC GACGCCCTCG TCGCGGCGAA CAAGTCCGCA
CGCACCCTGG CCATCGAGAC CGCACGCACC CTGCCCCGGC TGGCCCGTCT TGGCCAGGTC
AACGACCACA CCCGAATCTC CCTGGGCCGC ATCATGAGTG AGCAGGCACG CGACCTGCCA
CACGGTGAGG CGCTGTTGTT CGACGGCCGC GTGCACACCT ACGAGGCCGT CGACCGGCGA
GTCAACAACG TGGTCCGCGG CCTGATCGGG GTCGGGGTGC GGCAGGGCGC CCGCGTCGGG
GTGCTGATGG AGACCCGGCC CAGCGCACTC GTCGCGATCG CGGCGCTGTC GCGGCTCGGC
GCGGTTGCGG TGCTGATGCC CCCCGACGCC GACCTCGCCG AGGCGGCGCG ACTCGGCGCG
GTCACCGAGA TCATCGCCGA CCCGAGCACC CTGGATACGG CCCGCAAGCT CGACATGCGG
GTCCTGGTTC TCGGCGGTGG CGAGTCCCGC GACCTGCACC TGTCCAACGG CGCCGCCGGG
GACGTCATCG ACATGGAGAA GATCGACCCC GACCTCGTCG AACTCCCGGG CTGGTACCGA
CCGAACCCCG GTCTGGCACG GGACCTGGCG TTCATCGGGT TCAGCACGAT CAGCGGCGAG
CTGGTGGCCC GCCAGATCAC CAACTTCCGC TGGGCGCTGT CGGCGTTCGG TACCGCCTCG
GCGGCCAATC TGAGCCGCAA CGACACGGTG TACTGCCTGA CCCCGTTGCA CCATCAGTCC
GGACTGTTGG TCAGCCTCGG CGGGGCGGTC GTGGGCGGTG CCCGCGTCGC GTTGTCGCGG
GAGCTGCGAC CGGACCGCTT CGTCCAGGAG ATCCGGCAGT ACGGGGTGAC GGTGGTGTCC
TACACGTGGG CGATGTTGCG TGAGGTCATC GACGATCCGG CGTTCTCGCT GAACGGGAGT
CACCCGATCC GGCTGTTCAT CGGCTCGGGT ATGCCCGCCG GTCTGTGGAA GCGGGTCGTG
GAGGTCTTCG AGCCCGCCAA CGTCGTCGAG TTCTTCGCCA CCACCGACGG TCAGGCCGTA
CTCGCCAACG TCAAGGGCGC CAAGATCGGC AGCAAGGGCA GGCCGCTACC CGGCGGCGGC
GAAATCGCGC TGGCCGCCTA CGACCCGGAC GACAACCTGA TCCTCGAGGA CGACCGCGGC
TTCGTCCGGC GGGCCGAAAC CGGAGAGGTC GGTGTGCTGC TCGCCCATCC GCGGGGCCCG
GTCGACCCGC TTGCTTCGGT CAAGCGCGGC GTGTTCGCCC CCGCGGACAC CTGGGTGTCC
ACCGAGTACC TGTTCCGCCG CGACGAGGAC GGCGACTACT GGCTGGTGGA CAACCGCGGG
GCGGCGATCC ACACCGCGCG GGGAATGGTG TTCGCCGCCA CCGTCAACGA CGCGGTCGGC
CGGCTCGGCG CCGTCGACAT GGCCGTCACC TACGGAGTCG AAGTCGAAGG TCAGACGCTG
GCCGTCACCG CCCTGGCGCT GTGCCCGGGC GGCAGCATCC CGTCGGCGGA TCTCTCGGAG
GCGCTGGCCG CCCTGCCCGT CGGCAACGCA CCCGACATCG TCCACGTGGT CTCCGACATG
ACGCTGACGG CGACGTTCCG GCCGCTGGCC GGTCCGCTGC AGAAACAGGG CATCCCCAAG
GCGTCCCGTA ACGCCTGGTA CCTGGACCCC GATAGCAATC GGTACAAGCG GTTGACCGTT
GCCGTGCGCA GCGAGCTCGC GGGCATTCGG CAGTGA
 
Protein sequence
MVDFSAITRP VGRLVATAQN GLEVLRYGGL ETGAVPSPFQ IIESVPMYRL RRYFPPDVRP 
GSKPVGPPVL MVHPMMMSAD MWDVTRDEGA VGILHKAGID PWVIDFGSPD KVEGGMQRNL
ADHVVALSDA IDTVKTVTGR DVHLAGYSQG GMFAYQTAAY RRSKDLASIV GFGSPVDTLA
ALPMNLPASV APLAADFMAD HVFSRIDIPG WLARTGFQML DPVKTAQSRL DFLRQLHDRE
ALLPREQQRR FLASEGWIAW SGPAISELLK QFIAHNRMMT GGFSIHGDLV TLSDIECPVL
AVIGEVDDIG QPASVRGIKR AAPKADVYEF LIRAGHFGLV VGSKAATQTW PTVAEWVRWL
DSGGAMPEGV TPMPLQPAEP TESGVTLASR VAHSTAAATE MAFSLARSAA DALVAANKSA
RTLAIETART LPRLARLGQV NDHTRISLGR IMSEQARDLP HGEALLFDGR VHTYEAVDRR
VNNVVRGLIG VGVRQGARVG VLMETRPSAL VAIAALSRLG AVAVLMPPDA DLAEAARLGA
VTEIIADPST LDTARKLDMR VLVLGGGESR DLHLSNGAAG DVIDMEKIDP DLVELPGWYR
PNPGLARDLA FIGFSTISGE LVARQITNFR WALSAFGTAS AANLSRNDTV YCLTPLHHQS
GLLVSLGGAV VGGARVALSR ELRPDRFVQE IRQYGVTVVS YTWAMLREVI DDPAFSLNGS
HPIRLFIGSG MPAGLWKRVV EVFEPANVVE FFATTDGQAV LANVKGAKIG SKGRPLPGGG
EIALAAYDPD DNLILEDDRG FVRRAETGEV GVLLAHPRGP VDPLASVKRG VFAPADTWVS
TEYLFRRDED GDYWLVDNRG AAIHTARGMV FAATVNDAVG RLGAVDMAVT YGVEVEGQTL
AVTALALCPG GSIPSADLSE ALAALPVGNA PDIVHVVSDM TLTATFRPLA GPLQKQGIPK
ASRNAWYLDP DSNRYKRLTV AVRSELAGIR Q