Gene Mvan_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1051 
Symbol 
ID4645362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1102496 
End bp1104130 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content69% 
IMG OID639804552 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_951895 
Protein GI120402066 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCT TCACCGAAAC CATGTTCGCC AACGCCCACC TCAGCACCAA GGGGTTCAAC 
ACCGGCGAGC CCCATTCGCC GGTCCGCCAT TCCTGGGAAG AGGTCCACCA GCGTGCCCGC
AGGGTGGCCG GCGGGCTGGC CGCCGCCGGT GTCGGCCCGG GCGACGCGGT CGCGATGCTG
GCCGGCGCAC CCGTCGAGAT CGCCCCGGCC GCGCAGGGCG TCTGGATGCG TGGCGCCAGC
GTGACCATGG TGCACCAGCC CACACCGCGC ACCGACCTGG TGCGCTGGGC CGAGGAGACC
ACCGCCGTCA TCACGATGAT CGACGCCAAA GCTGTCGTCA TCTCCGATCC GTTCATGGCC
GCCGCCCCAG TGCTCGAAGG CCTCGGCATG ACCGTGCTGA CCATCGCCGA CCTCCTCGCC
GCCGATCGCG TCGATCCCGT CGAAACCAGC GACGACGACA TCGCGCTGAT GCAGTTGACC
AGCGGTTCCA CCGGATCCCC GAAGGCCGTC CAGATCACCC ACGCCAACAT CGTCGCCAAC
GCCGACGCGA TGACCGCCGG CTGCAACTTC GACATCGACA CCGATGTGAT CGTCAGCTGG
CTGCCGTGCT TCCACGACAT GGGCATGACC GGCTACCTCA CGGTGCCGAT GTACTTCGGC
GCCGAACTGG TGAAGGTCAC GCCCATGGAC TTTCTGAGCG ATATCCTGTT GTGGCCCAGG
CTGATCGACA AATACCGGGG CACGATGACC GCCGCGCCGA ACTTCGCCTA CAACCTGCTG
GCCCGACGCC TGCGCCGCCA GGCCGACCCC GGCGAGTTCG ACCTGTCCTC GTTGCGGTGG
GCGCTCTCGG GCGCCGAACA GGTGGATCCG CTCGACGTCG AGGACCTCTG CGACGCCGGC
GCCCCCTTCG GTCTCAAACC CGAGGCGATC GTCCCCGCGT ACGGCATGGC CGAGACCACG
GTCGCGGTGT CGTTCTCCGA ATGCGGCGGC GGCATGGTGG TCGACGAAGT GGACGCCGAC
CTGCTCGCCG TCCTGCACCG CGCGGTGCCG GCCACCAAGG GGCACACCCG ACGCCTCGTC
ACCCTCGGGC GACCCCTGCA GGGCCTGGAA CTTCGGGTCG TCGACGAGGA CGGCGCCGAA
CTGCCCGCCC GCGGTGTCGG GGTCATCGAG GTGCGCGGCG AACCTGTCAC CAAGGGATAC
ACCACCGTCG CCGGATTCAT TGGCGCGCAA GATGATCGCG GCTGGTACGA CACCGGCGAC
ATCGGCTATC TCACCGAGGT CGGCGACGTC GTGGTGTGCG GACGGCTCAA AGACGTGATC
ATCATGGCCG GACGCAACAT CTACCCCACC GACATCGAAC GGGCCGCGGG CCGCGTCGAC
GGGGTCCGGC CCGGCTGCGC CGTCGCGGTG CGCCTGGAAG CGGGCCGGTC ACGGGAGACG
TTCGCAGTCG CCGTGGAGTG CAAGGACTTC GGTGACGAAA CCCAGGTGCG CCGGGTCGAA
CGGCAGGTCG CCCACGAGGT GTTCGCCGAG GTCGACGTGC GGCCGCGCAA CGTGGTGGTG
CTGGCGCCGG GCACGATCCC GAAGACGCCG TCGGGAAAGC TGCGGCGCGC GCATGCCCTC
TCGCTCGTGA GCTGA
 
Protein sequence
MSRFTETMFA NAHLSTKGFN TGEPHSPVRH SWEEVHQRAR RVAGGLAAAG VGPGDAVAML 
AGAPVEIAPA AQGVWMRGAS VTMVHQPTPR TDLVRWAEET TAVITMIDAK AVVISDPFMA
AAPVLEGLGM TVLTIADLLA ADRVDPVETS DDDIALMQLT SGSTGSPKAV QITHANIVAN
ADAMTAGCNF DIDTDVIVSW LPCFHDMGMT GYLTVPMYFG AELVKVTPMD FLSDILLWPR
LIDKYRGTMT AAPNFAYNLL ARRLRRQADP GEFDLSSLRW ALSGAEQVDP LDVEDLCDAG
APFGLKPEAI VPAYGMAETT VAVSFSECGG GMVVDEVDAD LLAVLHRAVP ATKGHTRRLV
TLGRPLQGLE LRVVDEDGAE LPARGVGVIE VRGEPVTKGY TTVAGFIGAQ DDRGWYDTGD
IGYLTEVGDV VVCGRLKDVI IMAGRNIYPT DIERAAGRVD GVRPGCAVAV RLEAGRSRET
FAVAVECKDF GDETQVRRVE RQVAHEVFAE VDVRPRNVVV LAPGTIPKTP SGKLRRAHAL
SLVS