Gene Mmcs_5332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5332 
Symbol 
ID4114159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5615810 
End bp5617450 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content67% 
IMG OID638034488 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_642489 
Protein GI108802292 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.7188 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAGCA CCATGCAGGA CGTCCCGCTG ACAGTCGCCG CGATTCTGCG GTACGCCGCC 
ACAGTCCACG GCGACCGAAC GGTGACGACC GCGACGGGAA ACGGTGGTTA CCGGCACGCC
ACCTACCGTG AGGTCGGACA GCAGGCGGCC CGACTGGCCC ATGCGCTGCG CCGGTTCGGG
ATCGAGGGTG ACGACCGCGT CGGGACGTTC ATGTGGAACA ACCAGGAGCA TCTGGAGGCC
TACGTCGCGG TGCCCTCGAT GGGCGCGGTG CTGCACACGC TGAACATCCG GCTGTTCCCG
GAGCAGATCG AGTTCGTCGC CTACGAGGCC GAGGACCGCG TGCTGATCGC CGATCTGTCG
CTGGCGCCGG TGCTTGCTCC GGTGCTGCGC TCGCTGGAGA CCGTGCACAC GGTGATCGCC
GTCGGTGAGG GTGACCTGGC GCCGTTCGAG GAGTCGGGCA AACGAGTGGT GCGCTACCAC
GAGGTGACCG CCGCCGAATC CGACGAGTAC GACTGGCCCG ACATCGACGA GAACTCCGCT
GCCGCAATGT GTTACACCAG CGGAACCACC GGCCACCCGA AGGGCGTCGT ATACGGGCAC
CGGTCGAGCT ACCTGCACTC GATGGCGGTC TGCGGGGGCA ACGGACTCGG GATGAGCTTC
TCCGACAAGG CGCTGCCGAT CGTGCCGATG TTCCACGCCA ATGCCTGGGG TCTGCCCTAC
GCCGCCCTGA TGGCCGGAGC TGACCTGGTG CTTCCCGACC GCTTCATGGA CGCCACGTCC
CTCGTCGACC TGATCGAGAC GCAGCGGCCG ACTGTCGCCG GTGCGGTACC CACAATCTGG
AACGACGTCA TGCACCACCT CGACCAGAAC CCCGGCCACG ACATCTCCTC GCTGCGGCTC
GTCGGCTGCG GCGGGTCGGC GGTGCCGGTG TCGCTGATGA AGGCGTTCGA GGAGAAGTTC
GGCGTGCAGA TCCGGCAGTT GTGGGGGATG ACGGAGACGT CCCCGGTTGC GACGATCGCG
TGGCCGCCGC CGGACACCCC CGCGGAGAAG CACTGGCAGA TCCGCAGCAC CCAGGGCCGT
CCGCTCTGCG GTGTGGAGGC CCGCATCGTC GACGACGACG GTGCGGTGCT GCCCAACGAC
GGTGAGTCAG TCGGTGAACT CGAGGTCCGC GGGCCCTGGA TCACCGGGTC CTATTACCGC
AACACCGACG ACTCGAAGTT CCAGTCGGGT TGGCTGCGCA CCGGCGACGT GGGCCGTATC
GATCCGCAGG GTTACATCAC CCTGACCGAC CGCGCCAAGG ACGTCATCAA GTCCGGCGGT
GAGTGGATCT CGTCGGTGGA GTTGGAGAAC CACCTCATCG CGCATCCGGC GGTGCGAGAA
GCCGCCGTGG TCGGGGTGCC CGACGAGCGT TGGCAGGAGC GGCCATTGGC GGCGGTCGTC
GTCCAGGAGG GTGCCCAGGT GGACGCTGAC GAACTGCGGA ACTTCCTGGC CGACAAGGTT
GTCCGGTGGT GGCTTCCCGA GCGCTGGACC TTCGTCGACG AAATCCCACG CACCAGCGTC
GGTAAGTACG ACAAGAAGGT GATCCGGGCG CGCTACGCGG ACAACGCATA CCAGGTCGCC
GACCTGCGCG AGCACACGTA G
 
Protein sequence
MYSTMQDVPL TVAAILRYAA TVHGDRTVTT ATGNGGYRHA TYREVGQQAA RLAHALRRFG 
IEGDDRVGTF MWNNQEHLEA YVAVPSMGAV LHTLNIRLFP EQIEFVAYEA EDRVLIADLS
LAPVLAPVLR SLETVHTVIA VGEGDLAPFE ESGKRVVRYH EVTAAESDEY DWPDIDENSA
AAMCYTSGTT GHPKGVVYGH RSSYLHSMAV CGGNGLGMSF SDKALPIVPM FHANAWGLPY
AALMAGADLV LPDRFMDATS LVDLIETQRP TVAGAVPTIW NDVMHHLDQN PGHDISSLRL
VGCGGSAVPV SLMKAFEEKF GVQIRQLWGM TETSPVATIA WPPPDTPAEK HWQIRSTQGR
PLCGVEARIV DDDGAVLPND GESVGELEVR GPWITGSYYR NTDDSKFQSG WLRTGDVGRI
DPQGYITLTD RAKDVIKSGG EWISSVELEN HLIAHPAVRE AAVVGVPDER WQERPLAAVV
VQEGAQVDAD ELRNFLADKV VRWWLPERWT FVDEIPRTSV GKYDKKVIRA RYADNAYQVA
DLREHT