Gene Mmcs_5237 details


Gene Information

Locus tag: Mmcs_5237
Symbol:
ID: 4114065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5523711
End bp: 5525345
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 69%
IMG OID: 638034394
Product: long-chain-fatty-acid--CoA ligase
Protein accession: YP_642395
Protein GI: 108802198
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
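
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (which is not counted in the protein length). The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 5523711
END_BP = 5525345


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1635
print(protein_length(length_bp))  # 544
```

Both results agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields.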


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0398716
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCACTT TTACCGAGAC CATGTACAGC AATGCCCGCT CGAGCACCAA AGGGATGGTC 
ACCGGAGAAC CGGGCGACCC CGTCCGCCAC ACCTGGCTCG AGGTCCACGA GCGTGCCTTG
AGAATCGCCG GCGGGCTCGC CGCGGCAGGG GTCGGGCACG GCGACGCGAT CGCGGTGCTG
GCCGGGGCGC CCGTCGAGAT CGCGCCGACA GCCCAGGGCA TCTGGATGCG CGGTGCGAGC
CTGACCATGC TGCACCAGCC GACCCCGCGC ACCGATCTGC AGCGCTGGGC CGACGAGACC
ACCGCGGTCA TCGACATGAT CGACGCGAAG GCCGTGATCG TGTCCGAGCC GTTCATGCCC
GCGGCTCCGC TGCTGTCCGG GCTCGGGATG CTGGTGCTGA CGGTGGCCGA GCTCCTCGCC
CACGAACCGG GTGAGATGGT CCCGACCGGT GACGACGACA TCGCCCTCAT GCAGTTGACG
TCCGGGTCCA CGGGTTCACC GAAGGCCGTG CAGATATCGC ATGCCAACGT GGTGGCCAAC
GCCGAGGCGA TGATCGTCGG CTGCGAGTTC GACATCGAGA CCGACGTGAT CGTGAGTTGG
CTGCCGTGTT TCCACGACAT GGGCATGACG GGGTACCTCA CCGTGCCCAT GTACTTCGGC
GCGGAACTGG TCAAGGTCAC GCCGATGGAC TTTCTGCACG ACACGCTGCT GTGGGCCAAG
CTCATCCACA AATACCGCGG CACCATGACC GCGGCGCCCA ACTTCGCCTA CACCCTGTTC
GCCAAGCGGC TGCGCCGGCT CGCCACCCCC GGCGAGTTCG ACCTGTCGTC GCTGCGGTGG
GCGCTGTCGG GCGCCGAGCA GGTCGACCCG CTCGACGTCG AGGATCTCTG CGAGGCCGGT
GTGCCGTTCG GGTTGAAGCC GGAGGCGATC ATCCCGGCGT ACGGCATGGC CGAGACGACG
GTCGCGGTGT CGTTCTCCGA ATGCGGTCGC GGCATGGTCG TCGACGAGGT GGACGCCGAC
CTGCTGTCCG TGCTCCACCG CGCCGTGCCT GCGAACAAGG GCCACACCCG GCGGCTCGTA
TCGCTGGGCC GGCCGCTGCC GGGGCTCGAG GTGCGGGTGC TCGACGAGGA CGGTGCGGTC
CTGCCCGCCC GCGGGGTGGG GGTGATCGAG GTGCGGGGCC GGCCGGTGAG CCGCGGCTAC
ACCACGACCG CCGGATTCGT CCCGGCGCAG GACGAGTTGG GTTGGTACGA CACCGGTGAC
CTCGGATACC TCACCGAGGC AGGCGAGGTG GTGGTGTGCG GCCGGCTCAA GGACGTGATC
ATCATGGCCG GCCGCAACAT CTATCCGACG GACATCGAGC GGGCCGCGGC CCGGGTCGAC
GGTGTGCGGC CGGGTTGCGC AGTGGCCATC CGCCTCGACG GAGGCCACCC ACGCGAGACA
TTCGCCGTGG CGGTGGAGAG CAAGCACTTC GAGGACGTCA AGCAGGTCCG GCGCATCCAG
CGTCAGGTGG CCCACGAGGT GGTGGCGGAG GTCGACGTGC GGCCCCGCAA CGTGGTGGTG
CTCGAGCCCG GCATGATCCC GAAGACACCG TCGGGCAAGC TGCGCCGGGC CCACGCGCTA
TCGCTCATCG ACTGA
 
Protein sequence
MSTFTETMYS NARSSTKGMV TGEPGDPVRH TWLEVHERAL RIAGGLAAAG VGHGDAIAVL 
AGAPVEIAPT AQGIWMRGAS LTMLHQPTPR TDLQRWADET TAVIDMIDAK AVIVSEPFMP
AAPLLSGLGM LVLTVAELLA HEPGEMVPTG DDDIALMQLT SGSTGSPKAV QISHANVVAN
AEAMIVGCEF DIETDVIVSW LPCFHDMGMT GYLTVPMYFG AELVKVTPMD FLHDTLLWAK
LIHKYRGTMT AAPNFAYTLF AKRLRRLATP GEFDLSSLRW ALSGAEQVDP LDVEDLCEAG
VPFGLKPEAI IPAYGMAETT VAVSFSECGR GMVVDEVDAD LLSVLHRAVP ANKGHTRRLV
SLGRPLPGLE VRVLDEDGAV LPARGVGVIE VRGRPVSRGY TTTAGFVPAQ DELGWYDTGD
LGYLTEAGEV VVCGRLKDVI IMAGRNIYPT DIERAAARVD GVRPGCAVAI RLDGGHPRET
FAVAVESKHF EDVKQVRRIQ RQVAHEVVAE VDVRPRNVVV LEPGMIPKTP SGKLRRAHAL
SLID
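
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation under translation table 11. This is a minimal illustration, not the tool that produced the record. It assumes the table 11 amino-acid assignments match the standard code, that the first codon of a full CDS is read as methionine even when it is GTG, and that a single trailing stop codon is dropped. The function name and the `first_is_start` parameter are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str, first_is_start: bool = True) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and first_is_start:
        protein[0] = "M"  # assumption: initiator codon is read as Met
    if protein and protein[-1] == "*":
        protein.pop()     # drop the terminal stop codon
    return "".join(protein)


# First 21 nt and last 15 nt of the gene sequence above.
print(translate_cds("GTGAGCACTT TTACCGAGAC C"))               # MSTFTET
print(translate_cds("TCGCTCATCG ACTGA", first_is_start=False))  # SLID
```

The two fragments reproduce the first seven and last four residues of the listed protein sequence.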