Gene Mjls_5000 details


Gene Information

Locus tag: Mjls_5000
Symbol:
ID: 4880698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5238739
End bp: 5239941
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 66%
IMG OID: 640142310
Product: virulence factor Mce family protein
Protein accession: YP_001073255
Protein GI: 126437564
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID: [TIGR00996] virulence factor Mce family protein
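
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5238739
end_bp = 5239941
gene_length_bp = 1203
protein_length_aa = 400

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # span matches the Gene Length field
print(span_bp % 3 == 0)                    # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match Protein Length
```

Under these assumptions all three checks print True, so the listed lengths and coordinates are mutually consistent.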


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.587247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGATA TCGACGCAAA GCGCAGTCAC GTACGCATCG CCGCCGCGAT CATGGCGTCG 
ATCATCGTCG CCGCCGCGGT GTTCACCTAC CTGTCGTACA CCGCGGCGTT CACCTCGACC
GACACCGTCA CCGTCTTCTC ACCGCGCGCC GGGCTGGTCA TGGAGACCGA TGCGAAGGTC
AAGTACCGCG GCATCCAGAT CGGCAAGGTC AAGGAGATCG AGTACGCCGG GGACCAGGCG
AAGCTGACCC TGGCCATCCG CAGCGACGAG ATGAAGTACA TCCCGGCCAA CGCCCCCGTG
CGCATCGCGG GTACGACGGT GTTCGGCGCC AAGGCCGTCG AGTTCATCCC GCCGGAGAAG
GCGCAGCAGA CGTCGTTGCG GCCCGGGGCC GAAGTGCAGG CCTCCGACGT CCAACTCGAG
GTCAACACGC TGTTCCAGAC CCTGACCGAT GTGCTCGGCA AGATCGACCC GATCAACCTC
AACGCCACCA TCAGCGCGCT GGGGGAGGGC TTACGCGGTA ACGGCGACGA TGTGGGCGCC
CTGCTCGAGG GCCTCAATTA CTACGTGGCG CGGCTGAACC CGAAGCTGCC CACACTGCAG
GAGGACTTCC GCAGGGCCGC CGAGGTGACC AACATCTACG GCGACGCCGG CCCGGACATC
GCGCGGATCC TCGACAACGC CCCGACGATC AGCAACACGA TCGTCGACCA GCAGGACAAC
CTCAATGCGA CACTGCTGGC CGCCACGGGT CTGGCCAACA ACGGCACCGC CACGCTGGAA
CCGGCCGCCG ACAACTACAT CGCGGCGATC CAGCGGTTGC GGGCGCCGTT GAAGGTGGCC
GGTGAGTACT CCCCGGTGAT CGGCTGCGTG CTCAAGGGCA CCGCCGTCGC CGTCGAGCGG
TTCGCCCCGA TCATCGGCGG TATCCGGCCG GGCCTGTTCG TGTCCTCGAA CTTCCTCCCC
GGCTCACCGG CGTACACGTA CCCGGAGAGC CTGCCCATCG TCAACGCCTC CGGCGGTCCC
AACTGCCGCG GCCTGCCGGA CGTGCCCAAC AAGCAGTACG GCGGCTCCTG GTACCACACC
CCGTTCGTGG TCACCGACAA CGCCTATGTG CCGTACCAGC CGAACACCGA GCTGCAGTTC
GACGCTCCCT CGACGCTGCA GTTCCTGTTC AACGGCGCGT TCGCGGAGAA GGACGAGTAC
TGA
 
Protein sequence
MPDIDAKRSH VRIAAAIMAS IIVAAAVFTY LSYTAAFTST DTVTVFSPRA GLVMETDAKV 
KYRGIQIGKV KEIEYAGDQA KLTLAIRSDE MKYIPANAPV RIAGTTVFGA KAVEFIPPEK
AQQTSLRPGA EVQASDVQLE VNTLFQTLTD VLGKIDPINL NATISALGEG LRGNGDDVGA
LLEGLNYYVA RLNPKLPTLQ EDFRRAAEVT NIYGDAGPDI ARILDNAPTI SNTIVDQQDN
LNATLLAATG LANNGTATLE PAADNYIAAI QRLRAPLKVA GEYSPVIGCV LKGTAVAVER
FAPIIGGIRP GLFVSSNFLP GSPAYTYPES LPIVNASGGP NCRGLPDVPN KQYGGSWYHT
PFVVTDNAYV PYQPNTELQF DAPSTLQFLF NGAFAEKDEY
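
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how they might be processed, the code below joins the blocks, translates the DNA with the amino-acid assignments shared by NCBI translation tables 1 and 11, and computes a GC percentage. The helper names `clean`, `translate` and `gc_percent` are hypothetical and do not come from the source database; only the first row of the gene sequence is used as input.

```python
from itertools import product

# Amino-acid assignments for NCBI translation tables 1 and 11 (they differ only
# in permitted start codons), listed in TCAG codon order: TTT, TTC, TTA, TTG, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from this record.
first_row = "ATGCCAGATA TCGACGCAAA GCGCAGTCAC GTACGCATCG CCGCCGCGAT CATGGCGTCG"
dna = clean(first_row)

print(len(dna))        # 60
print(translate(dna))  # MPDIDAKRSHVRIAAAIMAS
```

The translated residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.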