Gene Mjls_3374 details


Gene Information

Locus tag: Mjls_3374
Symbol:
ID: 4879086
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3537613
End bp: 3538899
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 72%
IMG OID: 640140677
Product: hypothetical protein
Protein accession: YP_001071643
Protein GI: 126435952
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism; [T] Signal transduction mechanisms
COG ID: [COG2508] Regulator of polyketide synthase expression
TIGRFAM ID:
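
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions match the values in this record, but the record does not state them. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3537613, 3538899)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```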


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.238884
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGACA ATCGGTACGT GCCGCCGGCC TCGACGGTCG AGGTGCTGCA GACCGTGCCG 
GATTCCGTGC TGCGCCGGTT GAAGCAGTAC TCGGGGCGAT TAGCCACCGA GGCGGTGGCC
TCCATGCAGG ACCGGCTGCC GTTCTTCGCC GGCCTGGAGG CCTCGCAGCG CGCGAGCGTG
CACCTGGTGG TGCAGACGGC GGTGGTCAAC TTCGTCGAGT GGATGCGCAA CCCGCAGAGC
AACGTCAGCT ACACCGCGCA GGCGTTCGAA GTCGTGCCCC AGGACCTGAC CCGCCGCATC
GCGTTGCGCC AGACGGTCGA GATGGTGCGC ACCACGATGG AGTTCTTCGA AGAGGCCGTG
CCGCTGCTCG CGCGCTCGGA CGAGCAGGTC ACGGCCCTGA CCGCCGGCAT CCTGCGCTAC
AGCCGGGATC TGGCGTTCGC CGCCGCCTCG GCCTACGCCG ACGCGGCGGA GGCACGCGGC
GCGTGGGACA CCCGGATGGA GGCCAACGTC GTCGACGCGG TGGTCCGCGG TGACGTCGGG
CCGGAGTTGC AGTCCCAGGC GGCCACGCTC AACTGGGACG CCACCGCCCC GGCCACCGTC
ATCGTCGGCC TCCCCCACCC CGACCGAATC GATCTCGCCG CCGACGACGT GCGCGACATC
GTGACCCGCA ACGACCGGGC CGCGCTGTCG GATGTGCACG GCACCTGGCT GGTCGCGATC
GTCTCGGGGC CGATCTCACC CACCGACCGC TTCCTCAAGG AGATCCTCGT CGCGTTCGCC
GACGGCCCGG TCGTCGTCGG CCCGACCGCG CCGACGCTGT CGGCGGCCCA CCTGTCGGCC
ACCGAGGCCA TCGCCGGCAT GAACGCGGTC GCCGGGTGGA GCGGTGCGCC CCGGCCGGTG
TCGGCGCGCG AACTGCTGCC CGAACGTGCG CTGCTCGGGG ACGCGACGGC CATCGCGGCG
CTGGAAACCG AGGTGATGCG GCCGCTCGGC GACGCGGGCC CGGCACTCAC CGAGACCCTC
GACGCCTACC TCGACGCGGG GGGCGCGATC GAGGCGTGCG CCCGTCAATT GTTCGTTCAT
CCAAATACCG TCCGCTACCG GCTCAAACGG ATCGCCGACT TCACCGGGCG CGACCCCACC
CTGCCGCGCG ATGCCTACGT GCTCCGGGTG GCCTCCACCG TCGGCCGGCT CAACCGGCAA
GCGGCCCAGT CCAGCCGGGC GAGTGGGCCA TCCCCGCAGG TCCCGGCCGC GCGGCCGGCC
AGTGCGACCC TCTACCGCAG CGGATGA
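
The gene sequence is laid out in blocks of ten bases, so it has to be joined into one string before its length or GC content can be compared with the fields in the Gene Information section. The following is a minimal sketch of that step, using only the first three blocks of the sequence as input. The helper names are illustrative, and the GC content here is simply the share of G and C among all bases, which may differ from how the database computes its figure.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, split over two lines.
excerpt = "ATGCCCGACA ATCGGTACGT\nGCCGCCGGCC"
seq = clean_sequence(excerpt)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 2))  # 0.7
```

Passing the full sequence block to the same two functions gives values that can be compared with the Gene Length and GC content fields.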
 
Protein sequence
MPDNRYVPPA STVEVLQTVP DSVLRRLKQY SGRLATEAVA SMQDRLPFFA GLEASQRASV 
HLVVQTAVVN FVEWMRNPQS NVSYTAQAFE VVPQDLTRRI ALRQTVEMVR TTMEFFEEAV
PLLARSDEQV TALTAGILRY SRDLAFAAAS AYADAAEARG AWDTRMEANV VDAVVRGDVG
PELQSQAATL NWDATAPATV IVGLPHPDRI DLAADDVRDI VTRNDRAALS DVHGTWLVAI
VSGPISPTDR FLKEILVAFA DGPVVVGPTA PTLSAAHLSA TEAIAGMNAV AGWSGAPRPV
SARELLPERA LLGDATAIAA LETEVMRPLG DAGPALTETL DAYLDAGGAI EACARQLFVH
PNTVRYRLKR IADFTGRDPT LPRDAYVLRV ASTVGRLNRQ AAQSSRASGP SPQVPAARPA
SATLYRSG