Gene Mjls_1470 details


Gene Information

Locus tag: Mjls_1470
Symbol:
ID: 4877205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 1575650
End bp: 1579075
Gene Length: 3426 bp
Protein Length: 1141 aa
Translation table: 11
GC content: 74%
IMG OID: 640138777
Product: SARP family transcriptional regulator
Protein accession: YP_001069762
Protein GI: 126434071
COG category: [T] Signal transduction mechanisms
COG ID: [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
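
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1575650, 1579075)
print(length, protein_length_aa(length))  # 3426 1141
```

Both values match the Gene Length and Protein Length fields of the record.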


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTGGGAC CGCTGCAGGT GACGCATGCG GACTGCCCGG TCGACATCGG GCCGCCGAAG 
CAGCGCGCGG TGCTCGCCGT GCTCCTGCTC GCCGCGGGCC GGGTCGTATC GGTGGACCGG
CTGATCGACG CGGTCTGGGG TGACGATGCC CCGGGCAGCG CGACGGCCAG CCTGCAGGCC
TACATCTCCA ATCTGCGCCG GGCCCTGCGC GACGCGGGTC AGTCGCAGGT CGCCTCGCCG
ATCGTGCGGC AACCGCCGGG GTATTTCCTC AACGTCGAAC CCGGCCAGGT CGATCTGGCG
GTGTTCGCCT CCGGCTGCGC CAGGGCGGTG GCCGCCGTGG ACAGCGGGGA CTGGGACGCG
GCGCTGGCCG CCGCCGACGA GGCCCTGGCG TGGTGGCGTG GGCCGCTGCT GGCCGACCTG
TCCGACGAAC CGTGGGTGGC CGACGAGGCG GCCCGCGCCG AGCACCTGCG CGCCGATTGT
CTGGATGCGC GGATCACCGC GCTGCTGGGG CTGGGCCGGG TGCCGCAGGC GCTGGCGGCG
GCGGCCGAGT TGCGCTCCGC CACACCGCTG GCCGACCGCG GCTGCTGGCT GCACATGCTC
GCGCTGTACC GGGCGGGCCG GGTGACCGAC GCGCTGGACG TCTACACCCG CCACGCTCGG
CTGCTCGACG ACGAACTCGG CGTGCAGCCG GGACGCGAGG TCCGCGAGCT GCAGACCGCG
ATGCTGCGGC AGGCACCCGA GTTGGCGGCG TGGCCGCGGT CCCCGGAGTG GACGGGCGCC
GGTGCGGTGG CCACCCCGGC CGCGCCCACG GTCGAGCCGT CGGTGGTCCC GGCGGGCCCG
AGCCGGGGCG CGCTGATCGG CCGCAGCCGC GAATTGTCCA CCGCCGCAGG TGTGCTCTCG
GACGTCACGG CGGGTGCGGC GCGGTGGCTG GTGCTGTCGG GTCCGGCGGG AATCGGCAAG
ACCCGGCTGG CGGAGGAGGT CGCGGCCCGG GTGGTCGCCG ACGGCGGGGA CATGGTGTGG
GTGAGCTGTC CCGACGAGCG GGCGACCCCG CCGTGGTGGC CGATGCGGCA GCTGGTGCGG
GCGCTGGGCG CCGATCCCGA CGACGTGCTG GAGGTGCCGC CGGACGCCGA TCCGGACACC
GCGAGATTCC GTGTCTACGA ACGCATCCAG ACCCTGCTGG AGTCGGCGCC GCGCACGCTT
GCGGTGGTCA TCGACGACGT GCAGTGGGCG GACACCACGT CGGCGGCGTG CCTGGCCTAC
ATCGCCGGGG CGCTGCGCGA CCACGCGGTG GCGCTGATCC TGACCGTGCG CGACGGCGAA
CACAGCGCCG AGGTTTCCAG GCTGGTGACG ACCGTGGCCC GTGGCGACCG CAACCGCCAC
GTGGCGGTTC CGGCGTTGTC CACCGAAGAT GTTGCGGCGC TGGCGAATCA GGTCGCCGAC
GATCCGGTGA CCGAGGCGGA GGCGGCGCTG CTGGCCGACC GGACGGGCGG CAACCCGTTC
TTTGTCTCCG AGTACGCTCG GCTGCCCCGC GCCGACCGGG TCGGCAGCGA GATCCCGGTC
GCGGTGAAAT CGGTGCTGGA CCGCCGCCTG GCCGGGCTCG ACCCGGCCGC CGTGCAGGTG
CTGCGGACGG CCGCGATCAT CGGTGACACG CTCGATTCGG ACGCCGTGCC GGTGCTGGCC
CAGGCCACCG GAATGGACGT CGACACGCTG GCCGACCATC TCGACGATGC GGCCGACGAG
CGCATCGTGA TCGCCGCGCA CACCGGTGAC GGGTACGCGT TCGCGCACGG ACTCCTCCGT
GACCATCTCA TCGCGGGGAT CCCCCCGCTG CGCCGCCAAC GCCTGCACGC CAAGATCGCC
GACGTGCTCG ACGGCAGCAC CGCCGAGGGT GCGCTGACGC GCCGCGCCCA GCACCTCATC
GCCGCGCAGC CCCTGGTGGA CGCCGGCGCG GTGGTGCAGG CGTGCCGACT GGCCGCCGAG
GATGCCACGG CGCGGTGGAG TTCGGACATC GCGGCGGTGT GGTGGCAGGC CGCGCTGGAC
GCCTACGACC GCCTCCCGGC GGCGTCGCGC TCGGAAGAGG AGCGCGACGG GCTGACCGTG
GCAATGCTCG AGGCGCATTC GCGCGCCGGG CGCGGCCGGC TGGTCCTCGA CACCGTCGCC
GCACAACTCG GTGATGCCGT GCGCACCGGT CGGGCCGCGA CGGCCGGTCG GCTGGCCAGC
GCGCTGCTGC GGGCCAGCGG CGGGTGGCCG TGGCTGGCCC CCGGCCATGA TCCCGGTGGG
GTGCTCGCGC TGCTGGAGGG GGCCGCGGTG CTGGCCGAGA GTGACCCCGC CGCGGGGGCG
CGGGTGCTGA CCGCCCTCGC CGTCGGGCAC TGCTACCACC CCGACGCCGC GGTGTCGGCC
GGGCATCTCG AACGGGCCGC GCGGTTGGCC GAGGCCACCG GGGACCGCGA TGTCATTGCC
GACGTGTTGA TGGGCCGGTT GATCACCTAC TCCGGGGTGG CCGCGTACAG CCATCAGACT
TTGGAGTGGG TCGCGGAACT GAACGCGCTC GGGCACAGCA GGTCCCGGGA GGACTCCGTG
ATCGCGCACT CGGTGGCCAC GATGGCCGCG GTGAACCTGA CCGAGATCGA CCTGGCGAAA
CTGCATCTGC AGGAGGGCAT CTCGGGCAGT GAGGAACTGC GGCTGCCGGT GCTGCGGGCA
CAGCTGCGCT GGATGGAGGC GGTGCTGGCG GTGTGGCGGG GGGACTTCGC CGAAGCCGAA
CGCCACCACC GGATCGCGGC GGAGGTTCAT GAGCAGACCG AATTGTACGA AGCCGGAAGC
GGTTTGATCG CAACGGTGAT TCTGATCCGT GAGAGGGGCG GCCCCGTCGA GCCGGGTTGG
CCGGGCTCGC GTGCCGACAC CGAGAGCGGG GGACAGGGCA TGGTCGGCCT GGTGCACACC
GCTCTGCTCA CCGTGGACAG CGGCGACGAG GCGCGGGCGC AGGCGCTGAT GCGACTGCGG
GAGTGGGACG CCCAACCGCA CCGGGCACAT GTTTGGACGA CGCTCGGGCA CGCGACGCTG
CTGGCCCATC TGGCGTGCGA CCACGGATAC GCCGAGTTCG CCCCCGCGCT GCTGGAGAGG
CTGCTGCCGT TCGTCGACCG CATCGCGGAG ATCGGTCAGG TCGGCGTGGT GGGGCCGGTC
GCCCTGGCGA CCGCGCGTCT GCGGGCGCTG ATGGGCGACA CCGACCGCGC GCTGGCCGAC
CTGGCCGACG CCGAGGACAT CGCCGCGCGC ACCGGGGGTG TTCCCGTCCT ACTGCGGTGC
CGGCTGCTGC GTGCGGAACT GACCCCGCCG GGTGAGCAGC GGAAGGCGGC GGCGCGGGCG
CTCGCCACTG ATGCCGATGC GCTGGGCATG CGCGGCGTGG CCGATTTGGC ACGTCGGCTC
GCGTGA
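
The gene sequence is printed in blocks of 10 bases separated by spaces, so any composition calculation has to strip whitespace first. The sketch below shows one way a GC percentage like the one in the GC content field could be computed from such a listing; `gc_percent` is a hypothetical helper, and the database may round or count differently.

```python
def gc_percent(raw: str) -> float:
    """GC percentage of a sequence given as whitespace-separated blocks."""
    seq = "".join(raw.split()).upper()  # remove spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above: 13 of 20 bases are G or C.
print(gc_percent("TTGCTGGGAC CGCTGCAGGT"))  # 65.0
```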
 
Protein sequence
MLGPLQVTHA DCPVDIGPPK QRAVLAVLLL AAGRVVSVDR LIDAVWGDDA PGSATASLQA 
YISNLRRALR DAGQSQVASP IVRQPPGYFL NVEPGQVDLA VFASGCARAV AAVDSGDWDA
ALAAADEALA WWRGPLLADL SDEPWVADEA ARAEHLRADC LDARITALLG LGRVPQALAA
AAELRSATPL ADRGCWLHML ALYRAGRVTD ALDVYTRHAR LLDDELGVQP GREVRELQTA
MLRQAPELAA WPRSPEWTGA GAVATPAAPT VEPSVVPAGP SRGALIGRSR ELSTAAGVLS
DVTAGAARWL VLSGPAGIGK TRLAEEVAAR VVADGGDMVW VSCPDERATP PWWPMRQLVR
ALGADPDDVL EVPPDADPDT ARFRVYERIQ TLLESAPRTL AVVIDDVQWA DTTSAACLAY
IAGALRDHAV ALILTVRDGE HSAEVSRLVT TVARGDRNRH VAVPALSTED VAALANQVAD
DPVTEAEAAL LADRTGGNPF FVSEYARLPR ADRVGSEIPV AVKSVLDRRL AGLDPAAVQV
LRTAAIIGDT LDSDAVPVLA QATGMDVDTL ADHLDDAADE RIVIAAHTGD GYAFAHGLLR
DHLIAGIPPL RRQRLHAKIA DVLDGSTAEG ALTRRAQHLI AAQPLVDAGA VVQACRLAAE
DATARWSSDI AAVWWQAALD AYDRLPAASR SEEERDGLTV AMLEAHSRAG RGRLVLDTVA
AQLGDAVRTG RAATAGRLAS ALLRASGGWP WLAPGHDPGG VLALLEGAAV LAESDPAAGA
RVLTALAVGH CYHPDAAVSA GHLERAARLA EATGDRDVIA DVLMGRLITY SGVAAYSHQT
LEWVAELNAL GHSRSREDSV IAHSVATMAA VNLTEIDLAK LHLQEGISGS EELRLPVLRA
QLRWMEAVLA VWRGDFAEAE RHHRIAAEVH EQTELYEAGS GLIATVILIR ERGGPVEPGW
PGSRADTESG GQGMVGLVHT ALLTVDSGDE ARAQALMRLR EWDAQPHRAH VWTTLGHATL
LAHLACDHGY AEFAPALLER LLPFVDRIAE IGQVGVVGPV ALATARLRAL MGDTDRALAD
LADAEDIAAR TGGVPVLLRC RLLRAELTPP GEQRKAAARA LATDADALGM RGVADLARRL
A
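
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the accepted alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the amino acid assignments of table 11 (identical to the standard code) and treats the first codon as the initiator; `translate_table11` is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_table11(raw: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        # Assumption: the first codon is the initiator, so it is read as M
        # regardless of which start codon (ATG, GTG, TTG, ...) is used.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate_table11("TTGCTGGGAC CGCTGCAGGT GACGCATGCG"))  # MLGPLQVTHA
```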