Gene Mjls_1717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_1717 
Symbol 
ID4877441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp1810288 
End bp1812669 
Gene Length2382 bp 
Protein Length793 aa 
Translation table11 
GC content67% 
IMG OID640139016 
ProductABC transporter related 
Protein accessionYP_001069998 
Protein GI126434307 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.778386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGAA AGACGGGTAA GCACGCCGCC GACACCCACG ACGTCATCCG CGTGGTCGGC 
GCGCGGGAGA ACAACCTCAA GAACATCGAC GTCGAACTGC CGAAACGGCG GCTGACCGTG
TTCACCGGGG TGTCGGGATC GGGTAAGAGC TCGTTGGTGT TCGGCACCAT CGCCGCGGAA
TCCCAGCGTC TGATCAACGA GACGTACAGT GCGTTCCTGC AGGGCTTCAT GCCGTCGATG
TCGCGGCCGG ACGTGGACGT CCTCGAAGGG CTGACGACGG CGATCATCGT CGACCAGGAG
CGGATGGGCG CCAACCCGCG CTCGACGGTC GGCACGGCGA CCGACGCCCA TGCCATGCTG
CGGATCCTCT TCAGCCGCCT CGGTGAGCCG CACATCGGTT CACCGCAGGC GTTCTCGTTC
AACGTCGCCT CCGTCAGCGG GGCCGGCGCG GTGACGTTCG ACAAGGGCGG CAGGACCGTC
AAGGAGAGGC GCGAGTTCTC GATCACCGGC GGGATGTGTC CACGGTGCGA GGGCCGCGGG
TCGGTGTCCG ACATCGACCT CACCGCGCTC TACGACGACT CCAAATCCCT CAACGAGGGC
GCGCTGAGCA TCCCCGGCTA CAGCATGGAG GGTTGGTACG GCCGAATCTT CCGTGGCTGT
GGGTATTTCG ACCCCGACAA ACCGATCCGC AAGTACACCA AGAAGGAACT CAACGACCTC
CTGTACCGCG AGGCCACGAA GATCAAGGTC GACGGGGTCA ACCTCACCTA CGCCGGACTG
ATCCCCACGA TCCAGAAATC GTTCCTGTCC AAGGACGTCG ACGCGATGCA ACCCCACATC
CGCGCATTCG TCGAACGGGC GGTGACGTTC GCGACCTGCC CCGAGTGCGA GGGCACGCGC
CTGACCGAAC AGGCACGGTC GTCGAAGATC AAGGGCTGCA GCATCGCCGA CGTGTGCGCG
ATGCAGATCA GCGACCTCGC CGAGTGGATC CGCGGACTCG ACGAAGCCTC CGTCCGTCCC
CTGCTGGACG GTTTGGGCCA CCTTCTGGAT TCGTTCACCG AGATCGGCCT GGGCTACCTC
TCGCTGGACC GTCCCGCAGG CACGCTGTCC GGGGGAGAAG CTCAGCGCAC GAAGATGATC
CGCCACCTCG GCTCGTCACT GACCGACGTC ACCTACGTCT TCGACGAGCC CACCATCGGC
CTGCATCCCC ACGACATCGA ACGGATGAAC ACGCTGCTGC TGCGCCTGCG GGACAAGGGC
AACACGGTGC TCGTCGTCGA GCACAAACCC GAGACGATCG TCATCGCCGA CCGCGTCGTG
GACCTCGGAC CCGGTGCGGG TACCGGTGGC GGCGAGGTGG TCTTCGAGGG CCCCGTCGCC
CAGCTCCGTC GCAGCGGCAC GCTCACCGGA CGTCACCTCG ACGACCGGGC GGCCATGAAG
AAGTCTGTGC GCCAAGCACA AGGCGCCCTG GAGATCCGCG GTGCCACGAC GAACAACCTG
CGTGACGTCG ACGTCGACAT CCCGCTCGGT GTCCTCACGG TGCTCACCGG TGTCGCGGGG
TCGGGGAAGA GCTCGCTCAT CGACGGTTCG GTGGCCGGCC GCGACGAGGT CGTCTCGATC
GATCAGGGCG CGATCCGAGG TTCCCGGCGA AGCAACCCCG CCACCTATAC CGGCCTGCTC
GACTCCATCC GCAAGGCGTT CGCCAAGGCC AACGGCGTGA AGCCGGCGCT GTTCAGTTCC
AACTCCGAAG GCGCCTGCCC GGCCTGCAAG GGCGCCGGTG TCATCTACAC CGAACTCGGC
GTCATGGCGA CCGTGGAATC ACCGTGCGAG GAATGTGAGG GACGACGGTT CCAGGCCTCG
GTCCTCGAGT ACACGCTCGG CGGCCGGAAC ATCGCCGACG TGCTCGAGAT GTCGGTGGCG
GACGCGCTCG GCTTCTTCGC GGACGGCGAG GCCGCGACCC CGGCCGCGCA CAAGGTGCTC
GACCGTCTCG CCGATGTGGG GCTCGGATAC CTCAGCCTCG GTCAGCCGCT CACCACACTC
TCCGGGGGCG AACGGCAGCG CCTCAAGCTG GCCACCCGGC TGGGGGACAC CGGTGCCGAC
AAGAAGGACG TCTACGTACT CGACGAGCCG ACCTCGGGTC TGCACCTCGC CGACGTCGAG
CAGCTGCTCG CCCTGCTCGA CCGGCTGGTC GACTCCGGCA AGTCGGTCAT CGTGATCGAG
CACCACCAGG CCGTGATGGC GCACGCGGAC TGGATCATCG ACCTCGGTCC CGGCGCCGGC
CACGACGGGG GCCGGATCGT CTTCGAGGGA CCTCCGGCAG ACCTCGTGGC CAGCCGGGCG
ACCCTCACCG GTGAGCATCT CGCCGACTAC GTCGGCGGCT GA
 
Protein sequence
MAGKTGKHAA DTHDVIRVVG ARENNLKNID VELPKRRLTV FTGVSGSGKS SLVFGTIAAE 
SQRLINETYS AFLQGFMPSM SRPDVDVLEG LTTAIIVDQE RMGANPRSTV GTATDAHAML
RILFSRLGEP HIGSPQAFSF NVASVSGAGA VTFDKGGRTV KERREFSITG GMCPRCEGRG
SVSDIDLTAL YDDSKSLNEG ALSIPGYSME GWYGRIFRGC GYFDPDKPIR KYTKKELNDL
LYREATKIKV DGVNLTYAGL IPTIQKSFLS KDVDAMQPHI RAFVERAVTF ATCPECEGTR
LTEQARSSKI KGCSIADVCA MQISDLAEWI RGLDEASVRP LLDGLGHLLD SFTEIGLGYL
SLDRPAGTLS GGEAQRTKMI RHLGSSLTDV TYVFDEPTIG LHPHDIERMN TLLLRLRDKG
NTVLVVEHKP ETIVIADRVV DLGPGAGTGG GEVVFEGPVA QLRRSGTLTG RHLDDRAAMK
KSVRQAQGAL EIRGATTNNL RDVDVDIPLG VLTVLTGVAG SGKSSLIDGS VAGRDEVVSI
DQGAIRGSRR SNPATYTGLL DSIRKAFAKA NGVKPALFSS NSEGACPACK GAGVIYTELG
VMATVESPCE ECEGRRFQAS VLEYTLGGRN IADVLEMSVA DALGFFADGE AATPAAHKVL
DRLADVGLGY LSLGQPLTTL SGGERQRLKL ATRLGDTGAD KKDVYVLDEP TSGLHLADVE
QLLALLDRLV DSGKSVIVIE HHQAVMAHAD WIIDLGPGAG HDGGRIVFEG PPADLVASRA
TLTGEHLADY VGG