Gene Mjls_3856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3856 
Symbol 
ID4879566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4079065 
End bp4080555 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content72% 
IMG OID640141165 
ProductDNA-3-methyladenine glycosylase II / DNA-O6-methylguanine--protein-cysteine S-methyltransferase / transcriptional regulator Ada 
Protein accessionYP_001072123 
Protein GI126436432 
COG category[F] Nucleotide transport and metabolism
[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
[COG2169] Adenosine deaminase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.26438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00733732 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTACGACG ATTTCGACCG CTGCTACCGG GCCGTGCAGT CCAAGGACGC GCGGTTCGAT 
GGTTGGTTCG TCACGGCGGT GCTGACGACG CGGATCTACT GCCGCCCAAG CTGTCCCGTC
CGGCCGCCGT TCGCCCGCAA CGTGCGCTTC TATCCGACCG CCGCGGCCGC TCTGGCGGCG
GGATTCCGCG CCTGTAAGCG GTGCAGGCCC GACGCGTCGC CCGGTTCTCC CGAGTGGAAC
GTCCGCGGCG ACGTGGCCGC CAGGGCGATG CGCCTGATCG CCGACGGCAC GGTCGACCGG
GACGGTGTCA CGGGTCTGGC CGGCCGGCTC GGCTACACCA CGCGGCAGTT GCAGCGCATC
CTGCAGGCCG AGGTGGGGGC GAATCCGCTG GCGTTGGCCC GTGCGCAGCG GGCACAGACC
GCACGCGTGC TGATCGAGAC CACCGACCTG CCGTTCTCCG ATGTGGCGTT CGCCGCGGGG
TTCTCGAGCA TCCGGCAGTT CAACGACACG GTGCGCGCCA CCTCCGCGTG CACCCCGACC
GCGATGCGGG AGCGGGCGCG ACGCCGCTTC GGGGCGGCCA CCGCCGGCGC GGGGTCGCTG
GCACTGCGCC TGCCGGTGCG TAGGCCGTTC GCCTACGAAG GGGTGTTCGG GCACCTGGCG
GCCAGCGCCG TACCGGGTGT CGAGGAGTTC CGTGACGGGG CCTTCCGCCG CACGCTGCGG
CTTTCGCGGG GCCACGGCAT CGTCGGCCTC ACCCCCCGCG ACGGTCACGT CGACTGCGTG
CTGCACCTCG AGGACCTTCG GGACCTGTCC AGCGCCATCG CGCGGTGCCG GCGCCTGCTG
GACCTCGACG CCGACCCGGA GGCCGTCGTC GACGTACTCG GCGCCGACCC GGACCTCACC
GCGTTGGTGA CGAAGGCGCC CGGGCAGCGC ATCCCGCGCA CTGTCGACGA GGCGGAACTG
GCCGTGCGGG TGGTTCTGGG CCAACAGGTC TCCCTGAAGG CCGCCCGCAC GCACGCCGCG
CGGCTCGTCA CCCACTACGG TCGCCCGATC AGCGATCCAC ACGGTGGCCT GACCCGCGTG
TTTCCCACCG TGGAGGAACT CGCCGACATC GCTGCGCCCC ATCTGGCCGT ACCGCGCAGC
CGGCAGTCCA CCGTGCGCTC GCTCATCGCG GCACTGGCGT CGGGCGACGT GCGACTCGAT
CCCGGATGTG ACTGGAACGA GGCACGGGCA CAACTCACCG TACTGCCCGG CATCGGCACA
TGGACTGCGG AGGTGATCGC GATGCGCGGA CTCGGCGATC CCGACGCCTT CCCCGTCACC
GATCTGGGCG TGCTCACCGC CGCTCGCCAC CTCGGCCTGG CCGAGGATGC CCGGGCCCTT
GCAGCGCACG GCGCCCGGTG GCGTCCGTGG CGGGCCTACG CGACGCAGCA CCTGTGGACG
GCGCTCGATC ATCCGGTCAA CGACTGGCCC CCGAAGGAGA TCCGACAGTG A
 
Protein sequence
MYDDFDRCYR AVQSKDARFD GWFVTAVLTT RIYCRPSCPV RPPFARNVRF YPTAAAALAA 
GFRACKRCRP DASPGSPEWN VRGDVAARAM RLIADGTVDR DGVTGLAGRL GYTTRQLQRI
LQAEVGANPL ALARAQRAQT ARVLIETTDL PFSDVAFAAG FSSIRQFNDT VRATSACTPT
AMRERARRRF GAATAGAGSL ALRLPVRRPF AYEGVFGHLA ASAVPGVEEF RDGAFRRTLR
LSRGHGIVGL TPRDGHVDCV LHLEDLRDLS SAIARCRRLL DLDADPEAVV DVLGADPDLT
ALVTKAPGQR IPRTVDEAEL AVRVVLGQQV SLKAARTHAA RLVTHYGRPI SDPHGGLTRV
FPTVEELADI AAPHLAVPRS RQSTVRSLIA ALASGDVRLD PGCDWNEARA QLTVLPGIGT
WTAEVIAMRG LGDPDAFPVT DLGVLTAARH LGLAEDARAL AAHGARWRPW RAYATQHLWT
ALDHPVNDWP PKEIRQ