Gene Mjls_4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4520 
Symbol 
ID4880223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4744916 
End bp4746334 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content67% 
IMG OID640141827 
ProductNLP/P60 protein 
Protein accessionYP_001072780 
Protein GI126437089 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCCC TACGCGAATG GGTCAGCCGC CTCACCGCCG TGACGCTCGG TGTGGCCGTG 
CTGGTCTACG GTGGCGGCAC CGCGCAGGCA TCCCACGATG GCACCAGCCA TTCGGGTTCG
CAGATCTCCG CCCTGGTCGC CGACCTCGCG CAGGCCAATC AACGCCTGGC CGACATCGGC
GCGCAGATCC AGGGTCAACA AGAAGGTGTC AATAAAGCGC TGGTGGACGT GGCCAACGCC
CGCGATGCCG CCGCGGCCGC GCGCCGCGAC GTGCAAGCCA GCGAACAGGG TCTGGCCGAT
GCGAACGCGG CGATCGCCGC CGCCCAACGA CGTTTCGATG ACTTCGCCGC CGCCACCTAT
GTCAATGGCC CACCGCAGGC GCTGGTTTCG GCGGCCAGCC CCGAGGACAT CATCGCCACC
GCCAGCGCCA ATCAGACCCT GGCACTCAGT GCCAGCAATA CGGTGACCGA CCTGCAGCAC
GCCCGCACCG AACAAGCCAA CCGCGCGTCC ACAGCGCGGG CGGCCCAGCA ACGCGCCGAC
CAGGCGGCCG CCGATGCCGA GCAAAGCCAA CAGGCAGCGG TGGCCGCCCT GACCGAGGCG
CAACGCCAGT TCGGTCTGCA GCAGGCCGAA GTCGACCGGC TCGTCGCCGC CCGAGACACC
GCCAAAGCGC GTCTGGACGC GGCGCGTCCC CAAACTCCAC CCGACAACAC CGCACCGCTG
ATTGCCGCCG GTGGCACGGC GCCGGCGCCG GACCGGTGGG ACCGGCAAAC CCCGGCAGGT
GCGACGGCAC CCGCAGACAC CAGTCAGTGG GACACAACCT TGCCGATGGT TCCCAGCGCC
AACGTCGCCG GCGACCCCAT TGCCATCGTG AACGCGGTGC TGCAGATCTC GTCAACGTCG
GCGCAGCTGA CTGCCGACAT GGGTCGCAAA TTCCTTACCC AACTCGGGAT CCTGCCCCAA
GCCTCGGCAC CCGCCGACCC CGGCTTCACC AACGGGCGCA TCCCACGGGT ATATGGCCAG
CAGGCAATGG AATTGGTGAT CCGCCGGGCC ATGTCGCAGC TGGGTGTGCC CTATTCGTGG
GGTGGCGGCA ACGCCAACGG CCCGTCCCGA GGTATCGACC AGGGCGCCAA CACCGTGGGA
TTCGACTGCT CCGGGCTGAT CCTCTACGCT TTTGCCGGCG TGGGCATCAA ACTGGAACAT
TACTCGGGCA CGCAGTACAA CTCCGGACGC AAAATCCCCT CGTCACAAAT GCGACGCGGC
GATCTGATCT TCTACGGCCC CAACGCAAGC CAGCATGAGG CGATGTACCT CGGCAACGGT
CAGATGATCG AAGCGCCCTA CACCGGCTCA CAGGTGCGCA TCGCGCCGGT ACGTACCAGC
GGCATGATGC CCTACGTCAC CCGACTCATC GAGTATTGA
 
Protein sequence
MRPLREWVSR LTAVTLGVAV LVYGGGTAQA SHDGTSHSGS QISALVADLA QANQRLADIG 
AQIQGQQEGV NKALVDVANA RDAAAAARRD VQASEQGLAD ANAAIAAAQR RFDDFAAATY
VNGPPQALVS AASPEDIIAT ASANQTLALS ASNTVTDLQH ARTEQANRAS TARAAQQRAD
QAAADAEQSQ QAAVAALTEA QRQFGLQQAE VDRLVAARDT AKARLDAARP QTPPDNTAPL
IAAGGTAPAP DRWDRQTPAG ATAPADTSQW DTTLPMVPSA NVAGDPIAIV NAVLQISSTS
AQLTADMGRK FLTQLGILPQ ASAPADPGFT NGRIPRVYGQ QAMELVIRRA MSQLGVPYSW
GGGNANGPSR GIDQGANTVG FDCSGLILYA FAGVGIKLEH YSGTQYNSGR KIPSSQMRRG
DLIFYGPNAS QHEAMYLGNG QMIEAPYTGS QVRIAPVRTS GMMPYVTRLI EY