Gene Mjls_5440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_5440 
Symbol 
ID4881137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5701393 
End bp5702397 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content68% 
IMG OID640142757 
Producthypothetical protein 
Protein accessionYP_001073694 
Protein GI126438003 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.763333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCCG TGACCGCGCT GCGCCAGATC GCGTACTACA AGGACCGCGC CCGCGAGGAC 
TCTCGACGGG TGATGGCCTA CCGCAACGCC GCCGACGTCG TGGAGCGGCT CACCGAAGCC
GAACGTGACC GCCACGGCGC CGCCGATTCG TGGCAGTCGC TGCCCGGAAT CGGGCCCAAG
ACGGCGAAGG TGATCGCGCA GGCGTGGGCC GGCCGCGAAC CCGACGTGCT CATCGAATTG
CGGGAGAACG CAGTCGATCT CGGCGGCGGT GAGATCCGCG CGGCACTCAA GGGCGATCTG
CACGTGCACT CCAACTGGTC GGACGGGTCG GCGCCGATCG AGGAGATGAT GCTCGCCGCC
CGCGACCTCG GGCACGAGTA CTGCGTGTTG ACCGACCACT CACCGCGGTT GACCATCGCC
AACGGGCTGT CCCCGGACCG GCTGCGCAAA CAGCTCGACG TCATCGACGA ACTCCGGGAA
AGCGTTGCAC CCCTTCGCAT TCTGACCGGC ATCGAAGTCG ACATCCTCGA GGACGGCTCC
CTCGACCAGG AGGAGGAACT GCTCGAGCGC CTCGACGTCG TGGTGGCCAG CGTGCACTCG
AAACTGGCGA TGGACGCCCC GGCGATGACA CGCCGCATGC TCAAGGCCGT CGCCAATCCG
CACACCGACG TGCTCGGCCA CTGCACCGGG CGGTTGGTCA CCGGAAACCG CGGAATCCGG
CCTGAATCGA AATTCGACGC CGAGAAGGTG TTCACCGCGT GCCGCGACAA CGGCACCGCC
GTCGAGATCA ACTCCCGCCC CGAACGGCGG GATCCCCCCA CCCGGCTGTT GAAGCTCGCG
CTCGACATCG GTTGCGTGTT CTCGATCGAC ACCGATTCGC ACGCGCCGGG TCAGCTGGAC
TTCCTCGGCT ACGGCGCACA ACGGGCGCTC GACGCCGGCG TGCCCGCCGA GCGGATCGTC
AACACCTGGC CCGCCGACGA TCTGCTGGCG TGGACCTCCT CCTGA
 
Protein sequence
MDPVTALRQI AYYKDRARED SRRVMAYRNA ADVVERLTEA ERDRHGAADS WQSLPGIGPK 
TAKVIAQAWA GREPDVLIEL RENAVDLGGG EIRAALKGDL HVHSNWSDGS APIEEMMLAA
RDLGHEYCVL TDHSPRLTIA NGLSPDRLRK QLDVIDELRE SVAPLRILTG IEVDILEDGS
LDQEEELLER LDVVVASVHS KLAMDAPAMT RRMLKAVANP HTDVLGHCTG RLVTGNRGIR
PESKFDAEKV FTACRDNGTA VEINSRPERR DPPTRLLKLA LDIGCVFSID TDSHAPGQLD
FLGYGAQRAL DAGVPAERIV NTWPADDLLA WTSS