Gene Mjls_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4037 
Symbol 
ID4879745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4268406 
End bp4269746 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID640141348 
ProductTetR family transcriptional regulator 
Protein accessionYP_001072302 
Protein GI126436611 
COG category[K] Transcription 
COG ID[COG1309] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.755951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAG CCTCGATCGT CCGCCGAGCC AGCTACGGCC CCTCCAGCCC TGCTGTGGGT 
GCCCGCGGCG CCACCACGCG TAGCCGGATC ACCGAGGTGT CGCTGGAGCT GTTCGGCCGA
CTCGGGTACT TCGACACCTC GGTCGACGCG ATCGCCAAGG CGGCAGGTGT GTCCCGGGCC
ACTCTCTATC AGTACTTCCA GGGCAAGGAC GAGATCTTCC TCGAGCTGCT CAACGAGTGC
GGTAGCGCGC TGTTCCGGGT GGCCCGCCGC ATCGGCCCAC TCGGCCCCGA CGAAGTCGGC
TTCGACAATC TGAACTGGTG GCTGGGCGAG TGGAGCTGGG TGTTCGAGAA GTACTCCACC
ATGTTCGTGC AGTGGACGGC GATCGCCTCG TCGGACACAA AGGTGCGGCC GCAGATCACC
CGGTTCGTCC GTAGCTACAA CCACCGCGTC GCCGAGCGGC TGGCCGCGTC CGGACTGCAG
GGTCTGGACC CGGAGGTGGC GGCCATGACC ATGACCGCAC TGGTGCACCG CATCAACCTG
TTCGTGCACA CCGACGGTGC CTATGGCCGA AGCGCGAAGG ACGCGGTCGA CACGCTTTCG
GTGTTTCTGC AGCTGGCGTT GTTCCCCGAC ACCCCGCCGT CGGTGCTGAC GTCGCTGCGT
CTGCGCGCCA GCGCCGACCC GGCGGCCGAC GTGGACGCCG TCGAGGTGCC TGCGGCTCCG
GACGTCGAAG GACTGTCCAT CAGCGAGCGC ACCGCCACCC TGAGCAAGCG AGCCGTGAGT
ACCGTGACGG CGTTGGCCGC CGCGGGCGCC GCCCAGTTCC GTGCCCACGG CTACCGCAGC
ACGAGTGTGG ACGACATCGT GGAGGCGGCC GCCGTCGCCC GGGGCACCTT CTACAAGTAC
TTCAACGACA AGCAGGATTT ACTGGCCGCG GTGGCCGCCG AGATCTATAC CGCTGCAATG
ACGTTCGCGG AGCGCATCGC CGACGTGGAC CCCGTGGCGG ACGAGCAGAC GCTGCGGAAC
TGGCTGGCCA CCTACGTTGA GTTCTACGAC CGGTACTCCG GCTGCATCGA AGCGTGGGCG
GAAGGCGCCA CCGACGACCC CACGATCGTC GGGATCGGGG AGAACGGCCA GGTCCTGATG
GATGTCGGCG CGGCCAGGAT GTTGATCGGC CGACCGGGCC CCTACCCGTT CGACCCGGTA
GTCGCAGCGC TGATCCTGCG CGCACTGGTC ACCCGTGTCC GGCAGGCCGC GCTGGATCTG
CCCGAGCCGA TCCACGACGA CGAGATCGTG GAGTTGTTGA TGACGCTGAT CCGGCGCGGC
TTCTTCGGCC TCGCGACGTA G
 
Protein sequence
MAEASIVRRA SYGPSSPAVG ARGATTRSRI TEVSLELFGR LGYFDTSVDA IAKAAGVSRA 
TLYQYFQGKD EIFLELLNEC GSALFRVARR IGPLGPDEVG FDNLNWWLGE WSWVFEKYST
MFVQWTAIAS SDTKVRPQIT RFVRSYNHRV AERLAASGLQ GLDPEVAAMT MTALVHRINL
FVHTDGAYGR SAKDAVDTLS VFLQLALFPD TPPSVLTSLR LRASADPAAD VDAVEVPAAP
DVEGLSISER TATLSKRAVS TVTALAAAGA AQFRAHGYRS TSVDDIVEAA AVARGTFYKY
FNDKQDLLAA VAAEIYTAAM TFAERIADVD PVADEQTLRN WLATYVEFYD RYSGCIEAWA
EGATDDPTIV GIGENGQVLM DVGAARMLIG RPGPYPFDPV VAALILRALV TRVRQAALDL
PEPIHDDEIV ELLMTLIRRG FFGLAT