Gene Mjls_5673 details


Gene Information

Locus tag: Mjls_5673
Symbol:
ID: 4881370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 5924123
End bp: 5925289
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 73%
IMG OID: 640142991
Product: RNA polymerase ECF-subfamily sigma factor
Protein accession: YP_001073927
Protein GI: 126438236
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
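
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 5924123
end_bp = 5925289
reported_gene_length = 1167     # bp
reported_protein_length = 388   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(start_bp, end_bp)
computed_protein = protein_length(computed_gene)

print(computed_gene, computed_gene == reported_gene_length)           # 1167 True
print(computed_protein, computed_protein == reported_protein_length)  # 388 True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.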


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGCCG ACCCGCTGGA CGCGGCGCGG TTGCGCGACC TGATTCCCGG CGTGCTGGCC 
GCCCTCGTCC ACCGCGGGGC GGACTTCGCG ACCGCCGAGG ACGCGGTGCA GGAGGCGTTG
ATCCGAGCCG TCGAGACCTG GCCGGAGCAT CCGCCCGACG AGCCGAAGGG CTGGCTGATC
ACGACGGCGT GGCGCCGGTT CCTCGATCTC TCCCGGTCGG ACACTGCGCG TCGCCGGCGG
GAGGAACGGG TGTCCGACGA GCCGCCGCCG GGCCCGACCG AATCGGCCGA CGACACGCTG
CAGCTCTGTT TCCTGTGCGC CCATCCCAGT CTCACTCCGG CTTCGGCAGT GGCTCTGACG
CTGCGGGCCG TCGGCGGCCT GACCACCCGG CAGATCGCGC AGGCCTACCT GGTGCCCGAA
GCGACGATGG CGCAGCGGAT CAGCCGCGCC AAGCGAACCG TGAGCGGTGT CCGGCTGGAC
AGCCCCGGCG ACCTGCGCAC GGTGTGCCGG GTGCTGTACC TGATCTTCAA CGAGGGCTAC
AGCGGCGACG TCGACCTCGC CGGGGAGGCG ATCCGGTTGG CCCGCCAGCT CGCGCGCATG
ACCGACGATC CGGAGGTCGC CGGACTGCTC GCGCTCTTCC TGCTCCACCA CGCGCGCCGG
CCCGCGCGGA TCCGCGCCGA CGGCAGCCTG GTGCCGTTGG CCGACCAGGA CCGCAGCCGG
TGGCGACGTG ACCTGATCGC AGAGGGCGTG ACGATCCTGC AGGCCGCCCT GGCCCGCGAC
CGGCTCGGCG AGTACCAGGC CCAGGCCGCG ATCGCGGCCC TGCACGCCGA CGCCCGAACC
GTCGAAGAGA CCGACTGGGT GCAGATCGTC GAGTGGTACG ACGAGCTGGT CCGGCTCACC
GACAGCCCCG TCGTCCGGCT CAACCGGGCG GTCGCCGTCG GAGAGGCGGA CGGGCCGCGG
GCGGGACTCG CGGCGCTCGC CGAACTCGAC CCGTCACTGC CGCGGTACAC CGCATCGGCC
GCCCACCTCC ACGAGCGGGC GGGTGAAATC GACACGGCCG CAGAGCTTTA CGTGCAGGCC
GCGAATCAGG CGCAGAACCT CGCCGAGCGG AACCATCTCA CGGTCCGCGC GGCCGCCCTC
CGTCAGCGCC TCGCGGGTGA CATCTAG
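
The gene sequence is printed in space-separated blocks of 10 bases. To recompute a figure such as the GC content, the blocks first have to be joined. The sketch below is one possible way to do this; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "GTGGCCGCCG ACCCGCTGGA CGCGGCGCGG TTGCGCGACC TGATTCCCGG CGTGCTGGCC"

print(len(clean_sequence(first_line)))   # 60
print(round(gc_content(first_line), 1))  # GC percentage of this line only
```

Passing the full 1167 bp sequence to `gc_content` gives a value that can be compared with the GC content field in the record.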
 
Protein sequence
MAADPLDAAR LRDLIPGVLA ALVHRGADFA TAEDAVQEAL IRAVETWPEH PPDEPKGWLI 
TTAWRRFLDL SRSDTARRRR EERVSDEPPP GPTESADDTL QLCFLCAHPS LTPASAVALT
LRAVGGLTTR QIAQAYLVPE ATMAQRISRA KRTVSGVRLD SPGDLRTVCR VLYLIFNEGY
SGDVDLAGEA IRLARQLARM TDDPEVAGLL ALFLLHHARR PARIRADGSL VPLADQDRSR
WRRDLIAEGV TILQAALARD RLGEYQAQAA IAALHADART VEETDWVQIV EWYDELVRLT
DSPVVRLNRA VAVGEADGPR AGLAALAELD PSLPRYTASA AHLHERAGEI DTAAELYVQA
ANQAQNLAER NHLTVRAAAL RQRLAGDI
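
The protein sequence follows from the gene sequence under translation table 11, which explains why the gene starts with GTG while the protein starts with M. The following is a minimal sketch, assuming that table 11 uses the standard codon-to-amino-acid assignments and that an initiating GTG, TTG, or ATG is read as methionine. The start-codon set here is a commonly used subset chosen for illustration, and `translate` is a hypothetical helper, not a tool used by the database.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: subset of bacterial start codons, all read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        residues.append(amino_acid)
    return "".join(residues)


# First 21 bases of the gene sequence above.
print(translate("GTGGCCGCCG ACCCGCTGGA C"))  # MAADPLD
# Last 12 bases of the gene sequence above, including the TAG stop.
print(translate("GGTGACATCTAG"))             # GDI
```

Both outputs match the corresponding ends of the protein sequence listed above (MAADPLD... and ...GDI).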