Gene Mjls_2174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_2174 
Symbol 
ID4877894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp2271209 
End bp2272651 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content69% 
IMG OID640139471 
ProductRNA polymerase sigma factor 
Protein accessionYP_001070451 
Protein GI126434760 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.512894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00228971 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCAGCGA CAAAGGCAAG CCCGGCAACC GAAGAGCCGG TGAAGCGCAC CGCTACCAAG 
ACCCCCGCGA AGAAGACCGC CGCGGCGAAG GCCCCAGCCA AACGCGCGGC CAAGGGTACG
GCGACGACCC GCGGCCCGGC CAAGAAGGAC GGCGCCGCCC CGCGCGGTCG TGGCAAGAAG
TCCACCGCAC CCGAGGCCGG TGCCGCCGAC GCCCTCGCCG ACGACGACCT CGACACCGAC
GACACCCTCG AGGCCGAACC GGATATCGAC GTCGACGACG CCGACCTGGA CCTCGAGGAT
CTCGACACCG ACGACGACTC GTCCGACGAC GGCGACGACG CCGACACCCC CGACGCCAAG
GTCAAGGCCG CCCCCAAGGG CGGCGCGGTG CCCGCCGCCC CCGCCACCGA GGACGAGGAG
ATCGCCGAGC CCTCCGAGAA GGACAAGGCC TCCGGCGACT TCGTCTGGGA CGAGGAGGAG
TCAGAGGCGC TGCGGCAGGC CCGCAAGGAC GCCGAGCTCA CCGCCTCCGC CGACTCGGTG
CGCGCGTATC TCAAGCAGAT CGGCAAGGTG GCGCTGCTCA ACGCCGAGGA GGAAGTCGAG
CTCGCCAAGC GCATCGAGGC CGGTCTGTTC GCCACCCAGA AGCTGGCCGA ACTCGCCGAA
AAGGGTGAGA AGCTGCCGGT GCAGCAGCGC CGCGACATGC AGTGGATCTG CCGCGACGGC
GACCGCGCCA AGAACCACCT GCTGGAGGCG AACCTCCGCC TGGTGGTGTC GCTGGCCAAG
CGCTACACCG GCCGTGGCAT GGCGTTCCTG GACCTCATCC AGGAGGGCAA CCTCGGTCTG
ATCCGCGCGG TCGAGAAGTT CGACTACACC AAGGGTTACA AGTTCTCCAC CTACGCCACC
TGGTGGATCC GGCAGGCGAT CACCCGCGCG ATGGCCGACC AGGCGCGCAC CATCCGCATC
CCGGTGCACA TGGTCGAGGT CATCAACAAG CTGGGCCGCA TCCAGCGCGA GCTGCTCCAG
GACCTGGGTC GCGAACCCAC GCCCGAAGAG CTCGCCAAGG AGATGGACAT CACGCCGGAG
AAGGTGCTGG AGATCCAGCA GTACGCGCGT GAGCCGATCT CGCTGGACCA GACGATCGGC
GACGAGGGCG ACAGCCAGCT CGGCGACTTC ATCGAGGACT CCGAGGCCGT GGTGGCCGTG
GACGCGGTCT CGTTCACGCT TCTGCAGGAT CAGCTGCAGT CGGTGCTGGA GACGCTGTCG
GAGCGCGAGG CCGGCGTGGT ACGGCTGCGG TTCGGCCTCA CCGACGGCCA GCCGCGCACG
CTCGACGAGA TCGGCCAGGT CTACGGCGTC ACGCGGGAAC GCATCCGCCA GATCGAGTCG
AAGACGATGA GCAAGCTCCG GCACCCCAGC CGGTCGCAGG TGCTGCGCGA CTACCTCGAC
TGA
 
Protein sequence
MAATKASPAT EEPVKRTATK TPAKKTAAAK APAKRAAKGT ATTRGPAKKD GAAPRGRGKK 
STAPEAGAAD ALADDDLDTD DTLEAEPDID VDDADLDLED LDTDDDSSDD GDDADTPDAK
VKAAPKGGAV PAAPATEDEE IAEPSEKDKA SGDFVWDEEE SEALRQARKD AELTASADSV
RAYLKQIGKV ALLNAEEEVE LAKRIEAGLF ATQKLAELAE KGEKLPVQQR RDMQWICRDG
DRAKNHLLEA NLRLVVSLAK RYTGRGMAFL DLIQEGNLGL IRAVEKFDYT KGYKFSTYAT
WWIRQAITRA MADQARTIRI PVHMVEVINK LGRIQRELLQ DLGREPTPEE LAKEMDITPE
KVLEIQQYAR EPISLDQTIG DEGDSQLGDF IEDSEAVVAV DAVSFTLLQD QLQSVLETLS
EREAGVVRLR FGLTDGQPRT LDEIGQVYGV TRERIRQIES KTMSKLRHPS RSQVLRDYLD