Gene Information |
Locus tag | Mmcs_5294 |
Symbol | |
ID | 4114121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 5579842 |
End bp | 5581008 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 638034450 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_642451 |
Protein GI | 108802254 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
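
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 5579842
END_BP = 5581008
GENE_LENGTH_BP = 1167
PROTEIN_LENGTH_AA = 388


def span_length(start, end):
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one for the stop codon (assumes an unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, both comparisons hold: the interval spans 1167 bp, and 1167 / 3 - 1 gives 388 residues.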
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCCGCCG ACCCGCTGGA CGCGGCGCGG TTGCGCGACC TGATTCCCGG CGTGCTGGCC GCCCTCGTCC ACCGCGGGGC GGACTTCGCG ACCGCCGAGG ACGCAGTGCA GGAGGCGTTG ATCCGAGCCG TCGAGACCTG GCCGGAGCAT CCGCCCGACG AGCCGAAGGG CTGGCTGATC ACGACGGCGT GGCGCCGGTT CCTCGATCTC TCCCGGTCGG ACACTGCGCG TCGCCGGCGG GAGGAACGGG TGTCCAACGA GCCGCCGCCG GGCCCGACCG AATCGGCCGA CGACACGCTG CAGCTCTGTT TCCTGTGCGC CCATCCCAGT CTCACTCCGG CTTCGGCAGT GGCTCTGACG CTGCGGGCCG TCGGCGGCTT GACCACCCGG CAGATCGCGC AGGCCTACCT GGTGCCCGAA GCGACGATGG CGCAGCGGAT CAGCCGCGCC AAGCGCACCG TGAGCGGTGT CCGGCTGGAC AGCCCCGGCG ACCTGCGCAC GGTGTGCCGG GTGCTGTACC TGATCTTCAA CGAGGGCTAC AGCGGCGACG TCGACCTCGC CGGGGAGGCG ATCCGGTTGG CCCGTCAGCT TGCGCGCATG ACCGACGATC CGGAGGTCGC CGGACTGCTC GCGCTGTTCC TGCTCCACCA CGCGCGCCGG CCCGCGCGGA TCCGGGCCGA CGGCAGCCTG GTGCCGTTGG CCGACCAGGA CCGCAGCCGG TGGCGACGTG ACCTGATCGC AGAGGGCGTG ACGATCCTGC AGGCCGCCCT GGCCCGCGAC CGGCTCGGCG AGTACCAGGC CCAGGCCGCG ATCGCGGCCC TGCACGCCGA CGCCCGCACC GTCGAAGAGA CCGACTGGGT GCAGATCGTC GAGTGGTACG ACGAGCTGGT CCGGCTCACC GACAGCCCCG TCGTCCGGCT CAACCGGGCG GTCGCCGTCG GAGAGGCGGA CGGGCCGCGG GCGGGACTCG CGGCGCTCGC CGAACTCGAC CCGTCACTGC CGCGGTACAG CGCATCGGCC GCCCACCTCC ACGAGCGGGC GGGGGAAATC GCCACGGCCG CAGAGCTTTA CGTGCAGGCC GCGAATCAGG CGCAGAACCT CGCCGAGCGG AACCATCTCA CGGTCCGCGC GGCCGCCCTC CGTCAGCGCC TCGCGGGTGA CATCTAG
|
Protein sequence | MAADPLDAAR LRDLIPGVLA ALVHRGADFA TAEDAVQEAL IRAVETWPEH PPDEPKGWLI TTAWRRFLDL SRSDTARRRR EERVSNEPPP GPTESADDTL QLCFLCAHPS LTPASAVALT LRAVGGLTTR QIAQAYLVPE ATMAQRISRA KRTVSGVRLD SPGDLRTVCR VLYLIFNEGY SGDVDLAGEA IRLARQLARM TDDPEVAGLL ALFLLHHARR PARIRADGSL VPLADQDRSR WRRDLIAEGV TILQAALARD RLGEYQAQAA IAALHADART VEETDWVQIV EWYDELVRLT DSPVVRLNRA VAVGEADGPR AGLAALAELD PSLPRYSASA AHLHERAGEI ATAAELYVQA ANQAQNLAER NHLTVRAAAL RQRLAGDI
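
The gene sequence above is displayed in space-separated 10-base blocks, begins with GTG, and ends with TAG, while the protein begins with M. Under translation table 11, GTG is an accepted start codon and is read as methionine at initiation. The sketch below shows one way to remove the block spacing, compute a GC fraction to compare against the GC content field, and check the first and last codons. The helper names are hypothetical, the start-codon set is a deliberately small subset of those allowed by table 11, and only the first two blocks of the record are used as sample input.

```python
# Common bacterial start codons; a subset of those allowed by translation table 11.
COMMON_STARTS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq_text):
    """Remove the whitespace that splits the displayed sequence into blocks."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_valid_ends(seq):
    """True if seq is a whole number of codons with a common start and a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_STARTS
        and seq[-3:] in STOP_CODONS
    )


# First two 10-base blocks of the gene sequence in this record.
sample = ungroup("GTGGCCGCCG ACCCGCTGGA")

if __name__ == "__main__":
    print(sample)
    print(gc_fraction(sample))
```

Passing the full gene sequence through `ungroup` and then `gc_fraction` gives a value that can be compared with the GC content field, and `has_valid_ends` can be applied to the same ungrouped string.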