Gene Information |
Locus tag | Mvan_3355 |
Symbol | |
ID | 4644402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 3570937 |
End bp | 3572157 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639806833 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_954158 |
Protein GI | 120404329 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
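The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions fit the numbers in this record, but the record does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 3570937
END_BP = 3572157


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1221 406
```

Under these assumptions the computed values match the listed Gene Length (1221 bp) and Protein Length (406 aa).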
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.717574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.451695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTCCC TGGACGGCGT CTTCCGGCGC GAATGGGGTC GCACGGTGGC CGCGATCGCA CGTTGGTGCG GCGATCTCAC CGTCGCCGAG GACGCCGTCC AGGAGGCCTG CGCCGACGCG CTGCGGTCCT GGCCGCGCGA CGGGGTGCCC GAGGTGCCCG GCGCCTGGCT GGTGACGACG GCCCGCAACC GGGCACGCGA CCGTCTGCGG CGCGAGTCGG CTCGCCCGGG GAAGGAGTTG GCAGCGGTGC TCGACGACAT CATTGCCCGC ACCGACGGAC CCGAAGTGCC CCACCGGGTG CGCGACGACG AGCTGCGGAT GATGTTCACC TGCGCGCACC CTGCGCTGGA GCGGCCGTCG CAGCTTGCGC TGACGCTGCG ACTGGTGTCC GGGCTCACGG TCGCCGAGAT CGCGCGGGCC CTGTTGCAGA GCGAAGCCGC GGTCGGGCAG CGAATCACCC GCGCCAAAGC CAAGATCCGG CACGCGAACA TTCCGCTGCG GGTTCCGCCG GGCGAGTTGC TGGCCCAGCG CACACCACAC GTGCTGGCCT GCATCTACTC GGTGTTCACC GAGGGCTACT GGTCCACCGC CGGCCCCTCG GCGATCCGCG ATGAGCTGTG CGACGAGGCG GTGCGCCTCA CCGGTGAACT GTGCGTGGCG ATGCCCGAGG AACCCGAGGG GCATGCGCTG GCGGCCCTTG TGCTGCTGCA TGATTCGCGT CGCGCCACCC GGACCGACGG ACGTGGGACG CTGGTGCCGT TGGAGGAGCA GGACCGGCGG CGCTGGGATC GCGGCAAGAT CGCCCGTGGC CTGGAGCAGT TACGCCGAGC CGGCGGGTCG CCCGGACCGT ACCTGCCGCA GGCGGTGATC GCCGCCGTAC ACGCGACTGC TCCCTGCTGG GAGCAGACCG ACTGGGTCAC CATCTGCGCG GCCTACGACC GGCTGCTCGG TATCGCCGAT TCGCCGGTGG TGCGGGCCAA CCGCGCCATG GCCGTCGGAT TCCGCGACGG CCCGGATGCC GGGCTGGCAG CGTTGGAGAC GGTGGCCGAC GATCCGCGGC TGGCGCGCTC TCCCCTGGTG GCCACGGTGC GCGCCGATCT GCTCCGGCGC GCGGGCCGTG ACGACGAGGC CGTGGTGTCG TATCGACATG CCCTGGCCGC GAACGGGTCA ATACCCGGGC GGGAGTTCCT GGCGCGCCGG ATCGCCGAAT GCGGCGGCTG A
|
Protein sequence | MQSLDGVFRR EWGRTVAAIA RWCGDLTVAE DAVQEACADA LRSWPRDGVP EVPGAWLVTT ARNRARDRLR RESARPGKEL AAVLDDIIAR TDGPEVPHRV RDDELRMMFT CAHPALERPS QLALTLRLVS GLTVAEIARA LLQSEAAVGQ RITRAKAKIR HANIPLRVPP GELLAQRTPH VLACIYSVFT EGYWSTAGPS AIRDELCDEA VRLTGELCVA MPEEPEGHAL AALVLLHDSR RATRTDGRGT LVPLEEQDRR RWDRGKIARG LEQLRRAGGS PGPYLPQAVI AAVHATAPCW EQTDWVTICA AYDRLLGIAD SPVVRANRAM AVGFRDGPDA GLAALETVAD DPRLARSPLV ATVRADLLRR AGRDDEAVVS YRHALAANGS IPGREFLARR IAECGG
|
| |
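The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be stripped before any analysis. The sketch below is a minimal illustration, not the database's own method: it removes the whitespace and computes the GC fraction, the quantity behind the GC content field. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence shown above.
sample = clean_sequence("ATGCAGTCCC TGGACGGCGT")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence through the same two helpers gives the figure for the whole gene, which can then be compared with the listed GC content.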