Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DvMF_0167 |
Symbol | |
ID | 7172042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfovibrio vulgaris str. 'Miyazaki F' |
Kingdom | Bacteria |
Replicon accession | NC_011769 |
Strand | - |
Start bp | 182620 |
End bp | 184020 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643538660 |
Product | RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_002434595 |
Protein GI | 218885274 |
COG category | [K] Transcription [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.000521336 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGCATCC CGTTCTCCCT GGCTTCCATC GTGTGCATGG CCCGCCGGGG CGATGCCGAT GCCTGGCGGG CGCTGGTGGC CCGGCTGACG CCCTGGGCGC TGGGCCTGGC CCGGCGCAAG CTGCCGCGCA CGCTTGATGC CGAAGACGCG GTACAGGACG CCTTTCTCAC CTGCTTCCGC AAGCTCGATT CCCTGCGCTC GCCGGACGCC TTCCCCTCGT GGTTCGCGGC CATCCTCACC ACCCACTGCC ACCGCATCAC GGCCAGCCGC CCGACGGCCA TTTCGCTGGA AGTGCTGGAC GACAACGCCC TGCTGCCCGT CACGGCCCGC ACGCCGGAAG ACGAAGTGGG CACCGCGCAA CTGCTGGCCG CGTGCGATGC CGCCCTTGCC GCCCTGCCCC CGCATCTGCT CGACGTGAAC CGCCTGCACA TCCGCCACGG GCTTTCCATA CCAGAGGTGG CCGCCGCCTG CGGCCTGCCC GAAGGCACGG TAAAGAAACG CCTGTTCACC GCCCGCCCGC TGCTGCAACA GCAACTTGCC CGCTTCCGGG GCGACCTGCT GTTCCGCGTG GGCTACCTGC CCGTCTCCGA CCACCTGCTG GCCATGTGCG CCGACCACCT GCTGCGCGGT CGGGGCCTGC CCCTGCTTTC GCGCCGCTAC CTTTCATGGG CCGTACTGGC GGGCGACCTG ACCCATGGCC GCCTGGACGC CGCCTTCATC ATGGCGCCGC TGGCCCTCAG CCTGCGCGGC GCGGGTGCGC CCCTGCGCTA CGTCATGGAT GGCCACCACG AGGGCAGTGC GCTGTCCCTG TCGCGGCAGG CGGAGCGGCG CAGGGTCATG GGGCTGCCGG GGGCTTTCTC CACCCACCGG GTGCTGCTGG CCCATCTGGG GCTGGAATGC TCCGGCGTCT CGAACCTGCC CACACTGGTC GTCAACCCGT CGTCGGTCAT CGCCTCCATG CAGCACAACG AGATCGGCGC GTTCTTCTGC GCGGAACCGT GGAGCACCAA ATGCCTGCAC GAGGGCGTGG GGTACACCGC GTTGCGCTCG GCAGACATCA TGCCCCACCA CCTGTGCTGC ATCCTTGCCG TGCGCCAGCC CTTTGCGGAC CGGCACGGCC AGGTGGTGGC GGATTACGTG CGGGCCTTGC TGGCCGCGCG CGACAGGGTG CGCCGAGACC CGGCCTTCGG GGCTGCCGTG CAGTCCGCCC TCACCGGCGT GGATGCCGGG GTGGCGCGGC AGGTGCTGGA ACGCGAGGCC GTGACATTCG ACGACCTGGA GCCGGACGCC CCGCGCATGG CCGCCTTTGC CCGCCTGGCC GTAAACGCGG GCGTGCTGTC AGAACCCGTG GCCCTGCCGG GCTTTGCCTG CCCGGACTTT CTGCCCGTGC CCGCCCCCTG A
|
Protein sequence | MGIPFSLASI VCMARRGDAD AWRALVARLT PWALGLARRK LPRTLDAEDA VQDAFLTCFR KLDSLRSPDA FPSWFAAILT THCHRITASR PTAISLEVLD DNALLPVTAR TPEDEVGTAQ LLAACDAALA ALPPHLLDVN RLHIRHGLSI PEVAAACGLP EGTVKKRLFT ARPLLQQQLA RFRGDLLFRV GYLPVSDHLL AMCADHLLRG RGLPLLSRRY LSWAVLAGDL THGRLDAAFI MAPLALSLRG AGAPLRYVMD GHHEGSALSL SRQAERRRVM GLPGAFSTHR VLLAHLGLEC SGVSNLPTLV VNPSSVIASM QHNEIGAFFC AEPWSTKCLH EGVGYTALRS ADIMPHHLCC ILAVRQPFAD RHGQVVADYV RALLAARDRV RRDPAFGAAV QSALTGVDAG VARQVLEREA VTFDDLEPDA PRMAAFARLA VNAGVLSEPV ALPGFACPDF LPVPAP
|
| |