Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_47000 |
Symbol | rpoD |
ID | 7763563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4770914 |
End bp | 4772776 |
Gene Length | 1863 bp |
Protein Length | 620 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643807544 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_002801780 |
Protein GI | 226946707 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTAA AAGCTCAACA GCAGTCTCGC ATCAAAGAGT TGATCGCTCG CGGCCGCGAA CAGGGTTACC TGACCTATGC CGAGGTCAAC GACCACCTGC CCGAGGATAT TTCCGATCCG GAACAGGTGG AAGACATCAT CCGCATGATC AACGACATGG GGATCAACGT ATTCGAAAGT GCCCCGGATA CGGATGCCCT GTTGTTGGCC GAAGCCGACA CCGACGAAGC TGCCGCCGAG GAAGCCGCCG CGGCCCTGGC CGCCGTGGAA ACCGACATCG GTCGTACCAC CGACCCGGTG CGCATGTACA TGCGCGAAAT GGGTACCGTG GAACTGCTCA CCCGCGAAGG CGAAATCGAG ATCGCCAAGC GCATCGAGGA AGGCATCCGC GAGGTGATGA GCGCCATCGC CCACTTCCCG GGCACCGTGG ACGGCATCCT CGCGGAATAC CAGCGCGTCA CCAGCGAAGG CGGCCGCCTG TCCGACATCC TCAGCGGCTA CATCGATCCC GACGACGATT CGGCGGTGCC TGCCGAGGCC GAGGTTCCCG TCGATCTCAA GAGCAAGAGC GCCGCGACCC CCGCCGCATC CGAGGACGAG GACGAGGACG AGAGCGAGGA AAGCGACAGC GACGACGAGG AAGGCGACGG TGGTCCCGAC CCGGAGATCG CCCGCCAGCG CTTCGGCGCT GTCGCCGAGC AGTTGGGCAG GACCCGTGTG GCTCTCGAAC GGCACGGGCG CCACAGCGTC GAGGGCCTTG AGGCGCTGCA GGCGCTGGCC AGCCTGTTCA TGCCGATCAA GCTGGTGCCC AAACAGTACG ACACCCTGGT CGAGCAGGTG CGTGACGCCC TGACCCGCGT GCGCGCCCAG GAACGCGCGA TCATGCAGTT GTGCGTGCGC GATGCGCGCA TGCCGCGCGC CGACTTCCTG CGCCAGTTTC CCGGCAACGA GACCGATCAG GACTGGGTCG ACCTCCTCGC CAAGGGCAAG GCCAAGTACG CCGAAGCCCT GGGCAATCTG GCCGACGACA TCAAGCAGTG CCAGCAGAAG CTGATCGACC TCGAGCAGAA GGTCGGCCTG ACCATCGCCG AAATCAAGGA CATCAACCGC CAGATGTCCA TCGGCGAAGC CAAGGCCCGC CGCGCCAAGA AGGAAATGGT CGAGGCCAAC CTGCGCCTGG TGATCTCCAT CGCCAAGAAA TACACCAACC GCGGCCTGCA GTTCCTCGAT CTGATCCAGG AAGGCAACAT CGGCCTGATG AAGGCGGTGG ACAAGTTCGA ATACCGCCGC GGCTACAAGT TCTCGACCTA CGCCACCTGG TGGATCCGCC AGGCGATCAC CCGCTCGATC GCCGACCAGG CGCGCACCAT CCGCATCCCG GTGCACATGA TCGAGACCAT CAACAAGCTC AACCGCATCT CCCGCCAGAT GCTCCAGGAG ATGGGCCGCG AACCCACGCC GGAAGAACTC GGCGAGCGCA TGGACATGCC CGAGGACAAG ATCCGCAAGG TGCTGAAGAT CGCCAAGGAA CCGATCTCCA TGGAAACCCC GATCGGCGAC GACGAGGACT CGCATCTGGG CGACTTCATC GAGGACTCGA CCATGCAGTC GCCGATCGAG GTGGCGACCG TGGAAAGCCT CAAGGAGGCC ACCCGCGAAG TCCTCGCCGG CCTCACCGCC CGGGAAGCCA AGGTGCTGCG CATGCGCTTC GGCATCGACA TGAACACCGA CCACACCCTC GAGGAGGTCG GCAAGCAGTT CGACGTGACC CGCGAGCGCA TCCGTCAGAT CGAGGCCAAG GCCCTGCGCA AGCTGCGCCA CCCCTCGCGA AGCGAGCACC TGCGCTCCTT CCTTGACGAG TGA
|
Protein sequence | MSVKAQQQSR IKELIARGRE QGYLTYAEVN DHLPEDISDP EQVEDIIRMI NDMGINVFES APDTDALLLA EADTDEAAAE EAAAALAAVE TDIGRTTDPV RMYMREMGTV ELLTREGEIE IAKRIEEGIR EVMSAIAHFP GTVDGILAEY QRVTSEGGRL SDILSGYIDP DDDSAVPAEA EVPVDLKSKS AATPAASEDE DEDESEESDS DDEEGDGGPD PEIARQRFGA VAEQLGRTRV ALERHGRHSV EGLEALQALA SLFMPIKLVP KQYDTLVEQV RDALTRVRAQ ERAIMQLCVR DARMPRADFL RQFPGNETDQ DWVDLLAKGK AKYAEALGNL ADDIKQCQQK LIDLEQKVGL TIAEIKDINR QMSIGEAKAR RAKKEMVEAN LRLVISIAKK YTNRGLQFLD LIQEGNIGLM KAVDKFEYRR GYKFSTYATW WIRQAITRSI ADQARTIRIP VHMIETINKL NRISRQMLQE MGREPTPEEL GERMDMPEDK IRKVLKIAKE PISMETPIGD DEDSHLGDFI EDSTMQSPIE VATVESLKEA TREVLAGLTA REAKVLRMRF GIDMNTDHTL EEVGKQFDVT RERIRQIEAK ALRKLRHPSR SEHLRSFLDE
|
| |