Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2057 |
Symbol | |
ID | 3784375 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2348764 |
End bp | 2350887 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637812146 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_412743 |
Protein GI | 82703177 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00133668 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATCA AGGAATCTGA AGATATGAAG AAGCTGTTGA GAGCGCAGGC GAAAGAAGCG GCAGCAAAAG AACTGACGAA GGTGGATGAA GAAATGTTCG ATAATGACAG TGAAACGGAA ACGGATTCAA TGGCTAAAGC TCCGGAGCAG GTGACCGAGA TTGCGAAGGC AGTGATGGCG GCGAAAGAGC CCACCCTCGC CAATGGCAAG ATCGCAAGAT CAAGAGCGGC CCGGGAAGGA AAAAACTCGC ACGCCAAGGA TCCGGAAAAA GCTGCCGCCT CCCAGGAGCT CGAGGCGCGC CGCATGCGTC TCAAGAATCT GATCGTGCAG GGCAAGGAAC GCGGCTATCT CACTTACGCC GAAATCAATG ATCATCTCCC CGACGATATG CTCGACGCGG AGCAGATCGA GAACATCATC AGCATGATCA ATGACGTCGG GATTTCCGTT TATGACGAGG CGCCCGACGC GGAAACGCTG CTCATGTCTG AAACTGCCCC TACCGTGGCC GATGAGGATG TGGTGGAGGA AGCGGAAGCC GCGCTTTCCA CTGTGGATTC GGAGTTCGGG CGCACAACCG ACCCTGTCCG GATGTATATG CGCGAAATGG GTTCCGTGGA ACTGCTGACG CGTGAGAGCG AAATCGAAAT CGCAAAACGT ATCGAGGACG GCTTGAAGCA CATGATACAG GCGATTTCCG CCTGTCCGAC AACCATAGCC GGAATTCTCG AATTCGCCGA CAGAGTGTCG AAAGACGAGA TGCGTGTGGA TGAGCTGGTG GATGGGTTGC TCGATCCCAG CACAGAGGAA ATTATCAGCG AAGAGATTTC CGATGAATCC CTGGAACAGG AATTGAACTC GGATGCGGAA GATGAGGATG TCACCGCAGT CGCAAATGCC AACCTCCTCA AGCTGAAAAA TGACGCGCTG GAGCGTTTTG CAGTGGTTCA AAAGGCTTAT GACGAGATGC AGAAGGTGCT CGAAAAGAAA GGATCGGGCA ACAAGGCCTA TAAAGACATC CAGGAGCAGA TTTCGTCCGA GCTGATGGCT ATCCGTTTCT CTGCCAAAAT GGTTGAACGG TTGTGCGATA CGCAGCGGGC ACTGGTGGAT GAAATGCGCG GTTACGAGCG AAAAATAATG GAGCTTTGCG TAAGCAAGGT GGGAATGTCG CGTAACCACT TCATCAAGAC CTTCCCCGGT AACGAGAGTA ACCTGAACTG GGTGGATGAG GAGATTGCAC TCGGCAAACC CTACAGCGCA GCCCTGGAAC GTTATCGTCC CGAGATTGTG GAACAGCAAC AGAATCTGCT GGCACTGCAA AAGCAGGTAG GCATTCCCTT GAAGGAACTC AAGGAAATCA ACCGCCGCAT GTCCACGGGT GAGGCGAAGG CGCGCCGGGC CAAACGTGAA ATGACCGAAG CAAATTTGCG ACTGGTGATT TCCATCGCTA AAAAATATAC CAATCGGGGA TTGCAGTTCC TCGATCTCAT TCAGGAAGGC AACATCGGCC TGATGAAGGC AGTCGATAAA TTCGAATACC GGCGGGGATA CAAGTTTTCC ACCTACGCAA CCTGGTGGAT TCGTCAGGCC ATCACACGTT CCATTGCGGA TCAGGCGCGT ACCATCCGTA TCCCGGTGCA CATGATCGAA ACGATTAACA AGATGAACCG CATTTCCCGC CAGATCCTGC AGGAAACCGG GCAGGAGCCG GAGCCCGCCG TCCTCGCACA GAAAATGGAA ATGCCGGAAG AGAAGATTCG TAAAATCCTC AAGATTTCCA AGGAACCAAT TTCCATGGAG ACCCCGATCG GAGACGACGA AGATTCTCAT CTCGGGGATT TCATCGAGGA TTCAGCTACC ATGGCTCCTG CGGATGCGGC AGTTTATGCC AGCCTGCGCG ATGTTACGAA AGATATACTG GATTCGCTGA CTCCGCGCGA AGCAAAAGTA CTGCGCATGC GCTTCGGCAT CGAAATGAAT ACCGACCACA CGCTGGAGGA AGTCGGCAAG CAGTTCGACG TAACGCGCGA GCGCATCCGC CAGATCGAGG CCAAGGCACT GCGCAAGCTG CGCCATCCGT CCCGTTCCGA GCGCCTGCGC AGCTTCCTGG ATACTGAAGG CTGA
|
Protein sequence | MAIKESEDMK KLLRAQAKEA AAKELTKVDE EMFDNDSETE TDSMAKAPEQ VTEIAKAVMA AKEPTLANGK IARSRAAREG KNSHAKDPEK AAASQELEAR RMRLKNLIVQ GKERGYLTYA EINDHLPDDM LDAEQIENII SMINDVGISV YDEAPDAETL LMSETAPTVA DEDVVEEAEA ALSTVDSEFG RTTDPVRMYM REMGSVELLT RESEIEIAKR IEDGLKHMIQ AISACPTTIA GILEFADRVS KDEMRVDELV DGLLDPSTEE IISEEISDES LEQELNSDAE DEDVTAVANA NLLKLKNDAL ERFAVVQKAY DEMQKVLEKK GSGNKAYKDI QEQISSELMA IRFSAKMVER LCDTQRALVD EMRGYERKIM ELCVSKVGMS RNHFIKTFPG NESNLNWVDE EIALGKPYSA ALERYRPEIV EQQQNLLALQ KQVGIPLKEL KEINRRMSTG EAKARRAKRE MTEANLRLVI SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKMNRISR QILQETGQEP EPAVLAQKME MPEEKIRKIL KISKEPISME TPIGDDEDSH LGDFIEDSAT MAPADAAVYA SLRDVTKDIL DSLTPREAKV LRMRFGIEMN TDHTLEEVGK QFDVTRERIR QIEAKALRKL RHPSRSERLR SFLDTEG
|
| |