Gene Nmul_A2057 details


Gene Information

Locus tag: Nmul_A2057
Symbol:
ID: 3784375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2348764
End bp: 2350887
Gene Length: 2124 bp
Protein Length: 707 aa
Translation table: 11
GC content: 54%
IMG OID: 637812146
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_412743
Protein GI: 82703177
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
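
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, though they agree with the values listed). The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2348764, 2350887)
print(length)                  # 2124
print(protein_length(length))  # 707
```

Both results match the Gene Length and Protein Length fields of this record.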


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00133668
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATCA AGGAATCTGA AGATATGAAG AAGCTGTTGA GAGCGCAGGC GAAAGAAGCG 
GCAGCAAAAG AACTGACGAA GGTGGATGAA GAAATGTTCG ATAATGACAG TGAAACGGAA
ACGGATTCAA TGGCTAAAGC TCCGGAGCAG GTGACCGAGA TTGCGAAGGC AGTGATGGCG
GCGAAAGAGC CCACCCTCGC CAATGGCAAG ATCGCAAGAT CAAGAGCGGC CCGGGAAGGA
AAAAACTCGC ACGCCAAGGA TCCGGAAAAA GCTGCCGCCT CCCAGGAGCT CGAGGCGCGC
CGCATGCGTC TCAAGAATCT GATCGTGCAG GGCAAGGAAC GCGGCTATCT CACTTACGCC
GAAATCAATG ATCATCTCCC CGACGATATG CTCGACGCGG AGCAGATCGA GAACATCATC
AGCATGATCA ATGACGTCGG GATTTCCGTT TATGACGAGG CGCCCGACGC GGAAACGCTG
CTCATGTCTG AAACTGCCCC TACCGTGGCC GATGAGGATG TGGTGGAGGA AGCGGAAGCC
GCGCTTTCCA CTGTGGATTC GGAGTTCGGG CGCACAACCG ACCCTGTCCG GATGTATATG
CGCGAAATGG GTTCCGTGGA ACTGCTGACG CGTGAGAGCG AAATCGAAAT CGCAAAACGT
ATCGAGGACG GCTTGAAGCA CATGATACAG GCGATTTCCG CCTGTCCGAC AACCATAGCC
GGAATTCTCG AATTCGCCGA CAGAGTGTCG AAAGACGAGA TGCGTGTGGA TGAGCTGGTG
GATGGGTTGC TCGATCCCAG CACAGAGGAA ATTATCAGCG AAGAGATTTC CGATGAATCC
CTGGAACAGG AATTGAACTC GGATGCGGAA GATGAGGATG TCACCGCAGT CGCAAATGCC
AACCTCCTCA AGCTGAAAAA TGACGCGCTG GAGCGTTTTG CAGTGGTTCA AAAGGCTTAT
GACGAGATGC AGAAGGTGCT CGAAAAGAAA GGATCGGGCA ACAAGGCCTA TAAAGACATC
CAGGAGCAGA TTTCGTCCGA GCTGATGGCT ATCCGTTTCT CTGCCAAAAT GGTTGAACGG
TTGTGCGATA CGCAGCGGGC ACTGGTGGAT GAAATGCGCG GTTACGAGCG AAAAATAATG
GAGCTTTGCG TAAGCAAGGT GGGAATGTCG CGTAACCACT TCATCAAGAC CTTCCCCGGT
AACGAGAGTA ACCTGAACTG GGTGGATGAG GAGATTGCAC TCGGCAAACC CTACAGCGCA
GCCCTGGAAC GTTATCGTCC CGAGATTGTG GAACAGCAAC AGAATCTGCT GGCACTGCAA
AAGCAGGTAG GCATTCCCTT GAAGGAACTC AAGGAAATCA ACCGCCGCAT GTCCACGGGT
GAGGCGAAGG CGCGCCGGGC CAAACGTGAA ATGACCGAAG CAAATTTGCG ACTGGTGATT
TCCATCGCTA AAAAATATAC CAATCGGGGA TTGCAGTTCC TCGATCTCAT TCAGGAAGGC
AACATCGGCC TGATGAAGGC AGTCGATAAA TTCGAATACC GGCGGGGATA CAAGTTTTCC
ACCTACGCAA CCTGGTGGAT TCGTCAGGCC ATCACACGTT CCATTGCGGA TCAGGCGCGT
ACCATCCGTA TCCCGGTGCA CATGATCGAA ACGATTAACA AGATGAACCG CATTTCCCGC
CAGATCCTGC AGGAAACCGG GCAGGAGCCG GAGCCCGCCG TCCTCGCACA GAAAATGGAA
ATGCCGGAAG AGAAGATTCG TAAAATCCTC AAGATTTCCA AGGAACCAAT TTCCATGGAG
ACCCCGATCG GAGACGACGA AGATTCTCAT CTCGGGGATT TCATCGAGGA TTCAGCTACC
ATGGCTCCTG CGGATGCGGC AGTTTATGCC AGCCTGCGCG ATGTTACGAA AGATATACTG
GATTCGCTGA CTCCGCGCGA AGCAAAAGTA CTGCGCATGC GCTTCGGCAT CGAAATGAAT
ACCGACCACA CGCTGGAGGA AGTCGGCAAG CAGTTCGACG TAACGCGCGA GCGCATCCGC
CAGATCGAGG CCAAGGCACT GCGCAAGCTG CGCCATCCGT CCCGTTCCGA GCGCCTGCGC
AGCTTCCTGG ATACTGAAGG CTGA
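
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the record. The following is a rough sketch of one way to do that. The helper names `normalize` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def normalize(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (blocks are joined first)."""
    seq = normalize(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10 bases).
first_line = "ATGGCAATCA AGGAATCTGA AGATATGAAG AAGCTGTTGA GAGCGCAGGC GAAAGAAGCG"
seq = normalize(first_line)
print(len(seq))  # 60
print(f"GC fraction of the first line: {gc_fraction(first_line):.2f}")
```

Passing the full gene sequence to `normalize` should give a string whose length equals the Gene Length field, and `gc_fraction` on the same input can be compared with the GC content field.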
 
Protein sequence
MAIKESEDMK KLLRAQAKEA AAKELTKVDE EMFDNDSETE TDSMAKAPEQ VTEIAKAVMA 
AKEPTLANGK IARSRAAREG KNSHAKDPEK AAASQELEAR RMRLKNLIVQ GKERGYLTYA
EINDHLPDDM LDAEQIENII SMINDVGISV YDEAPDAETL LMSETAPTVA DEDVVEEAEA
ALSTVDSEFG RTTDPVRMYM REMGSVELLT RESEIEIAKR IEDGLKHMIQ AISACPTTIA
GILEFADRVS KDEMRVDELV DGLLDPSTEE IISEEISDES LEQELNSDAE DEDVTAVANA
NLLKLKNDAL ERFAVVQKAY DEMQKVLEKK GSGNKAYKDI QEQISSELMA IRFSAKMVER
LCDTQRALVD EMRGYERKIM ELCVSKVGMS RNHFIKTFPG NESNLNWVDE EIALGKPYSA
ALERYRPEIV EQQQNLLALQ KQVGIPLKEL KEINRRMSTG EAKARRAKRE MTEANLRLVI
SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYRRGYKFS TYATWWIRQA ITRSIADQAR
TIRIPVHMIE TINKMNRISR QILQETGQEP EPAVLAQKME MPEEKIRKIL KISKEPISME
TPIGDDEDSH LGDFIEDSAT MAPADAAVYA SLRDVTKDIL DSLTPREAKV LRMRFGIEMN
TDHTLEEVGK QFDVTRERIR QIEAKALRKL RHPSRSERLR SFLDTEG