Gene GM21_3390 details


Gene Information

Locus tag: GM21_3390
Symbol:
ID: 8138757
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3921038
End bp: 3922585
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 62%
IMG OID: 644871008
Product: RNA polymerase, sigma 54 subunit, RpoN
Protein accession: YP_003023173
Protein GI: 253701984
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
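
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3921038
end_bp = 3922585
gene_length_bp = 1548
protein_length_aa = 515


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, the interval spans 1548 bp and 1548 bp corresponds to 516 codons, that is, 515 residues plus one stop codon, which agrees with the listed lengths.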


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 121
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATAG AGATGCGCCA GCAGATGAAA ATGAGTCAGC AACTGGTGAT GACGCCCCAG 
TTGCAGCAGG CCATCAAGCT CCTCCAGCTT TCCCGGCTGG AGTTGCAGGA CGTAGTACGT
CAGGAGTTGG AGGAGAACCC CATACTCGAC GAGGTGATCG AGCAGGAGGA GATCCGGGAA
CCCGAGCAGA TCGAATTGCG CGAGAAGGAA GCCGAGCCGG AGGCCGCCGC GAGCGATTTC
CAGGAAGTGC GGGCCGGCGA GGAGACGCGC GAGGCGGACT GGGATTCCTA CATAGACGGC
TACAACTACA GCTCCGGCGA GCAGTACTAC GACGACGAGG ACCGTCCCTC CTTCGAGAAC
CTTCTCACCA AGAAATCCAC CCTGTTCGAC CACCTGATGT GGCAGTTGAG CCTCACCCGT
CTCACGGAGC GCGAGATGGC GGTGGGAGCC GAGATCATCG GCAACATCGA CGAGGAGGGG
TACCTCCGCG CCTCCCTCGA GGACGTAGCG TCGGCCTGCG TGCAGGTAAC CCCGTTCCAG
GAAGAGATGC TCGAGTGGTC GGGGCTTACC AGCGACGCCT GCGAGGAAGA GATAGCCGAT
GCGGCGGGCG GTTTCTCCAC TACCGTGCTG GTTCCGCTGG TCGATTCGGT GCTGAAGCGG
ATCCAAGAGT TCGACCCGGT GGGCGTCGGG GCCCGCGACC TGCGCGAGTG CCTCCTGATC
CAGGTGGGTA GCCTCGGCAT GGGGGGGAGC CTCGTGGAGT CGCTGTTGCG CGACCACCTG
AAGGATCTGG AGAGCCACAA GTACAAGCAG GCCGCGAAGG TGCTGGGGGT GGATGTGAAC
GACATCCTCG CCGCCACGAG GATCATCGCG GAACTCGATC CCAAGCCCGG CCGGGTCTTC
GGCAGCGACG ACGTGCAGTA CATCTCGGCC GACATCTTCG TGCACAAGGT GGGTGACGAG
TACGTGGTGA TGCTGAACGA CGAGGGGATG CCCAACTTGA GGATCAACCC CATCTACGCC
CCCGAGGCGA AGAGCAGCCG TCCGGTCGAC AAGGTGGCCG AGGATTACAT CGGCGAGAAG
ATGCGCTCCG CCCTGTGGCT CATCAAGAGC ATCCAGCAGC GCCAGCGCAC CATCTTCAAG
GTGGCCAAGA GCATCGTGAA GTTCCAGCGC GACTTTCTCG ACCGCGGCAT CGAGCATCTG
CGCCCGCTGG TGTTGAGGGA CATCGCCGAG GACATCGGCA TGCACGAGTC CACCATCAGC
CGGGTCACCA CCAACAAATA CATGCAGACC CCGCAAGGGC TCTTCGAGCT GAAGTACTTC
TTCAACTCCG GCATCTCGAC CGGGGAGGGG GACTTCATCG CCTCCGAGAG CGTGAAGAGC
AAGATCAAGG AACTGGTGGA CAACGAGGAC TCCAAGCGCC CCTACAGCGA TCAGCGCCTG
GCGGAACTCC TCTCGGACCA CAACATCGTC ATCGCCCGCC GCACCGTTAC CAAGTATCGC
GAGATGCTTC GCATCGGCTC GTCCTCGGAG CGCAAGAAGC ATTTCTAA
 
Protein sequence
MAIEMRQQMK MSQQLVMTPQ LQQAIKLLQL SRLELQDVVR QELEENPILD EVIEQEEIRE 
PEQIELREKE AEPEAAASDF QEVRAGEETR EADWDSYIDG YNYSSGEQYY DDEDRPSFEN
LLTKKSTLFD HLMWQLSLTR LTEREMAVGA EIIGNIDEEG YLRASLEDVA SACVQVTPFQ
EEMLEWSGLT SDACEEEIAD AAGGFSTTVL VPLVDSVLKR IQEFDPVGVG ARDLRECLLI
QVGSLGMGGS LVESLLRDHL KDLESHKYKQ AAKVLGVDVN DILAATRIIA ELDPKPGRVF
GSDDVQYISA DIFVHKVGDE YVVMLNDEGM PNLRINPIYA PEAKSSRPVD KVAEDYIGEK
MRSALWLIKS IQQRQRTIFK VAKSIVKFQR DFLDRGIEHL RPLVLRDIAE DIGMHESTIS
RVTTNKYMQT PQGLFELKYF FNSGISTGEG DFIASESVKS KIKELVDNED SKRPYSDQRL
AELLSDHNIV IARRTVTKYR EMLRIGSSSE RKKHF
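
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the method used to produce this record. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits), and it translates only the first 30 bases and the final line of the gene sequence, both copied from above. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. These are the standard
# assignments; table 11 uses the same ones and differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Simplification: a non-ATG start codon would need special handling
    (it is read as M), which is not implemented because this gene starts with ATG.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Excerpts copied from the gene sequence above. The last line is in frame
# because the 1500 bases before it are a multiple of 3.
first_30_bases = "ATGGCAATAG AGATGCGCCA GCAGATGAAA"
last_line = "GAGATGCTTC GCATCGGCTC GTCCTCGGAG CGCAAGAAGC ATTTCTAA"

print(translate(first_30_bases))  # MAIEMRQQMK
print(translate(last_line))       # EMLRIGSSSERKKHF
```

The two outputs match the first ten and the last fifteen residues of the protein sequence listed above.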