Gene GM21_1557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1557 
Symbol 
ID8136887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1815755 
End bp1816756 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content59% 
IMG OID644869170 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003021371 
Protein GI253700182 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value0.389546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACG AAGAGCATTT AGAGGAGAAG GAACCTCTAT TCCTGGACGC CGAGCAGGAC 
CCCGACTCAC TCGAACTGGA GGAAGTCGAG GCTGAAGCCG ATGAACCTGC CGAGGTAGAG
GAAGAGGAAA TCAAGGCCGC GGTGGTGGAG CATTTCGATG ACGCCATCAA GCTCTACCTG
CGCGAAATCC AGAAGACCAA GCTCCTCACC GCCGACGAGG AGAAGGAGTT GGCCGCGCGC
ATTGACCTGG GCGACAAGGC CGCCCGGGAC CGGATGATCG TCTCCAACCT CCGCCTGGTG
GTGAAGATCG CCAAGCGCTA CATAAACCGC GGGCTCCCCT TCCTGGACCT GATCGAAGAA
GGGAACATGG GGCTCATCAA GGCGGTCGAG CGCTTCAAGC TCTCCAAGGA GTGCCGCTTC
TCCACCTACG CCACCTGGTG GATCCGGCAG TCCATCGAGC GCGCGCTGGT GAACCAGTCG
CGCACTATCC GTCTGCCGGT GCACGTCTCC GACGACATCA ACAAGATGCT AAGGGTGACG
CGCGAGCTGG TGCAGAAGAT GAACCGCGAG CCGACCATCA AGGAAGTCGC CGACACCCTT
GAAGTGAACA TCACCTACGT GCGCAGGCTC ATGGTCCTCT TGAAGAAGAC CTACTCCATC
GAGCGTCCCA TGGGGGAGAA CAACGACTAC TTCCTCATCG ACACCATAGA GGACACCTCC
ACCATATCGC CCGCGGTACT TCTGGAAGAC CTCAACAAGT ACGAGCTGGT CTCCAAGTGG
TTCGAGACCC TCTCCGACGC CGAGAAAAAG ATACTCACGC TCCGTTTCGG TCTCGACGAC
AAGGACCCCC AGACCCTCGA CACCATCGGG CGCAGCTTCG GGGTGACCCG CGAAAGGATC
AGGCAGATCG AGGCGAAATC GCTGGAAAAG CTGAGAAAGA TAGTGGAAGC GACCGACATC
ATGGGGCGCC CGGCCGTCCC CCCGACAACT ACAGGCACAT AA
 
Protein sequence
MENEEHLEEK EPLFLDAEQD PDSLELEEVE AEADEPAEVE EEEIKAAVVE HFDDAIKLYL 
REIQKTKLLT ADEEKELAAR IDLGDKAARD RMIVSNLRLV VKIAKRYINR GLPFLDLIEE
GNMGLIKAVE RFKLSKECRF STYATWWIRQ SIERALVNQS RTIRLPVHVS DDINKMLRVT
RELVQKMNRE PTIKEVADTL EVNITYVRRL MVLLKKTYSI ERPMGENNDY FLIDTIEDTS
TISPAVLLED LNKYELVSKW FETLSDAEKK ILTLRFGLDD KDPQTLDTIG RSFGVTRERI
RQIEAKSLEK LRKIVEATDI MGRPAVPPTT TGT