Gene GM21_3789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3789 
Symbol 
ID8139163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4362595 
End bp4364349 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content60% 
IMG OID644871408 
ProductRNA polymerase, sigma 70 subunit, RpoD subfamily 
Protein accessionYP_003023566 
Protein GI253702377 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones118 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGA AAAGCATGGA CGAAGTGAAG CAGCTCATCG ACCTGGGCAA GGAAAAGGGT 
TTTCTCACCT ATGAAGAGGT GAACGACCTG CTTCCCCCTG ACATCGTCTC CTCCGACCAG
ATCGACGACG TCATGAGCAT GTTTGGCGAT ATGGACATCG AGATAGTCGA CTCAGCCCAG
AAAGTAAAGA TCCCGAAGAT CAAGATGGAT CTGGAAGAAG AGGAAGAGCA CGAAGGGAGT
GAGGAGGAAG TCGAGTTCGA GCCCGGCACC CTCGGTCGTA CCAGCGATCC CGTTCGCATG
TACCTGCGCG AGATGGGGTC CGTTTCGCTT TTGACCCGCG AGGGAGAGGT AGAGATCGCC
AAGAGGATCG AGGTGGGCGA GCGCGACGTC GCCAGCGTCA TCCTGAACAC TCCGATCACG
GTGAGAGAGG TCGTCTCCCT TGGTGAGCGT CTCAGGAAGC AGCAGATCGG CGCCATCGAG
ATCTCCAAGG ACGTCGAGGA AGAGGTGCTG GAGGAGGGGG AAGAGGATCT CCAGGCCCTC
AAGGTGCTCA CCATCATCGA CGAGATCAAG GAAATCGAAC AGCGGATGAG CGAGATCCAG
TCTGCTCTGG AGGCCGAGAA GGTCGCCGCA AAGGAGCGCG AGGCTCTGGG CGCCGAGCAC
GCCGAGCTCA AGGTGAAGAT GGCGGAGACC CTCAAGTCGC TGCGCCTGAA GGACCGCCAC
ATCGAGAAGA TCGCCCAGCG CCTGAAGGAG CTTTCCTGCA AGGTCGACAC GGTCATGCAG
GAGATCGCCG AGCTGGAGAA GGAAAGCGGC GCCGAGAGGG AGCCTTTCCT TACCGCCTTC
GAAGGGGCGA AGGGGGGCTC CGAGGCGGAC TTCCAGAAGA AGCTGAACAT GACGCTGGAG
GAGGGGCAGA AGCTCGAGAA GCGCTTCCGT TCCAGCGAAG CGAAATTGAA GAAGATCGAG
CAGGAGTCCG GCTTCAAGGC GAGCGAGCTC GCCAACGCCC TGCTCGCCAT CGAAGAAGGT
GAGCACAAGG CGAAGCTGGC CAAGAGCGAA CTGGTCGAGG CCAACCTCCG CCTGGTCGTC
TCCATTGCCA AGAAGTACAC CAACCGCGGT CTGCAGTTCC TGGACCTGAT CCAGGAAGGG
AACATCGGCC TCATGAAGGC TGTCGATAAG TTCGAGTACC AGCGAGGTTA CAAGTTCTCG
ACCTACGCCA CCTGGTGGAT CCGCCAGGCC ATCACCCGCG CCATCGCGGA CCAGGCCCGT
ACCATCAGGA TCCCGGTGCA CATGATCGAG ACCATCAACA AGCTGATCCG TACCAGCCGT
CAGCTGGTGC AGGAGATCGG CCGCGAGCCC TCCCCCGAGG AAATCGCCGA GCGCATGGCG
CTGCCGCTGG ACAAGGTGCG CAAGGTCCTG AAAATCGCCA AGGAGCCGAT CTCCCTGGAG
ACCCCGATCG GCGAGGAAGA AGATTCCCAT CTTGGGGACT TCATCGAGGA CAAGGGTGTG
GTCTCCCCCC TGGAGGCGGT GATCAAGGCG AACCTTTCCG AGCAGACCTC CCGGGTGCTC
TCCACCCTCA CCCCCCGCGA GGAAAAGGTG CTCCGGATGC GCTTCGGCAT CGGCGAGAAG
AGCGACCATA CCCTCGAGGA GGTGGGCCAG GACTTCGAGG TCACGCGCGA AAGGATCCGG
CAGATCGAGG CGAAGGCGCT CAGGAAGCTG CGCCATCCCA GCCGGGCCAA GAAGCTGAAA
AGCTTCGTAG AGTAA
 
Protein sequence
MAKKSMDEVK QLIDLGKEKG FLTYEEVNDL LPPDIVSSDQ IDDVMSMFGD MDIEIVDSAQ 
KVKIPKIKMD LEEEEEHEGS EEEVEFEPGT LGRTSDPVRM YLREMGSVSL LTREGEVEIA
KRIEVGERDV ASVILNTPIT VREVVSLGER LRKQQIGAIE ISKDVEEEVL EEGEEDLQAL
KVLTIIDEIK EIEQRMSEIQ SALEAEKVAA KEREALGAEH AELKVKMAET LKSLRLKDRH
IEKIAQRLKE LSCKVDTVMQ EIAELEKESG AEREPFLTAF EGAKGGSEAD FQKKLNMTLE
EGQKLEKRFR SSEAKLKKIE QESGFKASEL ANALLAIEEG EHKAKLAKSE LVEANLRLVV
SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYQRGYKFS TYATWWIRQA ITRAIADQAR
TIRIPVHMIE TINKLIRTSR QLVQEIGREP SPEEIAERMA LPLDKVRKVL KIAKEPISLE
TPIGEEEDSH LGDFIEDKGV VSPLEAVIKA NLSEQTSRVL STLTPREEKV LRMRFGIGEK
SDHTLEEVGQ DFEVTRERIR QIEAKALRKL RHPSRAKKLK SFVE