Gene Information |
Locus tag | GM21_3789 |
Symbol | |
ID | 8139163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4362595 |
End bp | 4364349 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871408 |
Product | RNA polymerase, sigma 70 subunit, RpoD subfamily |
Protein accession | YP_003023566 |
Protein GI | 253702377 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
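The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes one terminal stop codon; both assumptions are consistent with the values shown here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for GM21_3789.
length = gene_length(4362595, 4364349)
print(length)                  # 1755
print(protein_length(length))  # 584
```

Both results match the Gene Length (1755 bp) and Protein Length (584 aa) fields listed in the record.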
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 118 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAAGA AAAGCATGGA CGAAGTGAAG CAGCTCATCG ACCTGGGCAA GGAAAAGGGT TTTCTCACCT ATGAAGAGGT GAACGACCTG CTTCCCCCTG ACATCGTCTC CTCCGACCAG ATCGACGACG TCATGAGCAT GTTTGGCGAT ATGGACATCG AGATAGTCGA CTCAGCCCAG AAAGTAAAGA TCCCGAAGAT CAAGATGGAT CTGGAAGAAG AGGAAGAGCA CGAAGGGAGT GAGGAGGAAG TCGAGTTCGA GCCCGGCACC CTCGGTCGTA CCAGCGATCC CGTTCGCATG TACCTGCGCG AGATGGGGTC CGTTTCGCTT TTGACCCGCG AGGGAGAGGT AGAGATCGCC AAGAGGATCG AGGTGGGCGA GCGCGACGTC GCCAGCGTCA TCCTGAACAC TCCGATCACG GTGAGAGAGG TCGTCTCCCT TGGTGAGCGT CTCAGGAAGC AGCAGATCGG CGCCATCGAG ATCTCCAAGG ACGTCGAGGA AGAGGTGCTG GAGGAGGGGG AAGAGGATCT CCAGGCCCTC AAGGTGCTCA CCATCATCGA CGAGATCAAG GAAATCGAAC AGCGGATGAG CGAGATCCAG TCTGCTCTGG AGGCCGAGAA GGTCGCCGCA AAGGAGCGCG AGGCTCTGGG CGCCGAGCAC GCCGAGCTCA AGGTGAAGAT GGCGGAGACC CTCAAGTCGC TGCGCCTGAA GGACCGCCAC ATCGAGAAGA TCGCCCAGCG CCTGAAGGAG CTTTCCTGCA AGGTCGACAC GGTCATGCAG GAGATCGCCG AGCTGGAGAA GGAAAGCGGC GCCGAGAGGG AGCCTTTCCT TACCGCCTTC GAAGGGGCGA AGGGGGGCTC CGAGGCGGAC TTCCAGAAGA AGCTGAACAT GACGCTGGAG GAGGGGCAGA AGCTCGAGAA GCGCTTCCGT TCCAGCGAAG CGAAATTGAA GAAGATCGAG CAGGAGTCCG GCTTCAAGGC GAGCGAGCTC GCCAACGCCC TGCTCGCCAT CGAAGAAGGT GAGCACAAGG CGAAGCTGGC CAAGAGCGAA CTGGTCGAGG CCAACCTCCG CCTGGTCGTC TCCATTGCCA AGAAGTACAC CAACCGCGGT CTGCAGTTCC TGGACCTGAT CCAGGAAGGG AACATCGGCC TCATGAAGGC TGTCGATAAG TTCGAGTACC AGCGAGGTTA CAAGTTCTCG ACCTACGCCA CCTGGTGGAT CCGCCAGGCC ATCACCCGCG CCATCGCGGA CCAGGCCCGT ACCATCAGGA TCCCGGTGCA CATGATCGAG ACCATCAACA AGCTGATCCG TACCAGCCGT CAGCTGGTGC AGGAGATCGG CCGCGAGCCC TCCCCCGAGG AAATCGCCGA GCGCATGGCG CTGCCGCTGG ACAAGGTGCG CAAGGTCCTG AAAATCGCCA AGGAGCCGAT CTCCCTGGAG ACCCCGATCG GCGAGGAAGA AGATTCCCAT CTTGGGGACT TCATCGAGGA CAAGGGTGTG GTCTCCCCCC TGGAGGCGGT GATCAAGGCG AACCTTTCCG AGCAGACCTC CCGGGTGCTC TCCACCCTCA CCCCCCGCGA GGAAAAGGTG CTCCGGATGC GCTTCGGCAT CGGCGAGAAG AGCGACCATA CCCTCGAGGA GGTGGGCCAG GACTTCGAGG TCACGCGCGA AAGGATCCGG CAGATCGAGG CGAAGGCGCT CAGGAAGCTG CGCCATCCCA GCCGGGCCAA GAAGCTGAAA AGCTTCGTAG AGTAA
|
Protein sequence | MAKKSMDEVK QLIDLGKEKG FLTYEEVNDL LPPDIVSSDQ IDDVMSMFGD MDIEIVDSAQ KVKIPKIKMD LEEEEEHEGS EEEVEFEPGT LGRTSDPVRM YLREMGSVSL LTREGEVEIA KRIEVGERDV ASVILNTPIT VREVVSLGER LRKQQIGAIE ISKDVEEEVL EEGEEDLQAL KVLTIIDEIK EIEQRMSEIQ SALEAEKVAA KEREALGAEH AELKVKMAET LKSLRLKDRH IEKIAQRLKE LSCKVDTVMQ EIAELEKESG AEREPFLTAF EGAKGGSEAD FQKKLNMTLE EGQKLEKRFR SSEAKLKKIE QESGFKASEL ANALLAIEEG EHKAKLAKSE LVEANLRLVV SIAKKYTNRG LQFLDLIQEG NIGLMKAVDK FEYQRGYKFS TYATWWIRQA ITRAIADQAR TIRIPVHMIE TINKLIRTSR QLVQEIGREP SPEEIAERMA LPLDKVRKVL KIAKEPISLE TPIGEEEDSH LGDFIEDKGV VSPLEAVIKA NLSEQTSRVL STLTPREEKV LRMRFGIGEK SDHTLEEVGQ DFEVTRERIR QIEAKALRKL RHPSRAKKLK SFVE
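The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, using hypothetical helper names, that removes the block spacing and computes the GC fraction of a nucleotide string. It is shown only on the first three blocks of the gene sequence; running it on the full sequence should give a value comparable to the GC content field, assuming that field is the rounded fraction of G and C bases.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence from this record.
first_blocks = "ATGGCCAAGA AAAGCATGGA CGAAGTGAAG"
seq = clean_sequence(first_blocks)
print(len(seq))  # 30
print(round(gc_fraction(seq), 3))
```

The same `clean_sequence` helper also works for the protein sequence, since it only strips whitespace.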