Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_4047 |
Symbol | |
ID | 5166826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4706383 |
End bp | 4708143 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640551526 |
Product | RpoD family RNA polymerase sigma factor |
Protein accession | YP_001232764 |
Protein GI | 148266058 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000898252 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAA AAAATATGGA TGAAGTAAAA CAACTGATTG ATCTGGGTAA GGAAAAAGGA TTTCTTACCT ATGAGGAGGT CAACGATCTC CTCCCCCCGG ACATTGTTTC TTCCGAGCAG ATTGACGACG TCATGAGTAT GTTCGGGGAC ATGGATATCG AAATAGTCGA TTCCGCGCAA AAGGTTAAGA TTCCCAAGGT CAAGCTTGAC CTTGAAGATG AAGAGGAACT GGAAGGCGAA CAGGAAGAGG TGGAGTTTGA GCCGGGGACG CTCGGACGGA CCAGCGACCC TGTCCGGATG TACCTGCGGG AAATGGGTTC CGTTTCCCTT CTGACCCGTG AAGGCGAGGT GGAAATAGCC AAGAGGATCG AGGTCGGCGA GAGGGATGTG GCGAGTGTTA TCCTCAACAC CCCGATCACG GTCAAAGAAG TGATCTCGCT TGGGGATAAG CTGCGCAAGT TGCAGATTGC GGCCATAGAG ATAAGCAAAG AGGTCGAAGA GGAAGAGCTC GAGGAAGGTG AGGAAGATGT CCAGGCGACC AGGCTCCTGA CCGTAATCGA TGAGATTCGT GCCGATGACA CGCGCATGGA AGAGCTTCAT GCCGCCCTTG AGCAGGAGAC AGTAACCAAA AGCGAGCGGG AAAATATTTT GGCGGAACAG CAGGAATTGA AGGCGAAAAT GGCCGATCTT CTGAAATCCC TCCGCCTCAA GGACCGCCAT ATTGAGAAGA TATCCCAGCG GCTGAAGGAG CTTTCCTGGA AGGTGGACAA GGTGATGCAG GAGATCGCCG ACCTGGAGAA GGAAGCCGGT GTTGCCCCTG CTGATCTGAT GCCTATGCTG GACAAGATGC ATAAAGGTGC GGATGAGGAG AAGAAGGTCT TAAAGAAACT CGGGATTTCT CTGGAAGAAG CGCAGAAGCT GGAGAAAAGA TTCAGGAACG CCGAGAAAAA GCTGAAAAAG ATAGAGCAGG AGTCCGGCTT CCAGGCGACC GAGCTTTCCA CCGCCCTGCT GGCCATTGAA GAGGGCGAAA GGAAGGCCAA GCTGGCCAAG TCGGAACTGG TGGAGGCCAA CCTCCGTCTC GTAGTTTCCA TCGCCAAGAA GTACACAAAC CGTGGCCTGC AGTTTCTCGA TCTGATCCAG GAAGGGAATA TCGGCCTGAT GAAGGCGGTG GACAAGTTCG AGTACCAGCG CGGCTACAAA TTCTCGACCT ACGCCACCTG GTGGATCCGG CAGGCCATTA CCCGCGCCAT TGCCGACCAG GCCCGCACCA TTCGCATTCC GGTCCACATG ATCGAGACCA TCAACAAACT GATCCGCACC AGCCGCCAGC TGGTCCAGGA GATTGGCCGC GAGCCGTCGC CGGAGGAGAT TGCCGAACGG ATGAGCCTGC CGCTGGACAA GGTGCGCAAG GTCCTCAAGA TCGCCAAGGA GCCGATCTCC CTGGAGACAC CCATCGGCGA GGAAGAAGAT TCCCACCTGG GGGACTTCAT CGAGGACAAG GGGGTTGTTT CTCCCCTGGA GGCGGTCATC AAGGCCAACC TTTCCGAACA GACCTCGCGG GTGCTGTCCA CCCTCACCCC GCGCGAAGAA AAGGTGCTGC GGATGCGTTT CGGCATCGGT GAGAAGAGCG ACCATACCCT GGAGGAGGTC GGTCAGGATT TCGAGGTGAC CCGTGAGCGT ATCCGGCAGA TAGAGGCCAA GGCGCTGCGG AAGCTCCGCC ATCCGAGCCG GGCAAAGAAA CTCAAGAGCT TCGTGGAATA G
|
Protein sequence | MAKKNMDEVK QLIDLGKEKG FLTYEEVNDL LPPDIVSSEQ IDDVMSMFGD MDIEIVDSAQ KVKIPKVKLD LEDEEELEGE QEEVEFEPGT LGRTSDPVRM YLREMGSVSL LTREGEVEIA KRIEVGERDV ASVILNTPIT VKEVISLGDK LRKLQIAAIE ISKEVEEEEL EEGEEDVQAT RLLTVIDEIR ADDTRMEELH AALEQETVTK SERENILAEQ QELKAKMADL LKSLRLKDRH IEKISQRLKE LSWKVDKVMQ EIADLEKEAG VAPADLMPML DKMHKGADEE KKVLKKLGIS LEEAQKLEKR FRNAEKKLKK IEQESGFQAT ELSTALLAIE EGERKAKLAK SELVEANLRL VVSIAKKYTN RGLQFLDLIQ EGNIGLMKAV DKFEYQRGYK FSTYATWWIR QAITRAIADQ ARTIRIPVHM IETINKLIRT SRQLVQEIGR EPSPEEIAER MSLPLDKVRK VLKIAKEPIS LETPIGEEED SHLGDFIEDK GVVSPLEAVI KANLSEQTSR VLSTLTPREE KVLRMRFGIG EKSDHTLEEV GQDFEVTRER IRQIEAKALR KLRHPSRAKK LKSFVE
|
| |