Gene Gura_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_4047 
Symbol 
ID5166826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp4706383 
End bp4708143 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content56% 
IMG OID640551526 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001232764 
Protein GI148266058 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000898252 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAA AAAATATGGA TGAAGTAAAA CAACTGATTG ATCTGGGTAA GGAAAAAGGA 
TTTCTTACCT ATGAGGAGGT CAACGATCTC CTCCCCCCGG ACATTGTTTC TTCCGAGCAG
ATTGACGACG TCATGAGTAT GTTCGGGGAC ATGGATATCG AAATAGTCGA TTCCGCGCAA
AAGGTTAAGA TTCCCAAGGT CAAGCTTGAC CTTGAAGATG AAGAGGAACT GGAAGGCGAA
CAGGAAGAGG TGGAGTTTGA GCCGGGGACG CTCGGACGGA CCAGCGACCC TGTCCGGATG
TACCTGCGGG AAATGGGTTC CGTTTCCCTT CTGACCCGTG AAGGCGAGGT GGAAATAGCC
AAGAGGATCG AGGTCGGCGA GAGGGATGTG GCGAGTGTTA TCCTCAACAC CCCGATCACG
GTCAAAGAAG TGATCTCGCT TGGGGATAAG CTGCGCAAGT TGCAGATTGC GGCCATAGAG
ATAAGCAAAG AGGTCGAAGA GGAAGAGCTC GAGGAAGGTG AGGAAGATGT CCAGGCGACC
AGGCTCCTGA CCGTAATCGA TGAGATTCGT GCCGATGACA CGCGCATGGA AGAGCTTCAT
GCCGCCCTTG AGCAGGAGAC AGTAACCAAA AGCGAGCGGG AAAATATTTT GGCGGAACAG
CAGGAATTGA AGGCGAAAAT GGCCGATCTT CTGAAATCCC TCCGCCTCAA GGACCGCCAT
ATTGAGAAGA TATCCCAGCG GCTGAAGGAG CTTTCCTGGA AGGTGGACAA GGTGATGCAG
GAGATCGCCG ACCTGGAGAA GGAAGCCGGT GTTGCCCCTG CTGATCTGAT GCCTATGCTG
GACAAGATGC ATAAAGGTGC GGATGAGGAG AAGAAGGTCT TAAAGAAACT CGGGATTTCT
CTGGAAGAAG CGCAGAAGCT GGAGAAAAGA TTCAGGAACG CCGAGAAAAA GCTGAAAAAG
ATAGAGCAGG AGTCCGGCTT CCAGGCGACC GAGCTTTCCA CCGCCCTGCT GGCCATTGAA
GAGGGCGAAA GGAAGGCCAA GCTGGCCAAG TCGGAACTGG TGGAGGCCAA CCTCCGTCTC
GTAGTTTCCA TCGCCAAGAA GTACACAAAC CGTGGCCTGC AGTTTCTCGA TCTGATCCAG
GAAGGGAATA TCGGCCTGAT GAAGGCGGTG GACAAGTTCG AGTACCAGCG CGGCTACAAA
TTCTCGACCT ACGCCACCTG GTGGATCCGG CAGGCCATTA CCCGCGCCAT TGCCGACCAG
GCCCGCACCA TTCGCATTCC GGTCCACATG ATCGAGACCA TCAACAAACT GATCCGCACC
AGCCGCCAGC TGGTCCAGGA GATTGGCCGC GAGCCGTCGC CGGAGGAGAT TGCCGAACGG
ATGAGCCTGC CGCTGGACAA GGTGCGCAAG GTCCTCAAGA TCGCCAAGGA GCCGATCTCC
CTGGAGACAC CCATCGGCGA GGAAGAAGAT TCCCACCTGG GGGACTTCAT CGAGGACAAG
GGGGTTGTTT CTCCCCTGGA GGCGGTCATC AAGGCCAACC TTTCCGAACA GACCTCGCGG
GTGCTGTCCA CCCTCACCCC GCGCGAAGAA AAGGTGCTGC GGATGCGTTT CGGCATCGGT
GAGAAGAGCG ACCATACCCT GGAGGAGGTC GGTCAGGATT TCGAGGTGAC CCGTGAGCGT
ATCCGGCAGA TAGAGGCCAA GGCGCTGCGG AAGCTCCGCC ATCCGAGCCG GGCAAAGAAA
CTCAAGAGCT TCGTGGAATA G
 
Protein sequence
MAKKNMDEVK QLIDLGKEKG FLTYEEVNDL LPPDIVSSEQ IDDVMSMFGD MDIEIVDSAQ 
KVKIPKVKLD LEDEEELEGE QEEVEFEPGT LGRTSDPVRM YLREMGSVSL LTREGEVEIA
KRIEVGERDV ASVILNTPIT VKEVISLGDK LRKLQIAAIE ISKEVEEEEL EEGEEDVQAT
RLLTVIDEIR ADDTRMEELH AALEQETVTK SERENILAEQ QELKAKMADL LKSLRLKDRH
IEKISQRLKE LSWKVDKVMQ EIADLEKEAG VAPADLMPML DKMHKGADEE KKVLKKLGIS
LEEAQKLEKR FRNAEKKLKK IEQESGFQAT ELSTALLAIE EGERKAKLAK SELVEANLRL
VVSIAKKYTN RGLQFLDLIQ EGNIGLMKAV DKFEYQRGYK FSTYATWWIR QAITRAIADQ
ARTIRIPVHM IETINKLIRT SRQLVQEIGR EPSPEEIAER MSLPLDKVRK VLKIAKEPIS
LETPIGEEED SHLGDFIEDK GVVSPLEAVI KANLSEQTSR VLSTLTPREE KVLRMRFGIG
EKSDHTLEEV GQDFEVTRER IRQIEAKALR KLRHPSRAKK LKSFVE