Gene GSU3089 details


Gene Information

Locus tag: GSU3089
Symbol: rpoD
ID: 2686774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3394412
End bp: 3396145
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 58%
IMG OID: 637127782
Product: RNA polymerase sigma factor RpoD
Protein accession: NP_954130
Protein GI: 39998179
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
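
The coordinate and length fields in this record are related by simple arithmetic, which a short check can make explicit. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon. The function names `span_length` and `coding_residues` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3394412
END_BP = 3396145
GENE_LENGTH_BP = 1734
PROTEIN_LENGTH_AA = 577


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))    # 1734
print(coding_residues(GENE_LENGTH_BP))  # 577
```

Under those assumptions, both results agree with the reported gene length (1734 bp) and protein length (577 aa).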


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGCCAAGA AAAGCACGGA CGAAGTCAAG CAACTCATCG ACCTGGGAAA GGAAAAGGGA 
TTCCTCACCT ATGATGAGGT AAACGACCTT CTTCCTCCCG ATATTTCATC CGAGCAGATC
GACGACGTAA TGAGCATGTT CGGTGAGATG GATATTGACA TCGTCGATTC GGCCCAGAAG
GTAAAAATCC CCAAGATGAA ACTCGACCTG GAGGAAGAGG AAGAGATTGA AGGGGAACAG
GAAGACTTCG AGTACGAGCC GGGCGCTCTG GGAAGGACAA GCGATCCGGT ACGGATGTAT
CTGCGCGAGA TGGGATCGGT GTCGCTACTC ACCCGCGAGG GTGAGGTGGA AATTGCCAAG
CGTATTGAGG AGGGCGAGCG GGATATCGCC GGCGTCATCC TCAATACCCC CATCACGGTG
AAGGAGATCA TCGCACTTGG CGAGAAGCTC CTGAGGGGGC AGGTCAGCGC CGCCGAGATC
AGCAAGGAGG TGGAGGAAGA GGAGCTGGAG GAGGATGAAG AGGATGTGCA GAAGACCCGT
CTTCTTGCCC AGGTGGAGGA AATTGCCGCC GTCGATGTCC GCCTTGCTGC CCTGAATGCC
GAGCTGGAGG AGGAATCCCT TTCCGCCTCC CGTCGGCAGG AGTTGCTCGC TGAACGGCAG
GAGTTGAAGG AAAAGCTTGC CGAGATGGTT ACCTCCCTGC GTCTCAAGGA TCGGCACATA
GCCAAGATCG CCCAGCGGCT CAAGGAGTTG TCGGCCAAGG TTGATACCAT TATGGCGGAG
ATTGCCGCCA TTGAGAAGGA GGCGGGAGTC TCCGCCGACA CCCTCAAGGA GATTGCCGCC
AGCGAAGCTG TGGGAAAGGG GATCAAGCTT TCTCTGGAAG ATGCTCAGAA GCACGAGAAG
AAGGTGCGCT CAGCTGAGAA GAAACTGAAG AAGGTTGAGG AAGAGTCCGG GTTCAAGGCC
CGTGAGCTTT CCGACGCCCT CAGGGCCATC GATCGGGGTG AAGCCAAGTC GAAGCTGGCC
AAGTCGGAAC TGGTGGAGGC GAACCTGCGG CTCGTGGTAT CCATTGCCAA GAAGTACACC
AACCGGGGGC TCCAGTTTCT CGACCTCATT CAGGAGGGGA ACATCGGTCT CATGAAGGCG
GTGGACAAGT TTGAGTACCA GCGCGGGTAC AAGTTCTCGA CGTACGCCAC TTGGTGGATT
CGACAGGCCA TTACCCGAGC TATCGCCGAC CAGGCCCGCA CCATCCGGAT TCCGGTCCAT
ATGATCGAGA CCATCAACAA GCTGATCCGG ACCAGTCGCC AACTGGTGCA GGAAATCGGT
CGTGAGCCGT CGCCGGAGGA AATCGCCGAA CGGATGAATC TCCCCCTCGA CAAGGTACGC
AAGGTCCTCA AGATCGCCAA GGAGCCCATC TCACTCGAAA CTCCCATCGG GGAAGAGGAA
GATTCACACC TGGGGGATTT CATCGAAGAC AAGGGTGTGG TTTCGCCTCT GGAGGCGGTC
ATCAAGGCCA ACCTTTCGGA GCAGACCGCC CGCGTGCTTG CCACCCTCAC CCCCCGGGAG
GAAAAAGTTC TGCGGATGCG TTTCGGCATC GGCGAGAAGA GCGATCATAC CCTTGAGGAG
GTGGGGCAGG ACTTCGAGGT GACCCGGGAG CGGATTCGTC AGATCGAGGC CAAGGCCCTG
CGCAAGCTCC GCCATCCGAG CCGGGCCAAG AAACTCCGCA GCTTCGTGGA ATAG
 
Protein sequence
MAKKSTDEVK QLIDLGKEKG FLTYDEVNDL LPPDISSEQI DDVMSMFGEM DIDIVDSAQK 
VKIPKMKLDL EEEEEIEGEQ EDFEYEPGAL GRTSDPVRMY LREMGSVSLL TREGEVEIAK
RIEEGERDIA GVILNTPITV KEIIALGEKL LRGQVSAAEI SKEVEEEELE EDEEDVQKTR
LLAQVEEIAA VDVRLAALNA ELEEESLSAS RRQELLAERQ ELKEKLAEMV TSLRLKDRHI
AKIAQRLKEL SAKVDTIMAE IAAIEKEAGV SADTLKEIAA SEAVGKGIKL SLEDAQKHEK
KVRSAEKKLK KVEEESGFKA RELSDALRAI DRGEAKSKLA KSELVEANLR LVVSIAKKYT
NRGLQFLDLI QEGNIGLMKA VDKFEYQRGY KFSTYATWWI RQAITRAIAD QARTIRIPVH
MIETINKLIR TSRQLVQEIG REPSPEEIAE RMNLPLDKVR KVLKIAKEPI SLETPIGEEE
DSHLGDFIED KGVVSPLEAV IKANLSEQTA RVLATLTPRE EKVLRMRFGI GEKSDHTLEE
VGQDFEVTRE RIRQIEAKAL RKLRHPSRAK KLRSFVE
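
The protein sequence above is the translation of the gene sequence under translation table 11. The following is a minimal sketch of how that correspondence, and the reported GC content, could be checked. It assumes that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the permitted start codons) and that the 10-base blocks are separated only by whitespace. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments assumed for table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGCCAAGA AAAGCACGGA CGAAGTCAAG CAACTCATCG ACCTGGGAAA GGAAAAGGGA"
last_line = "CGCAAGCTCC GCCATCCGAG CCGGGCCAAG AAACTCCGCA GCTTCGTGGA ATAG"

print(translate(first_line))  # MAKKSTDEVKQLIDLGKEKG
print(translate(last_line))   # RKLRHPSRAKKLRSFVE*
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the 577 aa protein sequence and the reported GC content of 58%.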